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Psychology at the beginning of the twenty-first century has 
become a highly diverse field of scientific study and applied 
technology. Psychologists commonly regard their discipline 
as the science of behavior, and the American Psychological 
Association has formally designated 2000 to 2010 as the 
“Decade of Behavior.” The pursuits of behavioral scientists 
range from the natural sciences to the social sciences and em- 
brace a wide variety of objects of investigation. Some psy- 
chologists have more in common with biologists than with 
most other psychologists, and some have more in common 
with sociologists than with most of their psychological col- 
leagues. Some psychologists are interested primarily in the be- 
havior of animals, some in the behavior of people, and others 
in the behavior of organizations. These and other dimensions 
of difference among psychological scientists are matched by 
equal if not greater heterogeneity among psychological practi- 
tioners, who currently apply a vast array of methods in many 
different settings to achieve highly varied purposes. 

Psychology has been rich in comprehensive encyclope- 
dias and in handbooks devoted to specific topics in the field. 
However, there has not previously been any single handbook 
designed to cover the broad scope of psychological science 
and practice. The present 12-volume Handbook of Psychol- 
ogy was conceived to occupy this place in the literature. 
Leading national and international scholars and practitioners 
have collaborated to produce 297 authoritative and detailed 
chapters covering all fundamental facets of the discipline, 
and the Handbook has been organized to capture the breadth 
and diversity of psychology and to encompass interests and 
concerns shared by psychologists in all branches of the field. 

Two unifying threads run through the science of behavior. 
The first is a common history rooted in conceptual and em- 
pirical approaches to understanding the nature of behavior. 
The specific histories of all specialty areas in psychology 
trace their origins to the formulations of the classical philoso- 
phers and the methodology of the early experimentalists, and 
appreciation for the historical evolution of psychology in all 
of its variations transcends individual identities as being one 
kind of psychologist or another. Accordingly, Volume 1 in 
the Handbook is devoted to the history of psychology as 
it emerged in many areas of scientific study and applied 
technology. 


A second unifying thread in psychology is a commitment 
to the development and utilization of research methods 
suitable for collecting and analyzing behavioral data. With 
attention both to specific procedures and their application 
in particular settings, Volume 2 addresses research methods 
in psychology. 

Volumes 3 through 7 of the Handbook present the sub- 
stantive content of psychological knowledge in five broad 
areas of study: biological psychology (Volume 3), experi- 
mental psychology (Volume 4), personality and social psy- 
chology (Volume 5), developmental psychology (Volume 6), 
and educational psychology (Volume 7). Volumes 8 through 
12 address the application of psychological knowledge in 
five broad areas of professional practice: clinical psychology 
(Volume 8), health psychology (Volume 9), assessment psy- 
chology (Volume 10), forensic psychology (Volume 11), and 
industrial and organizational psychology (Volume 12). Each 
of these volumes reviews what is currently known in these 
areas of study and application and identifies pertinent sources 
of information in the literature. Each discusses unresolved is- 
sues and unanswered questions and proposes future direc- 
tions in conceptualization, research, and practice. Each of the 
volumes also reflects the investment of scientific psycholo- 
gists in practical applications of their findings and the atten- 
tion of applied psychologists to the scientific basis of their 
methods. 

The Handbook of Psychology was prepared for the pur- 
pose of educating and informing readers about the present 
state of psychological knowledge and about anticipated ad- 
vances in behavioral science research and practice. With this 
purpose in mind, the individual Handbook volumes address 
the needs and interests of three groups. First, for graduate stu- 
dents in behavioral science, the volumes provide advanced 
instruction in the basic concepts and methods that define the 
fields they cover, together with a review of current knowl- 
edge, core literature, and likely future developments. Second, 
in addition to serving as graduate textbooks, the volumes 
offer professional psychologists an opportunity to read and 
contemplate the views of distinguished colleagues concern- 
ing the central thrusts of research and leading edges of prac- 
tice in their respective fields. Third, for psychologists seeking 
to become conversant with fields outside their own specialty 
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and for persons outside of psychology seeking informa- 
tion about psychological matters, the Handbook volumes 
serve as a reference source for expanding their knowledge 
and directing them to additional sources in the literature. 
The preparation of this Handbook was made possible by 
the diligence and scholarly sophistication of the 25 volume 
editors and co-editors who constituted the Editorial Board. 
As Editor-in-Chief, I want to thank each of them for the plea- 
sure of their collaboration in this project. I compliment them 
for having recruited an outstanding cast of contributors to 
their volumes and then working closely with these authors to 
achieve chapters that will stand each in their own right as 


valuable contributions to the literature. I would like finally to 
express my appreciation to the editorial staff of John Wiley 
and Sons for the opportunity to share in the development of 
this project and its pursuit to fruition, most particularly to 
Jennifer Simon, Senior Editor, and her two assistants, Mary 
Porterfield and Isabel Pratt. Without Jennifer’s vision of the 
Handbook and her keen judgment and unflagging support in 
producing it, the occasion to write this preface would not 
have arrived. 


IRVING B. WEINER 
Tampa, Florida 


Volume Preface 


There are probably not many psychologists who have spent 
much time thinking about creating a handbook. The prevalent 
reasons for becoming a psychologist—scientific curiosity, 
the need for personal expression, or the desire for fame and 
fortune—would be unlikely to bring to mind the idea of gen- 
erating a handbook. At the same time, most would agree that 
a handbook can be remarkably useful when the need arises. 
The chapters can provide the background for a grant pro- 
posal, the organization of a course offering, or a place for 
graduate students to look for a research problem. If presented 
at the right time, the clearly worthwhile aspects of this other- 
wise most unlikely endeavor can make it an attractive oppor- 
tunity; or, at least in retrospect, one could imagine saying, 
“Well, it seemed like a good idea at the time.” Even if there 
are a few simple and sovereign principles underlying all per- 
sonality processes and social behavior, they were not con- 
sciously present when organizing this volume. Instead, what 
was terribly salient were the needs and goals of potential 
users of this volume: What would a reader need to know to 
have a good understanding of the current theoretical and em- 
pirical issues that occupy present-day thinkers and re- 
searchers? What could the highly sophisticated investigators 
who were selected to write the chapters tell the reader about 
the promising directions for future development? The chap- 
ters in this volume provide both thorough and illuminating 
answers to those questions, and, to be sure, some can be 
grouped into a few sections based on some common, familiar 
themes. For those readers who want more information about 
what chapters would be useful or who are open to being in- 
trigued by the promise of some fascinating new ideas, this is 
a good time to take a brief glimpse at what the chapters are 
about. 

An immediately pressing question for the editors centered 
on what content to include and whom to invite for the indi- 
vidual chapters. There are probably many ways to arrive sys- 
tematically at those decisions, but then there is the intuitive 
method, which is easier, at least in that it can introduce a 
slight element of self-expression. The first chapter of this 
volume is a clear manifestation of the self-expressive mode. 
It comprises the thoughts of one of this volume’s editors and 
contains a creative series of proposals concerning both the 
logic and the derivations of employing evolutionary theory as 
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a basis for generating personality attributes, personality 
being the initial topic of the two major subjects that compose 
this fifth volume of the 12-volume Handbook of Psychology. 

Chapters 1 and 2 of this book are subsumed under the gen- 
eral heading of contexts. The thought here is that both per- 
sonality and social psychology, broad though they may be in 
their own right, should be seen as components of even wider 
fields of study, namely evolution and culture. 

Evolution provides a context that relates to the processes 
of the time dimension, that is, the sequences and progressions 
of nature over the history of life on earth. Evolutionary theory 
generates a constellation of phylogenetic principles repre- 
senting those processes that have endured and continue to un- 
dergird the ontogenetic development and character of human 
functioning. As such, these principles may guide more effec- 
tive thinking about which functions of personality are likely 
to have been—and to persist to be—the most relevant in our 
studies. Similarly, culture provides a context that relates to 
the structure and processes of the space dimension, that is, 
the larger configuration of forces that surround, shape, and 
give meaning to the events that operate in the more immedi- 
ate social psychological sphere. The study of culture may ex- 
plicate the wide constellation of influences within which so- 
cial behaviors are immersed and that ever so subtly exert 
direction, transform, and control and regulate even the most 
prosaic events of ordinary social communications and rela- 
tionships. A few additional words should be said in elabora- 
tion of these two contextual chapters. 

Admittedly theoretical and speculative, the paper by 
Theodore Millon outlines several of what he has deduced as 
the universal polarities of evolution: first, the core aims of 
existence, in which the polarities of life preservation are 
contrasted with life enhancement; second, life’s fundamen- 
tal modes of adaptation, counterposing ecologic accommo- 
dation and ecologic modification; third, the major strategies 
of species replication, setting reproductive nurturance in op- 
position to reproductive propagation; and fourth, a distinctly 
human polarity, that of predilections of abstraction, com- 
posed of comparative sources of information and their 
transformational processes. Millon spells out numerous per- 
sonality implications of these polarities and articulates 
sources of support from a wide range of psychological 
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literatures, such as humanistic theory and neurobiological 
research. 

Joan G. Miller and Lynne Schaberg, in their contextual 
chapter, provide a constructively critical review of the fail- 
ings of mainstream social psychology owing to its culture- 
free assumption of societal homogeneity. The authors specify 
a number of reasons why the cultural grounding of basic 
social-psychological processes have historically been down- 
played. No less important is their articulation of the key 
conceptual formulations that have led to modern cultural psy- 
chology. Also notable are the several insights and challenges 
that stem from this new field. Equally valuable is a thorough 
review of how cultural research may bear significantly on a 
range of basic cognitive, emotional, and motivational func- 
tions. The authors conclude by outlining the many ways in 
which ongoing cultural studies can contribute new and use- 
ful theoretical constructs, as well as pertinent research ques- 
tions that may substantially enrich the character, constructs, 
and range of numerous, more basic social-psychological 
formulations. 

The next set of eight chapters of the volume represent 
the creative and reflective thinking of many of our most no- 
table theoretical contributors to personology. They range 
from the genetic and biologic to the interpersonal and factor- 
ial. Each contributor is a major player in contemporary per- 
sonality thought and research. 

Before we proceed, a few words should be said concern- 
ing the current status of personologic theory. As he wrote in a 
1990 book, Toward a New Personology, the first editor of this 
volume commented that the literature of the 1950s and 1960s 
was characterized by egregious attacks on the personality 
construct—attacks based on a rather facile and highly selec- 
tive reading of then-popular research findings. And with the 
empirical grounding of personality in question and the conse- 
quential logic of personologic coherence and behavioral 
consistency under assault, adherents of the previously valued 
integrative view of personality lost their vaunted academic 
respectability and gradually withdrew from active publica- 
tion. Personality theory did manage to weather these mettle- 
some assaults, and it began what proved to be a wide-ranging 
resurgence in the 1970s. By virtue of time, thoughtful reflec- 
tion, and, not the least, disenchantment with proposed alter- 
natives such as behavioral dogmatism and psychiatric 
biochemistry, the place of the personality construct rapidly 
regained its formal solid footing. The alternatives have justly 
faded to a status consonant with their trivial character, 
succumbing under the weight of their clinical inefficacy 
and scholarly boredom. By contrast, a series of widely ac- 
claimed formulations were articulated by a number of con- 
temporary psychological, psychoanalytic, interpersonal, 


cognitive, factorial, genetic, social, neurobiologic, and evolu- 
tionary theorists. It is to these theorists and their followers 
that we turn next. 

Bringing the primitive and highly speculative genetic 
thought of the early twentieth century up to date by drawing 
on the technologies of the recent decade, W. John Livesley, 
Kerry L. Jang, and Philip Anthony Vernon articulate a con- 
vincing rationale for formulating personality concepts and 
their structure on the basis of trait-heritability studies. In a 
manner similar to Millon, who grounds his personologic con- 
cepts on the basis of a theory of evolutionary functions, 
Livesley et al. argue that genetic research provides a funda- 
mental grounding for deriving complex trait constellations; 
these two biologically anchored schemas may ultimately be 
coordinated through future theoretical and empirical re- 
search. The authors contend that most measures of personal- 
ity reflect heritable components and that the phenotypic 
structure of personality will ultimately resemble the pattern 
of an underlying genetic architecture. They assert, further, 
that etiologic criteria such as are found in genetics can offer a 
more objective basis for appraising personologic structure 
than can psychometrically based phenotypic analyses. More- 
over, they believe that the interaction of multiple genetic fac- 
tors will fully account for the complex patterns of trait 
covariances and trait clusters. 

Continuing the thread of logic from evolution to genetics to 
the neurochemical and physiological, Marvin Zuckerman 
traces the interplay of these biologically based formulations to 
their interaction with the environment and the generation of 
learned behavioral traits. Writing in the spirit of Edward 
Wilson’s concept of consilience and its aim of bringing a mea- 
sure of unity to ostensibly diverse sciences, Zuckerman spells 
out in considerable detail the flow or pathways undergird- 
ing four major personality trait concepts: extroversion/ 
sociability; neuroticism/anxiety; aggression/agreeableness; 
and impulsivity/sensation seeking. Recognizing that detailed 
connections between the biological and the personological 
are not as yet fully developed, Zuckerman goes to great 
pains, nevertheless, to detail a wide range of strongly sup- 
porting evidence, from genetic twin studies to EEG and brain 
imaging investigations of cortical and autonomic arousal, to 
various indexes of brain neurochemistry. 

Shifting the focus from the biological grounding of per- 
sonality attributes, Robert F. Bornstein provides a thoughtful 
essay on both classical psychoanalytic and contemporary 
models of psychodynamic theory. He does record, however, 
that the first incarnation of psychoanalysis was avowedly 
biological, recognizing that Freud in 1895 set out to link 
psychological phenomena to then-extant models of neural 
functioning. Nevertheless, the course of analytic theory has 


evolved in distinctly divergent directions over the past cen- 
tury, although recent efforts have been made to bridge them 
again to the challenge of modern neuroscience, as Bornstein 
notes. His chapter spells out core assumptions common to all 
models of psychoanalysis, such as classical analytic theory, 
neoanalytic models, object-relations theory, and self psychol- 
ogy, as well as contemporary integrative frameworks. 
Threads that link these disparate analytic perspectives are 
discussed, as are the key issues facing twenty-first-century 
analytic schemas. 

No more radical a contrast with psychoanalytic models of 
personality can be found than in theories grounded in the log- 
ical positivism and empiricism that are fundamental to be- 
havioral models, such as those articulated in the chapter by 
one of its primary exponents, Arthur W. Staats. Committed to 
a formal philosophical approach to theory development, 
Staats avers that most personality models lack formal rules of 
theory construction, possessing, at best, a plethora of differ- 
ent and unrelated studies and tests. Staats’s theory, termed 
psychological behaviorism, is grounded in learning principles 
generated originally in animal research, but more recently put 
into practice in human behavioral therapy. Like Clark Hull, a 
major second-generation behavioral thinker, he believes that 
all behavior is generated from the same primary laws. In his 
own formulations, Staats explicates a unified model of behav- 
ioral personology that is philosophically well structured and 
provides a program for developing diverse avenues of sys- 
tematic personality research. 

An innovative and dynamic framework for coordinating 
the cognitive, experiential, learning, and self-oriented com- 
ponents of personology (termed CEST) is presented in the 
theoretical chapter by Seymour Epstein. The author proposes 
that people operate through two interacting information- 
processing modes, one predominantly conscious, verbal, and 
rational, the other predominantly preconscious, automatic, 
and emotionally experiential. Operating according to differ- 
ent rules, it is asserted that the influence of the experiential 
system on the rational system is akin to what psychoanalysis 
claims for the role of the unconscious, but it is conceptualized 
in CEST in a manner more consistent with contemporary 
evolutionary and cognitive science. Epstein details the appli- 
cation of his CEST model for psychotherapy, notably by 
pointing out how the rational system can be employed to cor- 
rect problems generated in the experiential system. Also 
discussed is the importance of designing research that fully 
recognizes and encompasses the interplay between these two 
information-processing systems. 

The chapter by Charles S. Carver and Michael F. Scheier 
represents the current status of their decades-long thought 
and research on self-regulatory models of personality func- 
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tioning. Anchored in a sophisticated framework of feedback 
schemas, the authors emphasize a major facet of personality 
processing, the system of goals that compose the self, how 
the patterns of a person’s goals are related, and the means by 
which persons move toward and away from their goals. As a 
consequence of their research, the authors have come to see 
that actions are managed by a different set of feedback 
processes than are feelings. Aspirations are recalibrated in 
reasonably predictable ways as a function of experience; for 
example, successes lead to setting higher goals, whereas fail- 
ures tend to lower them. Conflicting goals often call for the 
suppression of once-desired goals, resulting in goal shifts, 
scaling back, disengagements, and, ultimately, lapses in self- 
control. Carver and Scheier view their goal as closely related 
to other contemporary schemas, such as dynamic systems 
theory and connectionism. 

In their richly developed chapter, Aaron L. Pincus and 
Emily B. Ansell set out to create a new identity for interper- 
sonal theory that recognizes its unique aspects and integra- 
tive potential. They suggest that the interpersonal perspective 
can serve as the basis for integrating diverse theoretical ap- 
proaches to personality. Given its focus on interpersonal situ- 
ations, this perspective includes both proximal descriptions 
of overt behavioral transactions and the covert or intrapsy- 
chic processes that mediate those transactions, including the 
internal mental representations of self and other. In addition 
to reviewing the work of the major originators (e.g., Sullivan, 
Leary) and contemporary thinkers in interpersonal theory 
(e.g., Benjamin, Kiesler), the authors believe that there 
continues to be a need for a more complete integration of the 
interpersonal perspective with motivational, developmental, 
object-relations, and cognitive theories of human behavior. 
Similarly, they argue for a further identification of those 
catalysts that stimulate the internalization of relational expe- 
riences into influential mental representations. 

The current popularity among psychologists of various 
five-factor formulations of personality in contemporary 
research is undeniable. Despite the extensive literature in the 
area, these formulations have not been as thoroughly dis- 
sected, critically examined, and explicated as they are in 
Willem K. B. Hofstee’s chapter on the structure of personal- 
ity traits. The author asserts that concepts such as personality 
are shaped and defined largely by the operations employed to 
construct them. Hence, several procedures applied under the 
rubric of the number five have been employed to characterize 
trait adjectives describing the structure and composition 
of the personality concept. Hofstee differentiates four opera- 
tional modules that constitute the five component paradigms: 
The first set of operations reflects standardized self-report 
questionnaires; the second comprises the lexical approach 
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based on selections from a corpus of a language; the third 
relies on a linear methodology employing a principal compo- 
nents analysis of Likert item scales; and the fourth produces 
rival hierarchical and circumplex models for structuring trait 
information. Hofstee concludes his chapter by proposing a 
family of models composed of a hierarchy of generalized 
semicircumplexes. 

Appropriately placed at the conclusion of the social psy- 
chology section, Aubrey Immelman’s chapter comprises a 
synthesis of personality and social behavior. It not only ex- 
amines the history of personality inquiry in political psychol- 
ogy but also offers a far-reaching and theoretically coherent 
framework for studying the subject in a manner consonant 
with principles in contextually adjacent fields, such as behav- 
ioral neuroscience and evolutionary ecology. Immelman pro- 
vides an explicit framework for a personality-based risk 
analysis of political outcomes, acknowledging the role of fil- 
ters that modulate the impact of personality on political per- 
formance. Seeking to accommodate a diversity of politically 
relevant personality characteristics, he bridges conceptual 
and methodological gaps in contemporary political study and 
specifically attempts a psychological examination of political 
leaders, on the basis of which he imposes a set of standards 
for personality-in-politics modeling. 

By way of confession, the social psychology chapters in 
this volume were selected for the most part after simply jot- 
ting down the first thoughts about what areas to include 
and who would be good candidates to write the chapters. 
Fortunately, subsequent scanning of a few well-known intro- 
ductory texts and prior handbooks did nothing to alter those 
initial hunches that came so immediately and automatically 
to mind. For the most part, the vast majority of the chapters 
cover contemporary perspectives on traditional social psy- 
chological issues; however, a few introduce new, highly ac- 
tive areas of inquiry (e.g., justice, close relationships, and 
peace studies). 

At this point, it would be nice to describe the central 
theme, the deep structure underlying the organization of the 
social psychology chapters. But, as most readers know, social 
psychology and social behavior are too broad and varied for 
that kind of organization to be valid, much less useful. For the 
past 50 years or so, social psychology has done remarkably 
well examining the various aspects of social behavior with 
what Robert Merton termed theories of the midrange—his 
theory of relative deprivation being a good example. 

The social psychology chapters easily fall in to a few 
categories based on the nature of the issues they address. 
Four chapters focus on the social context of fundamental 
psychological structures: social cognitions, emotions, the 
self concept, and attitudes. These, together with the chapter 


on environmental psychology, provide a natural introduction 
to the social processes and interpersonal dynamics that 
follow. 

In the chapter on social cognition, Galen V. Bodenhausen, 
C. Neil Macrae, and Kurt Hugenberg, point out that the sub- 
stance of the chapter contains an excellent review of the 
available literature describing the types of mental representa- 
tions that make up the content of social cognitions; how var- 
ious motives and emotions influence those cognitions; and 
the recent very exciting work on the nature, appearance, and 
consequences of automatic as well as more thoughtfully con- 
trolled processes. This chapter would be an excellent place 
for someone to get an overview of the best that is now known 
about the cognitive structures and processes that shape un- 
derstanding of social situations and mediate behavioral reac- 
tions to them. 

No less fundamental are the questions of the sources of 
people’s emotions and how they influence behavior. The 
chapter by José-Miguel Fernändez-Dols and James A. 
Russell provides a review of the theories and empirical evi- 
dence relevant to the two basic approaches to emotions and 
affect: as modular products of human evolutionary past 
and as script-like products of human cultural history. 
Whether one fully accepts their highly creative and brave 
integration of these two approaches employing the concept of 
core affect, their lucid description of the best available evi- 
dence together with their astute analytic insights will be well 
worth the reader’s time and effort. In addition, it would be re- 
markably easy to take their integrative theoretical model as 
the inspiration, or at least starting point, for various lines of 
critically important research. 

Roy F. Baumeister and Jean M. Twenge clearly intend that 
their readers fully appreciate their observation that the self- 
concept is intrinsically located in a social processes and 
interpersonal relations. In fact, as they state, the self is con- 
structed and maintained as a way of connecting the individual 
organism to other members of the species. It would be easy to 
view this as a contemporary example of teleological theoriz- 
ing (i.e., explaining structures and processes referring to a 
functional purpose); however, the authors go to considerable 
length to provide evidence explicitly describing the underly- 
ing dynamics. This includes issues such as belongingness, 
social exclusion, and ostracism, as well as the more familiar 
concerns with conformity and self-esteem. The authors make 
a good case for their proposition that one of the self’s crucial 
defining functions is to enable people to live with other peo- 
ple in harmony and mutual belongingness. 

The notion that people walk around with predispositions 
to think, feel, and act with regard to identifiable aspects 
of their world has a long and noble tradition in social 


psychology. Certainly since Gordon Allport’s writings the 
concept of attitudes and their nature, origins, and behavioral 
consequences have been at the core of social psychology. To 
be sure, those issues appear in one form or another through- 
out most of the chapters in this volume. James M. Olson and 
Gregory Maio took on the task of presenting what is now 
known about attitudes in social behavior. This includes the 
structure of attitudes, the dimensions on which they differ, 
how they are formed and related to beliefs and values, and 
their functions in social relations and behavior. Of particular 
importance is the identification of those issues and questions 
that should be addressed in future research. For example, the 
evidence for the distinction between implicit and explicit 
attitudes opens up several areas worthy of investigation. 

Ever since the seminal work of Barker and his colleagues, 
social psychologists have recognized the importance of con- 
sidering the built environments as well as sociocultural con- 
texts in arriving at an adequate understanding of human 
thought, feelings, and actions. In their chapter on environ- 
mental psychology, Gabriel Moser and David Uzzell adopt 
the idea exemplified in Barker’s early field research that psy- 
chologists must recognize that the environment is a critical 
factor if they are to understand how people function in the 
real world. As Moser and Uzzell demonstrate, much has been 
discovered about the environment-person relationship that 
falls nicely within the context created by that early work. The 
authors note that not only do environmental psychologists 
work in collaboration with other psychologists to understand 
the processes mediating these relationships, but they also find 
themselves in collaborative efforts with other disciplines, 
such as architects, engineers, landscape architects, urban 
planners, and so on. The common focus, of course, consists 
of the cognitions, attitudes, emotions, self-concepts, and 
actions of the social participants. 

The next chapters consider the dynamics involved in 
interpersonal and social processes that lead to changes in 
people’s attitudes and social behavior. 

Recognizing the important distinction between implicit 
and explicit attitudes, in their chapter on persuasion and atti- 
tude change Richard E. Petty, S. Christian Wheeler, and 
Zakary L. Tormala report that as yet there is no way to change 
implicit attitudes. Their main contribution consists of pre- 
senting the evidence and theories relevant to changing ex- 
plicit attitudes. After a relatively brief discussion of the cur- 
rently influential elaboration likelihood model, their chapter 
is organized around the important distinction between 
processes that involve relatively automatic low-effort reac- 
tions from the target person and those that engage the target’s 
thoughts and at times behavioral reactions. The distinction 
between high- and low-effort processes of attitude change 
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provides a comfortable and rather meaningful framework for 
organizing processes as seemingly disparate as affective 
priming, heuristic-based reactions, role playing, dissonance, 
information integration, and so on. 

Andrzej Nowak, Robin R. Vallacher, and Mandy E. 
Miller’s chapter on social influence and group dynamics has 
several noteworthy features, one of which is the range of 
material that they have included. The chapter is so nicely 
composed and lucidly written that the reader may not easily 
appreciate the wide range of material, both theory and evi- 
dence, that is being covered. For example, the chapter begins 
with the more traditionally familiar topics such as obedience 
and reactance, moves on to what is known about more ex- 
plicit efforts to influence people’s behavior, and then ad- 
dresses the interpersonal processes associated with group 
pressure, polarization, and social loafing. All that is pretty 
familiar to most psychologists. However, the authors finally 
arrive at the most recent theoretical perspectives involving 
cellular automata that naturally lend themselves to the use 
of computer simulations to outline the implicit axiomatic 
changes in complex systems. What an amazing trip in both 
theories and method! Is it possible that what the authors iden- 
tify as the press for higher order coherence provides a coher- 
ent integration of the entire social influence literature? 

The transition from these initial chapters to those remain- 
ing can be roughly equated with the two dominant concerns 
of social psychologists. Up to this point, the chapters were 
most concerned with basic social psychological processes: 
scientific understanding of the interpersonal processes and 
social behavior. The remaining chapters exemplify social 
psychologists’ desire to find ways to make the world a better 
place, where people treat each other decently or at least are 
less cruel and destructive. Three of these chapters consider 
the social motives and processes that are involved in people 
helping and being fair to one another, whereas the last three 
examine harmful things that can happen between individuals 
and social groups, ranging from acts of prejudice to open 
warfare. The last chapter offers an introduction to what is 
now known about achieving a peaceful world. 

In their chapter on altruism and prosocial behavior, 
C. Daniel Batson and Adam A. Powell offer a most sophisti- 
cated analysis of the relevant social psychological literature. 
On the basis of his research and theoretical writings, Batson 
is the most cited and respected psychological expert on 
prosocial behavior. In this chapter he discusses the evidence 
for four sources of prosocial behavior. After providing 
an analysis of the sources of these prosocial motives— 
enlightened self-interest, altruism, principalism, and collec- 
tivism—he then takes on the task of discussing the points of 
possible conflict and cooperation among them. One might 
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argue with his evidence for the ease with which the princi- 
palist motives—justice and fairness—can be corrupted by 
self-interest, and thus his conclusion is that prosocial behav- 
ior can be most reliably based on altruistic (i.e., empathy- 
based) motives. I suspect, however, that Kurt Lewin would 
have been very pleased with this highly successful example 
of the potential societal value of good social psychological 
theory. 

Leo Montada, in the chapter on justice, equity, and fair- 
ness in human relations, provides a very content-rich but nec- 
essarily selective review of what is known about how justice 
appears in people’s lives, the various aspects of justice, and 
their social and individual sources, as well as interpersonal 
consequences. At the same time that he leads the reader 
through a general survey of the justice literature, he provides 
the reader with highly sophisticated insights and critical 
analyses. It is clear from the outset of this chapter that 
Montada is a thoroughly well-informed social scientist ap- 
proaching one of the fundamental issues in human relations: 
how and why people care about justice in their lives, what 
forms that concern takes, and how important those are con- 
cerns in shaping how they treat one another. 

Margaret S. Clark and Nancy K. Grote’s chapter can be 
viewed as the integration of several literatures associated with 
close relationships, friendships, and marriages—romantic 
and familial. They focus on the social-psychological 
processes associated with “good relationships”: those that 
they define as fostering members’ well-being. This chapter 
provides the most recent developments in Clark’s important 
distinction between communal and exchange relationships 
and includes the report of an important longitudinal study ex- 
amining the relationship between conflict and fairness in close 
relationships. They find that conflict in a relationship leads to 
increased concern with issues of fairness that then lead the 
participants even further from the important communal norms 
based on mutual concern for one another’s welfare. 

Kenneth L. Dion’s chapter on prejudice, racism, and dis- 
crimination looks at various aspects of the darker side of in- 
terpersonal relations. In the first section of the chapter, Dion 
leads the reader to a very thoughtful and complete review of 
the various explanations for prejudice, racism, and discrimi- 
nation. Beginning with the classic and contemporary ver- 
sions of the authoritarian personality theories, he discusses 
just-world, belief congruence, and ambivalence literatures. 
Dion does a masterful job of leading the reader through the 
more recently developed distinction between automatic and 
controlled processes, as well as social dominance theory and 
multicomponent approaches to intergroup attitudes. But that 
is only the beginning. Reflecting his own earlier research 
interests, Dion devotes the second section of his chapter to 


the psychology of the victim of prejudice and discrimination. 
This section integrates the most recent findings in this highly 
active and productive area of inquiry. Dion describes the re- 
search that has given the familiar self-fulfilling prophecy no- 
tion in social psychology new meaning and has provided 
compelling new insights into the very important ways vic- 
tims respond to their unfair treatment. 

The chapter by John F. Dovidio, Samuel L. Gaertner, 
Victoria M. Esses and Marilynn B. Brewer examines the 
social-psychological processes involved in interpersonal and 
intergroup relations. This includes both the sources of social 
conflict and those involved in bringing about harmony and 
integration. The origins of the important work reported in this 
chapter can be traced to the initial insights of European social 
psychologists who recognized that when people they think in 
terms of “we” rather than “I,” there is a strong tendency also 
to react in terms of “us” versus “them” (i.e., in-group vs. out- 
group). The consequences, of course, include favoring mem- 
bers of the in-group and discriminating against members 
of the salient out-groups. After describing what is known 
about the psychological processes involved in these biased 
reactions, the authors then consider those processes that can 
preclude or overcome those destructive biases and promote 
harmony and social integration. 

Joseph de Rivera’s chapter takes a similar path, by first 
focusing on those social-psychological processes involved in 
aggression and violence, and then with that as background 
presenting his recommendations concerning how positive 
peace can be promoted. For de Rivera this does not simply 
mean an absence of open conflict, but rather a benevolent and 
supportive environment, as well as societal norms, that pro- 
mote individual processes involving harmony and well-being. 
In describing the various means for generating a global culture 
of peace, he also makes the case for the importance of individ- 
ual’s personal transformation in creating and maintaining a 
culture of peace. De Rivera offers the reader a highly sophisti- 
cated use of the social-psychological research and theory to 
arrive at specific recommendations for solving, arguably, the 
most important issues of our lives: the achievement of a 
peaceful, caring, nurturing social environment. Ambitious? 
Yes. But de Rivera generates the framework of his own per- 
spective out of the best of what social science has to offer. 

We trust the readers of this volume on personality and 
social psychology will find the chapters it contains to be both 
provocative and illuminating. It has been an honor and a joy 
to edit a book written by so many able, inspiring, and cooper- 
ative authors, whom we thank personally for their thoughtful 
and stimulating contributions. 

THEODORE MILLON 
MELVIN J. LERNER 
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In the last year of the twentieth century, voters elected a 
group of Kansas school board members who supported the 
removal of the concept of evolution from the state’s science 
curriculum, an act that indicated the extent to which evolu- 
tionary ideas could incite intense emotional, if not irrational 
opposition on the part of unenlightened laymen. Retrospec- 
tively appalled by their prior action, in the following year 
Kansan voters rescinded their perverse judgment and chose 
new board members who intended to restore the concept. 

The theory of evolution was reinstated not because the 
electors of Kansas, a most conservative and religious state, 
suddenly became agnostic, but because they realized that 
rejecting the idea would deny their children the necessity of 
remaining in touch with one of the fundamentals of modern 
science; they realized that this could, in effect, allow their 
children to fall behind, to be bereft of a basic science, and to 
be both a misinformed and misguided generation. Their chil- 
dren could become embarrassingly backward in a time of 
rapidly changing technology. 

Might not the same ambivalence be true of our own field, 
one composed of ostensibly sophisticated and knowledgeable 
scientists? Might we not be so deeply mired in our own tradi- 
tions (scholarly religions?) that we are unable to free our- 
selves from the habit of seeing our subject from no vantage 
point other than those to which we have become accustomed? 
Are we unable to recognize that behavior, cognition, the un- 
conscious, personality—all of our traditional subjects—are 
merely diverse manifestations of certain common and deeper 
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principles of functioning, processes, and mechanisms that 
have evolved either randomly or adaptively through history 
and time? Do we psychologists have a collective phobia 
about laws that may represent the fundamental origins of our 
traditional subjects? Does the search for and application of 
such laws push our emotional buttons, perhaps run hard 
against our habitual blinders, so much so as to prevent us 
from recognizing their value as a potential generative source 
that may more fully illuminate our science? 


PERSONOLOGY’S RELATIONSHIP 
TO OTHER SCIENCES 


It is the intent of this chapter to broaden our vistas, to furnish 
both a context and a set of guiding ideas that may enrich our 
studies. I believe it may be wise and perhaps even necessary 
to go beyond our current conceptual boundaries in psychol- 
ogy, more specifically to explore carefully reasoned, as well 
as intuitive hypotheses that draw their laws and principles if 
not their substance from contextually adjacent sciences such 
as evolution. Not only may such steps bear new conceptual 
fruits, but they may also provide a foundation that can under- 
gird and guide our own discipline’s explorations. Much of 
personology, no less psychology as a whole, remains adrift, 
divorced from broader spheres of scientific knowledge; it is 
isolated from firmly grounded if not universal principles, 
leading us to continue building the patchwork quilt of 
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concepts and data domains that characterize our field. Preoc- 
cupied with but a small part of the larger puzzle of nature or 
fearful of accusations of reductionism, we may fail to draw 
on the rich possibilities to be found in parallel realms of sci- 
entific pursuit. With few exceptions, cohering concepts that 
would connect our subject domain to those of its sister sci- 
ences in nature have not been adequately developed. 

It appears to me that we have become trapped in (obsessed 
with?) horizontal refinements. A search for integrative 
schemas and cohesive constructs that link its seekers closely 
to relevant observations and laws developed in other scien- 
tific fields is needed. The goal—albeit a rather ambitious 
one—is to refashion our patchwork quilt of concepts into a 
well-tailored and aesthetically pleasing tapestry that inter- 
weaves the diverse forms in which nature expresses itself 
(E. O. Wilson, 1998). 

What sphere is there within the psychological sciences 
more apt than personology to undertake the synthesis of na- 
ture? Persons are the only organically integrated system in 
the psychological domain, evolved through the millennia and 
inherently created from birth as natural entities rather than 
culture-bound and experience-derived gestalts. The intrinsic 
cohesion of nature’s diverse elements that inheres in persons 
is not a rhetorical construction, but rather an authentic sub- 
stantive unity. Personological features may often be disso- 
nant and may be partitioned conceptually for pragmatic or 
scientific purposes, but they are segments of an inseparable 
physicochemical, biopsychosocial entity. 

To take this view is not to argue that different spheres of 
scientific inquiry must be collapsed or even equated, but rather 
that there may be value in seeking a single, overarching 
conceptual system that interconnects ostensibly diverse sub- 
jects such as physics, biology, and psychology (Millon, 1990; 
E. O. Wilson, 1998). Arguing in favor of establishing explicit 
links between these domains calls for neither a reductionistic 
philosophy, nor a belief in substantive identicality, nor efforts 
to so fashion the links by formal logic. Rather, one should as- 
pire to their substantive concordance, empirical consistency, 
conceptual interfacing, convergent dialogues, and mutual 
enlightenment. 

A few words should be said concerning the undergird- 
ing framework used to structure an evolutionary context for 
a personology model. Parallel schemas are almost universally 
present in the literature; the earliest may be traced to 
mid-nineteenth-century philosophers, most notably Spencer 
(1855) and Haeckel (1874). More modern but equally specu- 
lative systems have been proposed by keen and broadly in- 
formed observers such as Edward Wilson (1975), Cosmides 
and Tooby (1987, 1989) and M. Wilson and Daley (1992), as 
well as by empirically well-grounded methodologists, such as 
Symons (1979, 1992) and D. M. Buss (1989, 1994). Each of 


their proposals fascinates either by virtue of its intriguing por- 
trayals or by the compelling power of its logic or its data. Their 
arguments not only coordinate with but also are anchored to 
observations derived specifically from principles of modern 
physical and biological evolution. It is these underpinnings of 
knowledge on which the personological model presented in 
this chapter has been grounded and from which a deeper and 
clearer understanding may be obtained concerning the nature 
of both normal and pathological personality functioning. 


On the Place of Theory in Personology 


The following discussion is conjectural, if not overly ex- 
tended in its speculative reach. In essence, it seeks to expli- 
cate the structure and styles of personality with reference to 
deficient, imbalanced, or conflicted modes of evolutionary 
survival, ecological adaptation, and reproductive strategy. 
Whatever one’s appraisal of these conjectures, the model that 
follows may best be approached in the spirit in which it was 
formulated—an effort to provide a context for explicating 
the domains of personological science in the hope that it can 
lead to a clearer understanding of our subject. All sciences 
have organizing principles that not only create order but also 
provide the basis for generating hypotheses and stimulating 
new knowledge. A contextual theory not only summarizes 
and incorporates extant knowledge, but is heuristic—that is, 
it has “systematic import,” as Hempel (1965) has phrased it, 
in that it may originate and develop new observations and 
new methods. 

It is unfortunate that the number of theories that have been 
advanced to “explain” personality is proportional to the in- 
ternecine squabbling found in the literature. However, and 
ostensibly toward the end of pragmatic sobriety, those of an 
antitheory bias have sought to persuade the profession of the 
failings of premature formalization, warning that one cannot 
arrive at the desired future by lifting science by its own boot- 
straps. To them, there is no way to traverse the road other sci- 
ences have traveled without paying the dues of an arduous 
program of empirical research. Formalized axiomatics, they 
say, must await the accumulation of so-called hard evidence 
that is simply not yet in. Shortcutting the route with ill-timed 
systematics, they claim, will lead us down primrose paths, 
preoccupying attentions as we wend fruitlessly through end- 
less detours, each of which could be averted by our holding 
fast to an empiricist philosophy and methodology. 

No one argues against the view that theories that float, so to 
speak, on their own, unconcerned with the empirical domain, 
should be seen as the fatuous achievements they are and the 
travesty they make of the virtues of a truly coherent concep- 
tual system. Formal theory should not be pushed far beyond 
the data, and its derivations should be linked at all points to 


established observations. However, a theoretical framework 
can be a compelling instrument for coordinating and giving 
consonance to complex and diverse observations—if its con- 
cepts are linked to relevant facts in the empirical world. By 
probing beneath surface impressions to inner structures and 
processes, previously isolated facts and difficult-to-fathom 
data may yield new relationships and expose clearer mean- 
ings. Scientific progress occurs when observations and con- 
cepts elaborate and refine previous work. However, this 
progression does not advance by brute empiricism alone, by 
merely piling up more descriptive and more experimental 
data. What is elaborated and refined in theory is understand- 
ing, an ability to see relations more plainly, to conceptualize 
categories and dimensions more accurately, and to create 
greater overall coherence in a subject—to integrate its ele- 
ments in a more logical, consistent, and intelligible fashion. 

A problem arises when introducing theory into the study 
of personality. Given our intuitive ability to “sense” the cor- 
rectness of a psychological insight or speculation, theoretical 
efforts that impose structure or formalize these insights into a 
scientific system will often be perceived as not only cumber- 
some and intrusive, but alien as well. This discomfiture and 
resistance does not arise in fields such as particle physics, in 
which everyday observations are not readily available and 
in which innovative insights are few and far between. In such 
subject domains, scientists not only are quite comfortable, 
but also turn readily to deductive theory as a means of help- 
ing them explicate and coordinate knowledge. It is paradoxi- 
cal but true and unfortunate that personologists learn their 
subject quite well merely by observing the ordinary events of 
life. As a consequence of this ease, personologists appear to 
shy from and hesitate placing trust in the obscure and com- 
plicating, yet often fertile and systematizing powers inherent 
in formal theory, especially when a theory is new or different 
from those learned in their student days. 

Despite the shortcomings in historic and contemporary the- 
oretical schemas, systematizing principles and abstract con- 
cepts can “facilitate a deeper seeing, a more penetrating vision 
that goes beyond superficial appearances to the order underly- 
ing them” (Bowers, 1977). For example, pre-Darwinian tax- 
onomists such as Linnaeus limited themselves to apparent 
similarities and differences among animals as a means of con- 
structing their categories. Darwin was not seduced by appear- 
ances. Rather, he sought to understand the principles by which 
overt features came about. His classifications were based not 
only on descriptive qualities, but also on explanatory ones. 


On the Place of Evolutionary Theory in Personology 


It is in both the spirit and substance of Darwin’s explanatory 
principles that the reader should approach the proposals that 
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follow. The principles employed are essentially the same as 
those that Darwin developed in seeking to explicate the origins 
of species. However, they are listed to derive not the origins of 
species, but rather the structure and style of personalities that 
have previously been generated on the basis of clinical obser- 
vation alone. Aspects of these formulations have been pub- 
lished in earlier books (Millon, 1969, 1981, 1986, 1990; 
Millon & Davis, 1996); they are anchored here, however, ex- 
plicitly to evolutionary and ecological theory. Identified in 
earlier writings as a biosocial learning model for personality 
and psychopathology, the theory we present seeks to generate 
the principles, mechanisms, and typologies of personality 
through formal processes of deduction. 

To propose that fruitful ideas may be derived by applying 
evolutionary principles to the development and functions of 
personological traits has a long (if yet unfulfilled) tradition. 
Spencer (1870), Huxley (1870), and Haeckel (1874) offered 
suggestions of this nature shortly after Darwin’s seminal Ori- 
gins was published. The school of functionalism, popular in 
psychology in the early part of this century, likewise drew its 
impetus from evolutionary concepts as it sought to articulate a 
basis for individual difference typologies (McDougall, 1932). 

In recent decades, numerous evolution-oriented psycholo- 
gists and biologists have begun to explore how the human 
mind may have been shaped over the past million years to 
solve the problems of basic survival, ecological adaptation, 
and species replication and diversification. These well-crafted 
formulations are distinctly different from other, more tradi- 
tional models employed to characterize human functioning. 

The human mind is assuredly sui generis, but it is only the 
most recent phase in the long history of organic life. Moreover, 
there is no reason to assume that the exigencies of life have dif- 
fered in their essentials among early and current species. It 
would be reasonable, therefore—perhaps inevitable—that the 
study of the functions of mind be anchored to the same princi- 
ples that are universally found in evolution’s progression. 
Using this anchor should enable us to build a bridge between 
the human mind and all other facets of natural science; more- 
over, it should provide a broad blueprint of why the mind en- 
gages in the functions it does, as well as what its essential 
purposes may be, such as pursuing parental affection and pro- 
tection, exploring the rationale and patterns of sexual mating, 
and specifying the styles of social communication and abstract 
language. 

In recent times we have also seen the emergence of socio- 
biology, a new science that has explored the interface be- 
tween human social functioning and evolutionary biology 
(E. O. Wilson, 1975, 1978). The common goal among both 
sociobiological and personological proposals is the desire not 
only to apply analogous principles across diverse scientific 
realms, but also to reduce the enormous range of behavioral 
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and trait concepts that have proliferated through modern his- 
tory. This goal might be achieved by exploring the power of 
evolutionary theory to simplify and order previously dis- 
parate personological features. For example, all organisms 
seek to avoid injury, find nourishment, and reproduce their 
kind if they are to survive and maintain their populations. 
Each species displays commonalities in its adaptive or sur- 
vival style. Within each species, however, there are differ- 
ences in style and differences in the success with which its 
various members adapt to the diverse and changing environ- 
ments they face. In these simplest of terms, differences 
among personality styles would be conceived as representing 
the more-or-less distinctive ways of adaptive functioning that 
an organism of a particular species exhibits as it relates to its 
typical range of environments. Disorders of personality, so 
formulated, would represent particular styles of maladaptive 
functioning that can be traced to deficiencies, imbalances, or 
conflicts in a species’ capacity to relate to the environments it 
faces. 

A few additional words should be said concerning analo- 
gies between evolution and ecology on the one hand and per- 
sonality on the other. During its life history, an organism 
develops an assemblage of traits that contribute to its individ- 
ual survival and reproductive success, the two essential com- 
ponents of fitness formulated by Darwin. Such assemblages, 
termed complex adaptations and strategies in the literature of 
evolutionary ecology, are close biological equivalents to what 
psychologists have conceptualized as personality styles and 
structures. In biology, explanations of a life history strategy 
of adaptations refer primarily to biogenic variations among 
constituent traits, their overall covariance structure, and the 
nature and ratio of favorable to unfavorable ecological re- 
sources that have been available for purposes of extending 
longevity and optimizing reproduction. Such explanations 
are not appreciably different from those used to account for 
the development of personality styles or functions. 

Bypassing the usual complications of analogies, a relevant 
and intriguing parallel may be drawn between the phylogenic 
evolution of a species’ genetic composition and the ontogenic 
development of an individual organism’s adaptive strategies 
(i.e., its personality style, so to speak). At any point in time, a 
species possesses a limited set of genes that serve as trait 
potentials. Over succeeding generations, the frequency distri- 
bution of these genes will likely change in their relative 
proportions depending on how well the traits they undergird 
contribute to the species’ “fittedness” within its varying 
ecological habitats. In a similar fashion, individual organisms 
begin life with a limited subset of their species’ genes and 
the trait potentials they subserve. Over time the salience 
of these trait potentials—not the proportion of the genes 


themselves—will become differentially prominent as the or- 
ganism interacts with its environments. It “learns” from these 
experiences which of its traits fit best (i.e., most optimally 
suit its ecosystem). In phylogenesis, then, actual gene fre- 
quencies change during the generation-to-generation adap- 
tive process, whereas in ontogenesis it is the salience or 
prominence of gene-based traits that changes as adaptive 
learning takes place. Parallel evolutionary processes occur— 
one within the life of a species, and the other within the life 
of an organism. What is seen in the individual organism is a 
shaping of latent potentials into adaptive and manifest styles 
of perceiving, feeling, thinking, and acting; these distinctive 
ways of adaptation, engendered by the interaction of biologi- 
cal endowment and social experience, comprise the elements 
of what is termed personality styles. It is a formative process 
in a single lifetime that parallels gene redistributions among 
species during their evolutionary history. 

Two factors beyond the intrinsic genetic trait potentials of 
advanced social organisms have a special significance in af- 
fecting their survival and replicability. First, other members 
of the species play a critical part in providing postnatal nur- 
turing and complex role models. Second, and no less rele- 
vant, is the high level of diversity and unpredictability of 
their ecological habitats. This requires numerous, multifac- 
eted, and flexible response alternatives that are either prepro- 
grammed genetically or acquired subsequently through early 
learning. Humans are notable for unusual adaptive pliancy, 
acquiring a wide repertoire of styles or alternate modes of 
functioning for dealing with both predictable and novel envi- 
ronmental circumstances. Unfortunately, the malleability of 
early potentials for diverse learnings diminishes as matura- 
tion progresses. As a consequence, adaptive styles acquired 
in childhood and usually suitable for comparable later envi- 
ronments become increasingly immutable, resisting modifi- 
cation and relearning. Problems arise in new ecological 
settings when these deeply ingrained behavior patterns per- 
sist, despite their lessened appropriateness; simply stated, 
what was learned and was once adaptive may no longer fit. 
Perhaps more important than environmental diversity, then, 
is the divergence between the circumstances of original 
learning and those of later life, a schism that has become 
more problematic as humans have progressed from stable 
and traditional to fluid and inconstant modern societies. 

From the viewpoint of survival logic, it is both efficient 
and adaptive either to preprogram or to train the young of a 
species with traits that fit the ecological habitats of their par- 
ents. This wisdom rests on the usually safe assumption that 
consistency if not identicality will characterize the ecological 
conditions of both parents and their offspring. Evolution is 
spurred when this continuity assumption fails to hold—when 


formerly stable environments undergo significant change. 
Radical shifts of this character could result in the extinction 
of a species. It is more typical, however, for environments to 
be altered gradually, resulting in modest, yet inexorable re- 
distributions of a species’ gene frequencies. Genes that sub- 
serve competencies that proved suited to the new conditions 
become proportionately more common; ultimately, the fea- 
tures they engender come to typify either a new variant of or 
a successor to the earlier species. 

All animal species intervene in and modify their habitats 
in routine and repetitive ways. Contemporary humans are 
unique in evolutionary history, however, in that both the 
physical and social environment has been altered in precipi- 
tous and unpredictable ways. These interventions appear 
to have set in motion consequences not unlike the “equilib- 
rium punctuations” theorized by modern paleontologists 
(Eldredge & Gould, 1972). This is best illustrated in the ori- 
gins of our recent borderline personality epidemic (Millon, 
1987): 


Central to our recent culture have been the increased pace of so- 
cial change and the growing pervasiveness of ambiguous and 
discordant customs to which children are expected to subscribe. 
Under the cumulative impact of rapid industrialization, immigra- 
tion, urbanization, mobility, technology, and mass communica- 
tion, there has been a steady erosion of traditional values and 
standards. Instead of a simple and coherent body of practices 
and beliefs, children find themselves confronted with constantly 
shifting styles and increasingly questioned norms whose durabil- 
ity is uncertain and precarious. Few times in history have so 
many children faced the tasks of life without the aid of accepted 
and durable traditions. Not only does the strain of making 
choices among discordant standards and goals beset them at 
every turn, but these competing beliefs and divergent demands 
prevent them from developing either internal stability or external 
consistency. (p. 363) 


Murray has said that “life is a continuous procession of ex- 
plorations . . . learnings and relearnings” (1959). Yet, among 
species such as humans, early adaptive potentials and plian- 
cies may fail to crystallize because of the fluidities and in- 
consistencies of the environment, leading to the persistence 
of what some have called immature and unstable styles that 
fail to achieve coherence and effectiveness. 

Lest the reader assume that those seeking to wed the sci- 
ences of evolution and ecology find themselves fully wel- 
come in their respective fraternities, there are those who 
assert that “despite pious hopes and intellectual convictions, 
[these two disciplines] have so far been without issue” 
(Lewontin, 1979). This judgment is now both dated and 
overly severe, but numerous conceptual and methodological 
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impediments do face those who wish to bring these fields of 
biological inquiry into fruitful synthesis—no less employing 
them to construe the styles of personality. Despite such con- 
cerns, recent developments bridging ecological and evolu- 
tionary theory are well underway, and hence do offer some 
justification for extending their principles to human styles of 
adaptation. 

To provide a conceptual background from these sciences 
and to furnish a rough model concerning the styles of person- 
ality, four domains or spheres of evolutionary and ecological 
principles are detailed in this chapter. They are labeled exis- 
tence, adaptation, replication, and abstraction. The first re- 
lates to the serendipitous transformation of random or less 
organized states into those possessing distinct structures of 
greater organization; the second refers to homeostatic 
processes employed to sustain survival in open ecosystems; 
the third pertains to reproductive styles that maximize the 
diversification and selection of ecologically effective attrib- 
utes; and the fourth, a distinctly human phenomenon, con- 
cerns the emergence of competencies that foster anticipatory 
planning and reasoned decision making. 

What makes evolutionary theory and ecological theory 
as meritorious as I propose them to be? Are they truly coex- 
tensive with the origins of the universe and the procession of 
organic life, as well as human modes of adaptation? Is ex- 
trapolation to personality a conjectural fantasy? Is there justi- 
fication for employing them as a basis for understanding 
normal and pathological behaviors? 

Owing to the mathematical and deductive insights of our 
colleagues in physics, we have a deeper and clearer sense of 
the early evolution and structural relations among matter and 
energy. So too has knowledge progressed in our studies of 
physical chemistry, microbiology, evolutionary theory, popu- 
lation biology, ecology, and ethology. How odd it is (is it 
not?) that we have only now again begun to investigate—as 
we did at the turn of the last century—the interface between 
the basic building blocks of physical nature and the nature of 
life as we experience and live it personally. How much more 
is known today, yet how hesitant are people to undertake a se- 
rious rapprochement? As Barash (1982) has commented: 


Like ships passing in the night, evolutionary biology and the 
social sciences have rarely even taken serious notice of each 
other, although admittedly, many introductory psychology 
texts give an obligatory toot of the Darwinian horn somewhere 
in the first chapter . . . before passing on to discuss human be- 
havior as though it were determined only by environmental 
factors. (p. 7) 


Commenting that serious efforts to undergird the behavioral 
sciences with the constructs and principles of evolutionary 
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biology are as audacious as they are overdue, Barash (1982) 
notes further: 


As with any modeling effort, we start with the simple, see how 
far it takes us, and then either complicate or discard it as it gets 
tested against reality. The data available thus far are certainly 
suggestive and lead to the hope that more will shortly be forth- 
coming, so that tests and possible falsification can be carried out. 
In the meanwhile, as Darwin said when he first read Malthus, at 
least we have something to work with! (p. 8) 


The role of evolution is most clearly grasped when it is paired 
with the principles of ecology. So conceived, the so-called 
procession of evolution represents a series of serendipitous 
transformations in the structure of a phenomenon (e.g., ele- 
mentary particle, chemical molecule, living organism) that 
appear to promote survival in both its current and future 
environments. Such processions usually stem from the 
consequences of either random fluctuations (such as muta- 
tions) or replicative reformations (e.g., recombinant mating) 
among an infinite number of possibilities—some simpler 
and others more complex, some more and others less orga- 
nized, some increasingly specialized and others not. Evolu- 
tion is defined, then, when these restructurings enable a 
natural entity (e.g., species) or its subsequent variants to sur- 
vive within present and succeeding ecological milieus. It is 
the continuity through time of these fluctuations and refor- 
mations that comprises the sequence we characterize as evo- 
lutionary progression. 


THREE UNIVERSAL POLARITIES OF EVOLUTION 


As noted in previous paragraphs, existence relates to the 
serendipitous transformation of states that are more 
ephemeral, less organized, or both into those possessing 
greater stability, greater organization, or both. It pertains to 
the formation and sustenance of discernible phenomena, to 
the processes of evolution that enhance and preserve life, 
and to the psychic polarity of pleasure and pain. Adaptation 
refers to homeostatic processes employed to foster survival 
in open ecosystems. It relates to the manner in which extant 
phenomena adapt to their surrounding ecosystems, to the 
mechanisms employed in accommodating to or in modifying 
these environments, and to the psychic polarity of passivity 
and activity. Replication pertains to reproductive styles that 
maximize the diversification and selection of ecologically ef- 
fective attributes. It refers to the strategies utilized to repli- 
cate ephemeral organisms, to the methods of maximizing 
reproductive propagation and progeny nurturance, and to the 
psychic polarity of self and other. These three polarities have 


forerunners in psychological theory that may be traced back 
to the early 1900s. 


Some Historical Notes 


A number of pre-World War I theorists proposed polarities 
that were used as the foundation for understanding a variety 
of psychological processes. Although others formulated par- 
allel schemas earlier than he, I illustrate these conceptions 
with reference to ideas presented by Sigmund Freud. He 
wrote in 1915 what many consider to be among his most 
seminal works, those on metapsychology and in particular, 
the paper entitled “The Instincts and Their Vicissitudes.” 
Speculations that foreshadowed several concepts developed 
more fully later both by himself and by others were pre- 
sented in preliminary form in these papers. Particularly no- 
table is a framework that Freud (1915/1925) advanced as 
central to understanding the mind; he framed these polarities 
as follows: 


Our mental life as a whole is governed by three polarities, 
namely, the following antitheses: 


e Subject (ego)-Object (external world) 
e Pleasure-Pain 


e Active-Passive 


The three polarities within the mind are connected with one 
another in various highly significant ways. 

We may sum up by saying that the essential feature in the 
vicissitudes undergone by instincts is their subjection to the in- 
fluences of the three great polarities that govern mental life. Of 
these three polarities we might describe that of activity-passivity 
as the biological, that of the ego-external world as the real, and 
finally that of pleasure-pain as the economic, respectively. 
(pp. 76-77, 83) 


Preceding Freud, however, aspects of these three polarities 
were conceptualized and employed by other theorists—in 
France, Germany, Russia, and other European nations as 
well as in the United States. Variations of the polarities of 
active-passive, subject-object, and pleasure-pain were identi- 
fied by Heymans and Wiersma in Holland, McDougall in the 
United States, Meumann in Germany, Kollarits in Hungary, 
and others (Millon, 1981; Millon & Davis, 1996). 

Despite the central role Freud assigned these polarities, 
he failed to capitalize on them as a coordinated system for un- 
derstanding patterns of human functioning. Although he 
failed to pursue their potentials, the ingredients he formulated 
for his tripartite polarity schema were drawn upon by his 


disciples for many decades to come, seen prominently in the 
progressive development from instinct or drive theory, in 
which pleasure and pain were the major forces, to ego psy- 
chology, in which the apparatuses of activity and passivity 
were central constructs, and, most recently, to self-psychology 
and object relations theory, in which the self-other polarity is 
the key issue (Pine, 1990). 

Forgotten as a metapsychological speculation by most, the 
scaffolding comprising these polarities was fashioned anew 
by this author in the mid-1960s (Millon, 1969). Unacquainted 
with Freud’s proposals at the time and employing a biosocial- 
learning model anchored to Skinnerian concepts, I constructed 
a framework similar to Freud’s “great polarities that govern all 
of mental life.” Phrased in the terminology of learning con- 
cepts, the model comprised three polar dimensions: positive 
versus negative reinforcement (pleasure-pain); self-other as 
reinforcement source; and the instrumental styles of active- 
passive. I (Millon, 1969) stated: 


By framing our thinking in terms of what reinforcements the in- 
dividual is seeking, where he is looking to find them and how he 
performs we may see more simply and more clearly the essential 
strategies which guide his coping behaviors. 

These reinforcements [relate to] whether he seeks primarily 
to achieve positive reinforcements (pleasure) or to avoid nega- 
tive reinforcements (pain). 

Some patients turn to others as their source of reinforcement, 
whereas some turn primarily to themselves. The distinction [is] 
between others and self as the primary reinforcement source. 

On what basis can a useful distinction be made among instru- 
mental behaviors? A review of the literature suggests that the 
behavioral dimension of activity-passivity may prove useful... . 
Active patients [are] busily intent on controlling the circum- 
stances of their environment. . . . Passive patients . . . wait for the 
circumstances of their environment to take their course... 
reacting to them only after they occur. (pp. 193-195) 


Do we find parallels within the disciplines of psychiatry and 
psychology that correspond to these broad evolutionary 
polarities? 

In addition to the forerunners noted previously, there is a 
growing group of contemporary scholars whose work relates 
to these polar dimensions, albeit indirectly and partially. For 
example, a modern conception anchored to biological foun- 
dations has been developed by the distinguished British psy- 
chologist Jeffrey Gray (1964, 1973). A three-part model of 
temperament, matching the three-part polarity model in most 
regards, has been formulated by the American psychologist 
Arnold Buss and his associates (Buss & Plomin 1975, 1984). 
Circumplex formats based on factor analytic studies of mood 
and arousal that align well with the polarity schema have been 
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published by Russell (1980) and Tellegen (1985). Deriving 
inspiration from a sophisticated analysis of neuroanatomical 
substrates, the highly resourceful American psychiatrist 
Robert Cloninger (1986, 1987) has deduced a threefold 
schema that is coextensive with major elements of the 
model’s three polarities. Less oriented to biological founda- 
tions, recent advances in both interpersonal and psychoana- 
lytic theory have likewise exhibited strong parallels to one or 
more of the three polar dimensions. A detailed review of these 
and other parallels has been presented in several recent books 
(e.g., Millon, 1990; Millon & Davis, 1996). 

The following pages summarize the rationale and charac- 
teristics of the three-part polarity model. A few paragraphs 
draw upon the model as a basis for establishing attributes for 
conceptualizing personality patterns. 


Aims of Existence 


The procession of evolution is not limited just to the evolution 
of life on earth but extends to prelife, to matter, to the primor- 
dial elements of our local cosmos, and, in all likelihood, to the 
elusive properties of a more encompassing universe within 
which our cosmos is embedded as an incidental part. The de- 
marcations we conceptualize to differentiate states such as 
nonmatter and matter, or inorganic and organic, are nominal 
devices that record transitions in this ongoing procession of 
transformations, an unbroken sequence of re-formed ele- 
ments that have existed from the very first. 

We may speak of the emergence of our local cosmos from 
some larger universe, or of life from inanimate matter, but if 
we were to trace the procession of evolution backward we 
would have difficulty identifying precise markers for each of 
these transitions. What we define as life would become pro- 
gressively less clear as we reversed time until we could no 
longer discern its presence in the matter we were studying. 
So, too, does it appear to theoretical physicists that if we trace 
the evolution of our present cosmos back to its ostensive ori- 
gins, we would lose its existence in the obscurity of an undif- 
ferentiated and unrecoverable past. The so-called Big Bang 
may in fact be merely an evolutionary transformation, one of 
an ongoing and never-ending series of transitions. 


Life Preservation and Life Enhancement: 
The Pain-Pleasure Polarity 


The notion of open systems is of relatively recent origin 
(Bertalanffy, 1945; Lotka, 1924; Schrodinger, 1944), brought 
to bear initially to explain how the inevitable consequences 
of the second law of thermodynamics appear to be circum- 
vented in the biological realm. By broadening the ecological 
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field so as to encompass events and properties beyond the 
local and immediate, it becomes possible to understand how 
living organisms on earth function and thrive, despite seem- 
ing to contradict this immutable physical law (e.g., solar ra- 
diation, continuously transmitting its ultimately exhaustible 
supply of energy, temporarily counters the earth’s inevitable 
thermodynamic entropy). The open system concept has been 
borrowed freely and fruitfully to illuminate processes across 
a wide range of subjects. In recent decades it has been ex- 
tended, albeit speculatively, to account for the evolution of 
cosmic events. These hypotheses suggest that the cosmos as 
known today may represent a four-dimensional “bubble” or 
set of “strings” stemming either from the random fluctuations 
of an open meta-universe characterized primarily by entropic 
chaos or of transpositions from a larger set of dimensions that 
comprise the properties of an open mega-universe—that is, 
dimensions beyond those we apprehend (Millon, 1990). 

By materializing new matter from fluctuations in a larger 
and unstable field—that is, by creating existence from non- 
existence (cold dark matter)—any embedded open system 
might not only expand, but also form entities displaying anti- 
entropic structure, the future survival of which is determined 
by the character of parallel materializations and by the fortu- 
itous consequences of their interactions (including their eco- 
logical balance, symbiosis, etc.). Beyond fortuitous levels of 
reciprocal fitness, some of these anti-entropic structures may 
possess properties that enable them to facilitate their own 
self-organization; that is to say, the forms into which they 
have been rendered randomly may not only survive, but also 
be able to amplify themselves, to extend their range, or both, 
sometimes in replicated and sometimes in more comprehen- 
sive structures. 

Recent mathematical research in both physics and chem- 
istry has begun to elucidate processes that characterize how 
structures “evolve” from randomness. Whether one evaluates 
the character of cosmogenesis, the dynamics of open chemi- 
cal systems, or repetitive patterns exhibited among weather 
movements, it appears that random fluctuations assume se- 
quences that often become both self-sustaining and recurrent. 
In chemistry, the theory of dissipative (free energy) structures 
(Prigogine 1972, 1976) proposes a principle called order 
through fluctuation that relates to self-organizational dynam- 
ics; these fluctuations proceed through sequences that not 
only maintain the integrity of the system but are also self- 
renewing. According to the theory, any open system may 
evolve when fluctuations exceed a critical threshold, setting 
in motion a qualitative shift in the nature of the system’s 
structural form. Similar shifts within evolving systems are 
explained in pure mathematics by what has been termed cat- 
astrophic theory (Thom, 1972); here, sudden switches from 


one dynamic equilibrium state to another occur instanta- 
neously with no intervening bridge. As models portraying 
how the dynamics of random fluctuation drive prior levels of 
equilibrium to reconstitute themselves into new structures, 
both catastrophe and dissipative theories prove fruitful in ex- 
plicating self-evolving morphogenesis—the emergence of 
new forms of existence from prior states. 

There is another equally necessary step to existence, one 
that maintains “being” by protecting established structures 
and processes. Here, the degrading effects of entropy are 
counteracted by a diversity of safeguarding mechanisms. 
Among both physical and organic substances, such as atoms 
and molecules, the elements comprising their nuclear struc- 
ture are tightly bound, held together by the strong force that 
is exceptionally resistant to decomposition (hence the power 
necessary to split the atom). More complicated organic struc- 
tures, such as plants and animals, also have mechanisms to 
counter entropic dissolution—that is to say, to maintain the 
existence of their lives. 

Two intertwined strategies are required, therefore: one to 
achieve existence, the other to preserve it. The aim of one is 
the enhancement of life—creating and then strengthening 
ecologically survivable organisms; the aim of the other is the 
preservation of life—avoiding circumstances that might ter- 
minate (entropically decompose) it. Although I disagree with 
Freud’s concept of a death instinct (Thanatos), I believe he 
was essentially correct in recognizing that a balanced yet fun- 
damental biological bipolarity exists in nature, a bipolarity 
that has its parallel in the physical world. As he wrote in one 
of his last works, “The analogy of our two basic instincts ex- 
tends from the sphere of living things to the pair of opposing 
forces—attraction and repulsion—which rule the inorganic 
world” (Freud, 1940, p. 72). Among humans, the former may 
be seen in life-enhancing acts that are attracted to what we 
experientially record as pleasurable events (positive rein- 
forcers), the latter in life-preserving behaviors oriented to 
repel events experientially characterized as painful (negative 
reinforcers). More is said of these fundamental if not univer- 
sal mechanisms of countering entropic disintegration in the 
next section. 

To summarize, the aims of existence reflects a to-be or 
not-to-be issue. In the inorganic world, to be is essentially a 
matter of possessing qualities that distinguish a phenomenon 
from its surrounding field—not being in a state of entropy. 
Among organic beings, to be is a matter of possessing the 
properties of life as well as being located in ecosystems that 
facilitate the enhancement and preservation of that life. In the 
phenomenological or experiential world of sentient organ- 
isms, events that extend life and preserve it correspond 
largely to metaphorical terms such as pleasure and pain; that 


is to say, recognizing and pursuing positive sensations and 
emotions, on the one hand, and recognizing and eschewing 
negative sensations and emotions, on the other. 

Although there are many philosophical and metapsycho- 
logical issues associated with the nature of pain and pleasure 
as constructs, it is neither our intent nor our task to inquire 
into them here. That they recur as a polar dimension time and 
again in diverse psychological domains (e.g., learned behav- 
iors, unconscious processes, emotion, and motivation, as well 
as their biological substrates) has been elaborated in another 
publication (Millon, 1990). In this next section, I examine 
their role as constructs for articulating attributes that may 
usefully define personality. 

Before we proceed, let us note that a balance must be 
struck between the two extremes that comprise each polarity; 
a measure of integration among the evolutionary polarities is 
an index of normality. Normal personality functioning, how- 
ever, does not require equidistance between polar extremes. 
Balanced but unequal positions emerge as a function of 
temperamental dispositions, which, in their turn, are modi- 
fied by the wider ecosystems within which individuals de- 
velop and function. In other words, there is no absolute or 
singular form of normal personality. Various polar positions 
and the personality attributes they subserve result in diverse 
styles of normality, just as severe or marked imbalances be- 
tween the polarities manifest themselves in diverse styles of 
abnormality (Millon & Davis, 1996). 

Moreover, given the diverse and changing ecological mi- 
lieus that humans face in our complex modern environment, 
there is reason to expect that most persons will develop mul- 
tiple adaptive styles, sometimes more active, sometimes less 
so, occasionally focused on self, occasionally on others, at 
times oriented to pleasure, at times oriented to the avoidance 
of pain. Despite the emergence of relatively enduring and 
characteristic styles over time, a measure of adaptive flexibil- 
ity typifies most individuals: Persons are able to shift from 
one position on a bipolar continuum to another as the cir- 
cumstances of life change. 


Personality Implications 


As noted, an interweaving and shifting balance between the 
two extremes that comprise the pain-pleasure polarity typi- 
fies normal personality functioning. Both of the following 
personality attributes should be met in varying degrees as life 
circumstances require. In essence, a synchronous and coordi- 
nated personal style would have developed to answer the 
question of whether the person should focus on experiencing 
only the enhancement of life versus concentrating his or her 
efforts on ensuring its preservation. 
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Avoiding Danger and Threat: The Life Preservation 
Attribute. One might assume that an attribute based on the 
avoidance of psychic or physical pain would be sufficiently 
self-evident not to require specification. As is well known, 
debates have arisen in the literature as to whether normal 
personality functioning represents the absence of mental 
disorder—that is, the reverse side of the mental illness or 
abnormality coin. That there is an inverse relationship be- 
tween health and disease cannot be questioned; the two are 
intimately connected both conceptually and physically. On 
the other hand, to define a healthy personality solely on the 
basis of an absence of disorder does not suffice. As a single 
attribute of behavior that signifies both the lack of (e.g., anx- 
iety, depression) and an aversion to (e.g., threats to safety and 
security) pain in its many and diverse forms does provide a 
foundation upon which other, more positively composed at- 
tributes may rest. Substantively, however, positive personal 
functioning must comprise elements beyond mere nonnor- 
mality or abnormality. And despite the complexities of per- 
sonality, from a definitional point of view normal functioning 
does preclude nonnormality. 

Turning to the evolutionary aspect of pain avoidance, that 
pertaining to a distancing from life-threatening circum- 
stances, psychic and otherwise, we find an early historical 
reference in the writings of Herbert Spencer, a supportive 
contemporary of Darwin. In 1870 Spencer averred: 


Pains are the correlative of actions injurious to the organism, 
while pleasures are the correlatives of actions conducive to its 
welfare. 

Those races of beings only can have survived in which, on the 
average, agreeable or desired feelings went along with activities 
conducive to the maintenance of life, while disagreeable and ha- 
bitually avoided feelings went along with activities directly or 
indirectly destructive of life. 

Every animal habitually persists in each act which gives plea- 
sure, so long as it does so, and desists from each act which gives 
pain. .. . It is manifest that in proportion as this guidance ap- 
proaches completeness, the life will be long; and that the life will 
be short in proportion as it falls short of completeness. 

We accept the inevitable corollary from the general doctrine 
of Evolution, that pleasures are the incentives to life-supporting 
acts and pains the deterrents from life-destroying acts. (pp. 279- 
284) 


More recently, Freedman and Roe (1958) wrote: 


We... hypothesize that psychological warning and warding- 
off mechanisms, if properly studied, might provide a kind of 
psychological-evolutionary systematics. Exposure to pain, anxi- 
ety, or danger is likely to be followed by efforts to avoid a 
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repetition of the noxious stimulus situation with which the expe- 
rience is associated. Obviously an animal with a more highly 
developed system for anticipating and avoiding the threatening 
circumstance is more efficiently equipped for adaptation and sur- 
vival. Such unpleasant situations may arise either from within, in 
its simplest form as tissue deprivation, or from without, by the 
infliction of pain or injury. Man’s psychological superstructure 
may be viewed, in part, as a system of highly developed warning 
mechanisms. (p. 458) 


As for the biological substrate of pain signals, Gray (1975) 
suggests two systems, both of which alert the organism to 
possible dangers in the environment. Those mediating the 
behavioral effects of unconditioned (instinctive?) aversive 
events are termed the fight-flight system (FFS). This system 
elicits defensive aggression and escape and is subserved, ac- 
cording to Gray’s pharmacological inferences, by the amgy- 
dala, the ventromedial hypothalamus, and the central gray of 
the midbrain; neurochemically, evidence suggests a difficult- 
to-unravel interaction among aminobutyric acids (for exam- 
ple, gamma-ammobutyric acid), serotonin, and endogenous 
opiates (for example, endorphins). The second major source 
of sensitivity and action in response to pain signals is referred 
to by Gray as the behavioral inhibition system (BIS), consist- 
ing of the interplay of the septal-hippocampal system, its 
cholinergic projections and monoamine transmissions to the 
hypothalamus, and then on to the cingulate and prefrontal cor- 
tex. Activated by signals of punishment or nonreward, the BIS 
suppresses associated behaviors, refocuses the organism’s at- 
tention, and redirects activity toward alternate stimuli. 

Harm avoidance is a concept proposed by Cloninger 
(1986, 1987). As he conceives the construct, it is a heritable 
tendency to respond intensely to signals of aversive stimuli 
(pain) and to learn to inhibit behaviors that might lead to pun- 
ishment and frustrative nonreward. Those high on this di- 
mension are characterized as cautious, apprehensive, and 
inhibited; those low on this valence would likely be confi- 
dent, optimistic, and carefree. Cloninger subscribes essen- 
tially to Gray’s behavioral inhibition system concept in 
explicating this polarity, as well as the neuroanatomical and 
neurochemical hypotheses Gray proposed as the substrates 
for its pain-avoidant mechanisms. 

Shifting from biological-evolutionary concepts, we may 
turn to proposals of a similar cast offered by thinkers of a 
distinctly psychological turn of mind. Notable here are the 
contributions of Maslow (1968), particularly his hierarchical 
listing of needs. Best known are the five fundamental needs 
that lead ultimately to self-actualization, the first two of 
which relate to our evolutionary attribute of life preservation. 
Included in the first group are the physio-logical needs such 
as air, water, food, and sleep, qualities of the ecosystem 


essential for survival. Next, and equally necessary to avoid 
danger and threat, are what Maslow terms the safety needs, 
including the freedom from jeopardy, the security of physical 
protection and psychic stability, as well as the presence of so- 
cial order and interpersonal predictability. 

That pathological consequences can ensue from the fail- 
ure to attend to the realities that portend danger is obvious; 
the lack of air, water, and food are not issues of great concern 
in civilized societies today, although these are matters of con- 
siderable import to environmentalists of the future and to 
contemporary poverty-stricken nations. 

It may be of interest next to record some of the psychic 
pathologies that can be traced to aberrations in meeting this 
first attribute of personality. For example, among those 
termed inhibited and avoidant personalities (Millon, 1969, 
1981), we see an excessive preoccupation with threats to 
one’s psychic security—an expectation of and hyperalertness 
to the signs of potential rejection—that leads these persons to 
disengage from everyday relationships and pleasures. At the 
other extreme of the polarity attribute, we see those of a risk- 
taking attitude, a proclivity to chance hazards and to endan- 
ger one’s life and liberty, a behavioral pattern characteristic 
of those we contemporaneously label antisocial personali- 
ties. Here there is little of the caution and prudence expected 
in the normal personality attribute of avoiding danger and 
threat; rather, we observe its opposite, a rash willingness to 
put one’s safety in jeopardy, to play with fire and throw cau- 
tion to the wind. Another pathological style illustrative of a 
failure to fulfill this evolutionary attribute is seen among 
those variously designated as masochistic and self-defeating 
personalities. Rather than avoid circumstances that may 
prove painful and self-endangering, these nonnormal person- 
ality styles set in motion situations in which they will come to 
suffer physically, psychically, or both. Either by virtue of 
habit or guilt absolution, these individuals induce rather than 
avoid pain for themselves. 


Seeking Rewarding Experiences: The Life Enhance- 
ment Attribute. At the other end of the existence polarity 
are attitudes and behaviors designed to foster and enrich 
life, to generate joy, pleasure, contentment, fulfillment, and 
thereby strengthen the capacity of the individual to remain 
vital and competent physically and psychically. This attribute 
asserts that existence and survival call for more than life 
preservation alone—beyond pain avoidance is what we have 
chosen to term pleasure enhancement. 

This attribute asks us to go at least one step further than 
Freud’s parallel notion that life’s motivation is chiefly that of 
“reducing tensions” (Oe, avoiding or minimizing pain), 
maintaining thereby a steady state, if you will, a homeostatic 


balance and inner stability. In accord with my view of evolu- 
tion’s polarities, I would assert that normal humans are also 
driven by the desire to enrich their lives, to seek invigorating 
sensations and challenges, to venture and explore, all to the 
end of magnifying if not escalating the probabilities of both 
individual viability and species replicability. 

Regarding the key instrumental role of “the pleasures,” 
Spencer (1870) put it well more than a century ago: “Pleasures 
are the correlatives of actions conducive to [organismic] 
welfare. . . . the incentives to life-supporting acts” (pp. 279, 
284). The view that there exists an organismic striving to ex- 
pand one’s inherent potentialties (as well as those of one’s kin 
and species) has been implicit in the literature of all times. 
That the pleasures may be both sign and vehicle for this real- 
ization was recorded even in the ancient writings of the 
Talmud, where it states: “everyone will have to justify himself 
in the life hereafter for every failure to enjoy a legitimately 
offered pleasure in this world” (Jahoda, 1958, p. 45). 

As far as contemporary psychobiological theorists are 
concerned, brief mention will be made again of the contribu- 
tions of Gray (1975, 1981) and Cloninger (1986, 1987). 
Gray’s neurobiological model centers heavily on activation 
and inhibition (active-passive polarities) as well as on signals 
of reward and punishment (pleasure-pain polarity). Basing 
his deductions primarily on pharmacological investigations 
of animal behavior, Gray has proposed the existence of sev- 
eral interrelated and neuroanatomically grounded response 
systems that activate various positive and negative affects. 
He refers to what he terms the behavioral activation system 
(BAS) as an approach system that is subserved by the reward 
center uncovered originally by Olds and Milner (1954). 
Ostensibly mediated at brain stem and cerebellar levels, it is 
likely to include dopaminergic projections across various 
striata and is defined as responding to conditioned rewarding 
and safety stimuli by facilitating behaviors that maximize 
their future recurrence (Gray, 1975). There are intricacies in 
the manner with which the BAS is linked to external stimuli 
and its anatomic substrates, but Gray currently views it as a 
system that subserves signals of reward, punishment relief, 
and pleasure. 

Cloninger (1986, 1987) has generated a theoretical model 
composed of three dimensions, which he terms reward depen- 
dence, harm avoidance, to which I referred previously, and 
novelty seeking. Proposing that each is a heritable personality 
disposition, he relates them explicitly to specific monoamin- 
ergic pathways; for example, high reward dependence is con- 
nected to low noradrenergic activity, harm avoidance to high 
serotonergic activity, and high novelty seeking to low 
dopaminergic activity. Cloninger’s reward dependence di- 
mension reflects highs and lows on the positive-gratifying- 


Three Universal Polarities of Evolution 13 


pleasure valence, whereas the harm avoidance dimension 
represents highs and lows on the negative-pain-displeasure 
valence. Reward dependence is hypothesized to be a herita- 
ble neurobiological tendency to respond to signals of reward 
(pleasure), particularly verbal signals of social approval, 
sentiment, and succor, as well as to resist events that might 
extinguish behaviors previously associated with these re- 
wards. Cloninger portrays those high on reward dependence 
to be sociable, sympathetic, and pleasant; in contrast, those 
low on this polarity are characterized as detached, cool, 
and practical. Describing the undergirding substrate for the 
reward-pleasure valence as the behavior maintenance sys- 
tem (BMS), Cloninger speculates that its prime neuromodu- 
lator is likely to be norepinephrine, with its major ascending 
pathways arising in the pons, projecting onward to hypo- 
thalamic and limbic structures, and then branching upward 
to the neocortex. 

Turning again to pure psychological formulations, both 
Rogers (1963) and Maslow (1968) have proposed concepts 
akin to my criterion of enhancing pleasure. In his notion of 
“openness to experience,” Rogers asserts that the fully func- 
tioning person has no aspect of his or her nature closed off. 
Such individuals are not only receptive to the experiences that 
life offers, but they are able also to use their experiences in ex- 
panding all of life’s emotions, as well as in being open to all 
forms of personal expression. Along a similar vein, Maslow 
speaks of the ability to maintain a freshness to experience, to 
keep up one’s capacity to appreciate relationships and events. 
No matter how often events or persons are encountered, one is 
neither sated nor bored but is disposed to view them with an 
ongoing sense of awe and wonder. 

Perhaps less dramatic than the conceptions of either 
Rogers and Maslow, I believe that this openness and freshness 
to life’s transactions is an instrumental means for extending 
life, for strengthening one’s competencies and options, and 
for maximizing the viability and replicability of one’s species. 
More mundane and pragmatic in orientation than their views, 
this conception seems both more substantive theoretically 
and more consonant a rationale for explicating the role the 
pleasures play in undergirding reward experience and open- 
ness to experience. 

As before, a note or two should be recorded on the patho- 
logical consequences of a failure to possess an attribute. 
These are seen most clearly in the personality disorders la- 
beled schizoid and avoidant. In the former there is a marked 
hedonic deficiency, stemming either from an inherent 
deficit in affective substrates or the failure of stimulative ex- 
perience to develop attachment behaviors, affective capac- 
ity, or both (Millon, 1981, 1990). Among those designated 
avoidant personalities, constitutional sensitivities or abusive 
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life experiences have led to an intense attentional sensitivity 
to psychic pain and a consequent distrust in either the 
genuineness or durability of the pleasures, such that these 
individuals can no longer permit themselves to chance expe- 
riencing them, lest they prove again to be fickle and unreli- 
able. Both of these personalities tend to be withdrawn and 
isolated, joyless and grim, neither seeking nor sharing in the 
rewards of life. 


Modes of Adaptation 


To come into existence as an emergent particle, a local cos- 
mos, or a living creature is but an initial phase, the serendip- 
itous presence of a newly formed structure, the chance 
evolution of a phenomenon distinct from its surroundings. 
Although extant, such fortuitous transformations may exist 
only for a fleeting moment. Most emergent phenomena do 
not survive (i.e., possess properties that enable them to retard 
entropic decomposition). To maintain their unique structure, 
differentiated from the larger ecosystem of which they are a 
part, and to be sustained as a discrete entity among other phe- 
nomena that comprise their environmental field requires 
good fortune and the presence of effective modes of adapta- 
tion. These modes of basic survival comprise the second es- 
sential component of evolution’s procession. 


Ecological Accommodation and Ecological Modification. 
The Passive-Active Polarity 


The second evolutionary stage relates to what is termed the 
modes of adaptation; it is also framed as a two-part polarity. 
The first may best be characterized as the mode of ecological 
accommodation, signifying inclinations to passively fit in, to 
locate and remain securely anchored in a niche, subject to the 
vagaries and unpredictabilities of the environment, all ac- 
ceded to with one crucial proviso: that the elements compris- 
ing the surroundings will furnish both the nourishment and 
the protection needed to sustain existence. Although based on 
a somewhat simplistic bifurcation among adaptive strategies, 
this passive and accommodating mode is one of the two fun- 
damental methods that living organisms have evolved as a 
means of survival. It represents the core process employed in 
the evolution of what has come to be designated as the plant 
kingdom: a stationary, rooted, yet essentially pliant and de- 
pendent survival mode. By contrast, the second of the two 
major modes of adaptation is seen in the lifestyle of the ani- 
mal kingdom. Here we observe a primary inclination toward 
ecological modification, a tendency to change or rearrange 
the elements comprising the larger milieu, to intrude upon 
otherwise quiescent settings, a versatility in shifting from one 


niche to another as unpredictability arises, a mobile and in- 
terventional mode that actively stirs, maneuvers, yields, and 
at the human level substantially transforms the environment 
to meet its own survival aims. 

Both modes—passive and active—have proven impres- 
sively capable to both nourishing and preserving life. Whether 
the polarity sketched is phrased in terms of accommodating 
versus modifying, passive versus active, or plant versus ani- 
mal, it represents at the most basic level the two fundamental 
modes that organisms have evolved to sustain their existence. 
This second aspect of evolution differs from the first stage, 
which is concerned with what may be called existential be- 
coming, in that it characterizes modes of being: how what has 
become endures. 

Broadening the model to encompass human experience, 
the active-passive polarity means that the vast range of be- 
haviors engaged in by humans may fundamentally be grouped 
in terms of whether initiative is taken in altering and shaping 
life’s events or whether behaviors are reactive to and accom- 
modate those events. 

Much can be said for the survival value of fitting a specific 
niche well, but no less important are flexibilities for adapting 
to diverse and unpredictable environments. It is here again 
where a distinction, although not a hard and fast one, may be 
drawn between the accommodating (plant) and the modify- 
ing (animal) mode of adaptation, the former more rigidly 
fixed and constrained by ecological conditions, the latter 
more broad-ranging and more facile in its scope of maneu- 
verability. To proceed in evolved complexity to the human 
species, we cannot help but recognize the almost endless va- 
riety of adaptive possibilities that may (and do) arise as sec- 
ondary derivatives of a large brain possessing an open 
network of potential interconnections that permit the func- 
tions of self-reflection, reasoning, and abstraction. But this 
takes us beyond the subject of this section of the chapter. The 
reader is referred elsewhere (Millon 1990) for a fuller discus- 
sion of active-passive parallels in wider domains of psycho- 
logical thought (for example, the “ego apparatuses” 
formulated by Hartmann (1939) or the distinction between 
classical and operant conditioning in the writings of Skinner 
(1938, 1953). 

Normal or optimal functioning, at least among humans, ap- 
pears to call for a flexible balance that interweaves both polar 
extremes. In the first evolutionary stage, that relating to exis- 
tence, behaviors encouraging both life enhancement (plea- 
sure) and life preservation (pain avoidance) are likely to be 
more successful in achieving survival than actions limited to 
one or the other alone. Similarly, regarding adaptation, modes 
of functioning that exhibit both ecological accommodation 
and ecological modification are likely to be more successful 


than is either by itself. Nevertheless, it does appear that the 
two advanced forms of life on earth—plants and animals— 
have evolved by giving precedence to one mode rather than 
both. 


Personality Implications 


As with the pair of criteria representing the aims of existence, 
a balance should be achieved between the two criteria com- 
prising modes of adaptation, those related to ecological 
accommodation and ecological modification, or what I have 
termed the passive-active polarity. Healthy personality func- 
tioning calls for a synchronous and coordinated style that 
weaves a balanced answer to the question of whether one 
should accept what the fates have brought forth or take the 
initiative in altering the circumstances of one’s life. 


Abiding Hospitable Realities: The Ecologically 
Accommodating Attribute. On first reflection, it would 
seem to be less than optimal to submit meekly to what life pre- 
sents, to adjust obligingly to one’s destiny. As described ear- 
lier, however, the evolution of plants is essentially grounded 
(no pun intended) in environmental accommodation, in an 
adaptive acquiescence to the ecosystem. Crucial to this adap- 
tive course, however, is the capacity of these surroundings to 
provide the nourishment and protection requisite to the thriv- 
ing of a species. 

Could the same be true for the human species? Are there 
not circumstances of life that provide significant and assured 
levels of sustenance and safekeeping (both psychic and phys- 
ical?) And if that were the case, would not the acquisition of 
an accommodating attitude and passive lifestyle be a logical 
consequence? The answer, it would seem, is yes. If one’s up- 
bringing has been substantially secure and nurturant, would it 
not be not normal to flee or overturn it? 

We know that circumstances other than those in infancy 
and early childhood rarely persist throughout life. Autonomy 
and independence are almost inevitable as a stage of matura- 
tion, ultimately requiring the adoption of so-called adult re- 
sponsibilities that call for a measure of initiative, decision 
making, and action. Nevertheless, to the extent that the 
events of life have been and continue to be caring and giving, 
is it not perhaps wisest, from an evolutionary perspective, to 
accept this good fortune and let matters be? This accommo- 
dating or passive life philosophy has worked extremely well 
in sustaining and fostering those complex organisms that 
comprise the plant kingdom. Hence passivity, the yielding to 
environmental forces, may be in itself not only unproblem- 
atic, but where events and circumstances provide the plea- 
sures of life and protect against their pains, positively 
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adaptive and constructive. Accepting rather than overturning 
a hospitable reality seems a sound course; or as it is said, “If 
it ain’t broke, don’t fix it.” 

Often reflective and deliberate, those who are passively 
oriented manifest few overt strategies to gain their ends. They 
display a seeming inertness, a phlegmatic lack of ambition or 
persistence, a tendency toward acquiescence, a restrained at- 
titude in which they initiate little to modify events, waiting 
for the circumstances of their environment to take their 
course before making accommodations. Some persons may 
be temperamentally ill-equipped to rouse or assert them- 
selves; perhaps past experience has deprived them of oppor- 
tunities to acquire a range of competencies or confidence in 
their ability to master the events of their environment; equally 
possible is a naive confidence that things will come their way 
with little or no effort on their part. From a variety of diverse 
sources, then, those at the passive end of the polarity engage 
in few direct instrumental activities to intercede in events or 
generate the effects they desire. They seem suspended, quies- 
cent, placid, immobile, restrained, listless, waiting for things 
to happen and reacting to them only after they occur. 

Is passivity a natural part of the repertoire of the human 
species, does agreeableness serve useful functions, and where 
and how is it exhibited? A few words in response to these 
questions may demonstrate that passivity is not mere inactiv- 
ity but a stance or process that achieves useful gains. For ex- 
ample, universal among mammalian species are two basic 
modes of learning: the respondent or conditioned type and 
the operant or instrumental type. The former is essentially a 
passive process, the simple pairing of an innate or reflexive 
response to a stimulus that previously did not elicit that re- 
sponse. In like passive fashion, environmental elements that 
occur either simultaneously or in close temporal order be- 
come connected to each other in the organism’s repertoire of 
learning, such that if one of these elements recurs in the fu- 
ture, the expectation is that the others will follow or be 
elicited. The organisms do not have to do anything active to 
achieve this learning; inborn reflexive responses and environ- 
mental events are merely associated by contiguity. 

Operant or instrumental learning, in contrast, represents 
the outcome of an active process on the part of the organism, 
one that requires an effort and execution on its part that has 
the effect of altering the environment. Whereas respondent 
conditioning occurs as a result of the passive observation of a 
conjoining of events, operant conditioning occurs only as a 
result of an active modification by the organism of its sur- 
roundings, a performance usually followed by a positive re- 
inforcer (pleasure) or the successful avoidance of a negative 
one (pain). Unconditioned reflexes, such as a leg jerk in 
reaction to a knee tap, will become a passively acquired 
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conditioned respondent if a bell is regularly sounded prior to 
the tap, as will the shrinking reflex of an eye pupil passively 
become conditioned to that bell if it regularly preceded expo- 
sure to a shining light. 

The passive-active polarity is central to formulations of 
psychoanalytic theory. Prior to the impressively burgeoning 
literature on self and object relations theory of the past two 
decades, the passive-active antithesis had a major role in 
both classical instinct and post-World War II ego schools of 
analytic thought. The contemporary focus on self and object 
is considered in discussions of the third polarity, that of self- 
other. However, we should not overlook the once key and 
now less popular constructs of both instinct theory and ego 
theory. It may be worth noting, as well as of special interest 
to the evolutionary model presented in this chapter, that the 
beginnings of psychoanalytic metapsychology were oriented 
initially to instinctual derivatives (in which pleasure and 
pain were given prominence), and then progressed subse- 
quently to the apparatuses of the ego (Hartmann, 1939; Ra- 
paport, 1953)—where passivity and activity were centrally 
involved. 

The model of activity, as Rapaport puts it, is a dual one: 
First, the ego is strong enough to defend against or control the 
intensity of the id’s drive tensions; or second, through the 
competence and energy of its apparatuses, the ego is success- 
ful in uncovering or creating in reality the object of the id’s 
instinctual drives. Rapaport conceives the model of passivity 
also to be a dual one: First, either the ego gradually modu- 
lates or indirectly discharges the instinctual energies of the 
id; or second, lacking an adequately controlling apparatus, 
the ego is rendered powerless and subject thereby to instinc- 
tual forces. Translating these formulations into evolution- 
ary terms, effective actions by the ego will successfully 
manage the internal forces of the id, whereas passivity will 
result either in accommodations or exposure to the internal 
demands of the id. 

Turning to contemporary theorists more directly con- 
cerned with normal or healthy personality functioning, the 
humanistic psychologist Maslow (1968) states that “self- 
actualized” individuals accept their nature as it is, despite 
personal weaknesses and imperfections; comfortable with 
themselves and with the world around them, they do not 
seek to change “the water because it is wet, or the rocks be- 
cause they are hard” (p. 153). They have learned to accept the 
natural order of things. Passively accepting nature, they need 
not hide behind false masks or transform others to fit 
distorted needs. Accepting themselves without shame or 
apology, they are equally at peace with the shortcomings of 
those with whom they live and relate. 


Where do we find clinical states of personality functioning 
that reflect failures to meet the accommodating-agreeable 
attribute? 

One example of an inability to leave things as they are is 
seen in what is classified as the histrionic personality disor- 
der. These individuals achieve their goals of maximizing pro- 
tection, nurturance, and reproductive success by engaging 
busily in a series of manipulative, seductive, gregarious, and 
attention-getting maneuvers. Their persistent and unrelenting 
manipulation of events is designed to maximize the receipt of 
attention and favors, as well as to avoid social disinterest and 
disapproval. They show an insatiable if not indiscriminate 
search for stimulation and approval. Their clever and often 
artful social behaviors may give the appearance of an inner 
confidence and self-assurance; beneath this guise, however, 
lies a fear that a failure on their part to ensure the receipt of at- 
tention will in short order result in indifference or rejection— 
hence their desperate need for reassurance and repeated signs 
of approval. Tribute and affection must constantly be replen- 
ished and are sought from every interpersonal source. As they 
are quickly bored and sated, they keep stirring up things, 
becoming enthusiastic about one activity and then another. 
There is a restless stimulus-seeking quality in which they can- 
not leave well enough alone. 

At the other end of the polarity are personality maladapta- 
tions that exhibit an excess of passivity, failing thereby to 
give direction to their own lives. Several personality disor- 
ders demonstrate this passive style, although their passivity 
derives from and is expressed in appreciably different ways. 
Schizoid personalities, for example, are passive owing to 
their relative incapacity to experience pleasure and pain; 
without the rewards these emotional valences normally acti- 
vate, they are devoid of the drive to acquire rewards, leading 
them to become apathetically passive observers of the ongo- 
ing scene. Dependent personality styles typically are average 
on the pleasure-pain polarity, yet they are usually as passive 
as schizoids. Strongly oriented to others, they are notably 
weak with regard to self. Passivity for them stems from 
deficits in self-confidence and competence, leading to deficits 
in initiative and autonomous skills, as well as a tendency to 
wait passively while others assume leadership and guide 
them. Passivity among so-called obsessive-compulsive per- 
sonalities stems from their fear of acting independently, 
owing to intrapsychic resolutions they have made to quell 
hidden thoughts and emotions generated by their intense self- 
other ambivalence. Dreading the possibility of making mis- 
takes or engaging in disapproved behaviors, they became 
indecisive, immobilized, restrained, and thereby passive. 
High on pain and low on both pleasure and self, individuals 


with masochistic personality styles operate on the assump- 
tion that they dare not expect nor deserve to have life go their 
way; giving up any efforts to achieve a life that accords with 
their true desires, they passively submit to others’ wishes, 
acquiescently accepting their fate. Finally, narcissistic per- 
sonality styles, especially high on self and low on others, be- 
nignly assume that good things will come their way with little 
or no effort on their part; this passive exploitation of others is 
a consequence of the unexplored confidence that underlies 
their self-centered presumptions. 


Mastering One’s Environment: The Ecologically 
Modifying Attribute. The active end of the adaptational 
polarity signifies the taking of initiative in altering and shap- 
ing life’s events. Such persons are best characterized by their 
alertness, vigilance, liveliness, vigor, forcefulness, stimulus- 
seeking energy, and drive. Some plan strategies and scan al- 
ternatives to circumvent obstacles or avoid the distress of 
punishment, rejection, and anxiety. Others are impulsive, pre- 
cipitate, excitable, rash, and hasty, seeking to elicit pleasures 
and rewards. Although specific goals vary and change from 
time to time, actively aroused individuals intrude on passing 
events and energetically and busily modify the circumstances 
of their environment. 

Neurobiological research has proven to be highly support- 
ive of the activity or arousal construct ever since Papez (1937), 
Moruzzi and Magnum (1949), and MacLean (1949, 1952) 
assigned what were to be termed the reticular and limbic sys- 
tems’ both energizing and expressive roles in the central ner- 
vous system. 

First among historic figures to pursue this theme was Ivan 
Pavlov. In speaking of the basic properties of the nervous sys- 
tem, Pavlov referred to the strength of the processes of exci- 
tation and inhibition, the equilibrium between their respective 
strengths, and the mobility of these processes. Although 
Pavlov’s (1927) theoretical formulations dealt with what 
Donald Hebb (1955) termed a conceptual nervous system, his 
experiments and those of his students led to innumerable di- 
rect investigations of brain activity. Central to Pavlov’s thesis 
was the distinction between strong and weak types of nervous 
systems. 

Closely aligned to Pavlovian theory, Gray (1964) has 
asserted that those with weak nervous systems are easily 
aroused, non-sensation-seeking introverts who prefer to 
experience low rather than high levels of stimulation. Con- 
versely, those with strong nervous systems would arouse 
slowly and be likely to be sensation-seeking extroverts who 
find low stimulation levels to be boring and find high levels 
to be both exciting and pleasant. 
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Akin also to the active modality are the more recent views 
of Cloninger (1986, 1987). To him, novelty-seeking is a her- 
itable tendency toward excitement in response to novel stim- 
uli or cues for reward (pleasure) or punishment relief (pain), 
both of which leading to exploratory activity. Consonant with 
its correspondence to the activity polarity, individuals who 
are assumed to be high in novelty-seeking may be character- 
ized in their personality attributes as impulsive, excitable, 
and quickly distracted or bored. Conversely, those at the pas- 
sive polarity or the low end of the novelty-seeking dimension 
may be portrayed as reflective, stoic, slow-tempered, orderly, 
and only slowly engaged in new interests. 

Turning from ostensive biological substrates to specula- 
tive psychological constructs, de Charms (1968) has pro- 
posed that “man’s primary motivational propensity is to be 
effective in producing changes in his environment” (p. 269). 
A similar view has been conveyed by White (1959) in his con- 
cept of effectance, an intrinsic motive, as he views it, that ac- 
tivates persons to impose their desires upon environments. De 
Charms (1968) elaborates his theme with reference to man as 
Origin and as Pawn, constructs akin to the active polarity on 
the one hand and to the passive polarity on the other; he states 
this distinction as follows: 


That man is the origin of his behavior means that he is constantly 
struggling against being confined and constrained by external 
forces, against being moved like a pawn into situations not of his 
own choosing. . . . An Origin is a person who perceives his be- 
havior as determined by his own choosing; a Pawn is a person 
who perceives his behavior as determined by external forces be- 
yond his control. . . . An Origin has strong feelings of personal 
causation, a feeling that the locus for causation of effects in his 
environment lies within himself. The feedback that reinforces 
this feeling comes from changes in his environment that are at- 
tributable to personal behavior. This is the crux of personal cau- 
sation, and it is a powerful motivational force directing future 
behavior. (pp. 273-274) 


Allport (1955) argued that history records many individuals 
who were not content with an existence that offered them 
little variety, a lack of psychic tension, and minimal chal- 
lenge. Allport considers it normal to be pulled forward by a 
vision of the future that awakened within persons their drive 
to alter the course of their lives. He suggests that people pos- 
sess a need to invent motives and purposes that would con- 
sume their inner energies. In a similar vein, Fromm (1955) 
proposed a need on the part of humans to rise above the roles 
of passive creatures in an accidental if not random world. 
To him, humans are driven to transcend the state of merely 
having been created; instead, humans seek to become the 
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creators, the active shapers of their own destiny. Rising above 
the passive and accidental nature of existence, humans gener- 
ate their own purposes and thereby provide themselves with 
a true basis of freedom. 


Strategies of Replication 


In their mature stage, organisms possess the requisite compe- 
tencies to maintain entropic stability. When these competen- 
cies can no longer adapt and sustain existence, organisms 
succumb inexorably to death and decomposition. This fate 
does not signify finality, however. Prior to their demise, all 
ephemeral species create duplicates that circumvent their ex- 
tinction, engaging in acts that enable them to transcend the 
entropic dissolution of their members’ individual existences. 

If an organism merely duplicates itself prior to death, then 
its replica is doomed to repeat the same fate it suffered. How- 
ever, if new potentials for extending existence can be fash- 
ioned by chance or routine events, then the possibility of 
achieving a different and conceivably superior outcome may 
be increased. And it is this co-occurrence of random and re- 
combinant processes that does lead to the prolongation of a 
species’ existence. This third hallmark of evolution’s proces- 
sion also undergirds another of nature’s fundamental polari- 
ties, that between self and other. 


Reproductive Nurturance and Reproductive Propagation: 
The Other-Self Polarity 


At its most basic and universal level, the manifold varieties of 
organisms living today have evolved, as Mayr (1964) has 
phrased it, to cope with the challenge of continuously chang- 
ing and immensely diversified environments, the resources of 
which are not inexhaustible. The means by which organisms 
cope with environmental change and diversity are well 
known. Inorganic structures survive for extended periods of 
time by virtue of the extraordinary strength of their bonding. 
This contrasts with the very earliest forerunners of organic 
life. Until they could replicate themselves, their distinctive 
assemblages existed precariously, subject to events that could 
put a swift end to the discrete and unique qualities that char- 
acterized their composition, leaving them essentially as tran- 
sient and ephemeral phenomena. After replicative procedures 
were perfected, the chemical machinery for copying organis- 
mic life, the DNA double helix, became so precise that it 
could produce perfect clones—if nothing interfered with its 
structure or its mechanisms of execution. But the patterning 
and processes of complex molecular change are not immune 
to accident. High temperatures and radiation dislodge and 
rearrange atomic structures, producing what are termed 


mutations, alterations in the controlling and directing DNA 
configuration that undergirds the replication of organismic 
morphology. 

Despite the deleterious impact of most mutations, it is the 
genetic variations to which they give rise that have served as 
one of the primary means by which simple organisms acquire 
traits making them capable of adapting to diverse and chang- 
ing environments. But isomorphic replication, aided by an 
occasional beneficent mutation, is a most inefficient if not 
hazardous means of surmounting ecological crises faced by 
complex and slowly reproducing organisms. Advantageous 
mutations do not appear in sufficient numbers and with suffi- 
cient dependability to generate the novel capabilities required 
to adapt to frequent or marked shifts in the ecosystem. How 
then did the more intricate and intermittently reproducing or- 
ganisms evolve the means to resolve the diverse hazards of 
unpredictable environments? 

The answer to this daunting task was the evolution of a re- 
combinant mechanism, one in which a pair of organisms ex- 
change their genetic resources: They develop what we term 
sexual mating. Here, the potentials and traits each partner 
possesses are sorted into new configurations that differ in 
their composition from those of their origins, generating 
thereby new variants and capabilities, of which some may 
prove more adaptive (and others less so) in changing envi- 
ronments than were their antecedents. Great advantages ac- 
crue by the occasional favorable combinations that occur 
through this random shuffling of genes. 

Recombinant replication, with its consequential benefits 
of selective diversification, requires the partnership of two 
parents, each contributing its genetic resources in a distinc- 
tive and species-characteristic manner. Similarly, the atten- 
tion and care given the offspring of a species’ matings are 
also distinctive. Worthy of note is the difference between the 
mating parents in the degree to which they protect and nour- 
ish their joint offspring. Although the investment of energy 
devoted to upbringing is balanced and complementary, rarely 
is it identical or even comparable in either devotion or deter- 
mination. This disparity in reproductive investment strate- 
gies, especially evident among nonhuman animal species 
(e.g., insects, reptiles, birds, mammals), underlies the evolu- 
tion of the male and female genders, the foundation for the 
third cardinal polarity I propose to account for evolution’s 
procession. 

Somewhat less profound than that of the first polarity, 
which represents the line separating the enhancement of 
order (existence-life) from the prevention of disorder 
(nonexistence-death), or that of the second polarity, differen- 
tiating the adaptive modes of accommodation (passive-plant) 
from those of modification (active-animal), the third polarity, 


based on distinctions in replication strategies, is no less fun- 
damental in that it contrasts the maximization of reproduc- 
tive propagation (self-male) from that of the maximization of 
reproductive nurturance (other-female). 

Evolutionary biologists (Cole, 1954; Trivers, 1974; E. O. 
Wilson, 1975) have recorded marked differences among 
species in both the cycle and pattern of their reproductive 
behaviors. Of special interest is the extreme diversity among 
and within species in the number of offspring spawned and 
the consequent nurturing and protective investment the 
parents make in the survival of their progeny. Designated the 
r-strategy and K-strategy in population biology, the former 
represents a pattern of propagating a vast number of offspring 
but exhibiting minimal attention to their survival; the latter 
is typified by the production of few progeny followed by 
considerable effort to assure their survival. Exemplifying the 
r-strategy are oysters, which generate some 500 million eggs 
annually; the K-strategy is found among the great apes, 
which produce a single offspring every 5 to 6 years. 

Not only do species differ in where they fall on the r- to 
K-strategy continuum, but within most animal species an im- 
portant distinction may be drawn between male and female 
genders. It is this latter differentiation that undergirds what 
has been termed the self- versus other-oriented polarity, im- 
plications of which are briefly elaborated in the following 
discussion. 

Human females typically produce about four hundred 
eggs in a lifetime, of which no more than twenty to twenty- 
five can mature into healthy infants. The energy investment 
expended in gestation, nurturing, and caring for each child, 
both before and during the years following birth, is extraordi- 
nary. Not only is the female required to devote much of her 
energies to bring the fetus to full term, but during this period 
she cannot be fertilized again; in contrast, the male is free to 
mate with numerous females. And should her child fail to sur- 
vive, the waste in physical and emotional exertion not only is 
enormous, but also amounts to a substantial portion of the 
mother’s lifetime reproductive potential. There appears to be 
good reason, therefore, to encourage a protective and caring 
inclination on the part of the female, as evident in a sensitiv- 
ity to cues of distress and a willingness to persist in attending 
to the needs and nurturing of her offspring. 

Although the male discharges tens of millions of sperm 
on mating, this is but a small investment, given the ease 
and frequency with which he can repeat the act. On fertiliza- 
tion, his physical and emotional commitment can end with 
minimal consequences. Although the protective and food- 
gathering efforts of the male may be lost by an early abandon- 
ment of a mother and an offspring or two, much more may be 
gained by investing energies in pursuits that achieve the wide 
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reproductive spread of his genes. Relative to the female of the 
species, whose best strategy appears to be the care and com- 
fort of child and kin—that is, the K-strategy—the male is 
likely to be reproductively more prolific by maximizing self- 
propagation—that is, adopting the r-strategy. To focus primar- 
ily on self-replication may diminish the survival probabilities 
of a few of a male’s progeny, but this occasional reproductive 
loss may be well compensated for by mating with multiple 
females and thereby producing multiple offspring. 

In sum, males lean toward being self-oriented because 
competitive advantages that inhere within themselves maxi- 
mize the replication of their genes. Conversely, females lean 
toward being other-oriented because their competence in nur- 
turing and protecting their limited progeny maximizes the 
replication of their genes. 

The consequences of the male’s r-strategy are a broad range 
of what may be seen as self- as opposed to other-oriented 
behaviors, such as acting in an egotistical, insensitive, incon- 
siderate, uncaring, and minimally communicative manner. In 
contrast, females are more disposed to be other-oriented, 
affiliative, intimate, empathic, protective, communicative, 
and solicitous (Gilligan, 1982; Rushton, 1985; E. O. Wilson, 
1978). 


Personality Implications 


As before, I consider both of the following criteria necessary 
to the definition and determination of a full personality char- 
acterization. I see no necessary antithesis between the two. 
Humans can be both self-actualizing and other-encouraging, 
although most persons are likely to lean toward one or the 
other side. A balance that coordinates the two provides a sat- 
isfactory answer to the question of whether one should be 
devoted to the support and welfare of others (the underlying 
philosophy of the “Democrats”) or fashion one’s life in 
accord with one’s own needs and desires (the underlying 
philosophy of the “Republicans”). 


Constructive Loving: The Other-Nurturing Attribute. 
As described earlier, recombinant replication achieved by 
sexual mating entails a balanced although asymmetrical 
parental investment in both the genesis and the nurturance of 
offspring. By virtue of her small number of eggs and ex- 
tended pregnancy, the female strategy for replicative success 
among most mammals is characterized by the intensive care 
and protection of a limited number of offspring. Oriented to 
reproductive nurturance rather than reproductive propaga- 
tion, most adult females, at least until recent decades in West- 
ern society, bred close to the limit of their capacity, attaining 
a reproductive ceiling of approximately 20 viable births. 
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By contrast, not only are males free of the unproductive 
pregnancy interlude for mating, but they may substantially 
increase their reproductive output by engaging in repetitive 
matings with as many available females as possible. 

The other-versus-self antithesis follows from additional 
aspects of evolution’s asymmetric replication strategy. Not 
only must the female be oriented to and vigilant in identify- 
ing the needs of and dangers that may face each of her few 
offspring, but it is reproductively advantageous for her to be 
sensitive to and discriminating in her assessment of potential 
mates. A bad mating—one that issues a defective or weak 
offspring—has graver consequences for the female than for 
the male. Not only will such an event appreciably reduce her 
limited reproductive possibilities and cause her to forego a 
better mate for a period of time, but she may exhaust much of 
her nurturing and protective energies in attempting to revital- 
ize an inviable or infertile offspring. By contrast, if a male in- 
dulges in a bad mating, all he has lost are some quickly 
replaceable sperm, a loss that does little to diminish his future 
reproductive potentials and activities. 

Before we turn to other indexes and views of the self-other 
polarity, let us be mindful that these conceptually derived 
extremes do not evince themselves in sharp and distinct gen- 
der differences. Such proclivities are matters of degree, not 
absolutes, owing not only to the consequences of recombinant 
“shuffling” and gene “crossing over,” but also to the influential 
effects of cultural values and social learning. Consequently, 
most normal individuals exhibit intermediate characteristics 
on this as well as on the other two polarity sets. 

The reasoning behind different replication strategies de- 
rives from the concept of inclusive fitness, the logic of which 
we owe to the theoretical biologist W. D. Hamilton (1964). 
The concept’s rationale is well articulated in the following 
quote (Daly & Wilson, 1978): 


Suppose a particular gene somehow disposes its bearers to help 
their siblings. Any child of a parent that has this gene has a one- 
half of probability of carrying that same gene by virtue of com- 
mon descent from the same parent bearer. . . . From the gene’s 
point of view, it is as useful to help a brother or sister as it is to 
help the child. 

When we assess the fitness of a... bit of behavior, we must 
consider more than the reproductive consequences for the indi- 
vidual animal. We must also consider whether the reproductive 
prospects of any kin are in any way altered. Inclusive fitness is a 
sum of the consequences for one’s own reproduction plus the 
consequences for the reproduction of kin multiplied by the degree 
of relatedness of those kin [italics added]. 

An animal’s behavior can therefore be said to serve a 
strategy whose goal is the maximization of inclusive fitness. 
(pp. 30-31) 


Mutual support and encouragement represents efforts lead- 
ing to reciprocal fitness—a behavioral pattern consonant 
with Darwin’s fundamental notions. Altruism, however, is a 
form of behavior in which there is denial of self for the ben- 
efit of others, a behavioral pattern acknowledged by Darwin 
himself as seemingly inconsistent with his theory (1871, 
p. 130). A simple extrapolation from natural selection sug- 
gests that those disposed to engage in self-sacrifice would 
ultimately leave fewer and fewer descendants; as a conse- 
quence, organisms motivated by self-benefiting genes would 
prevail over those motivated by other-benefiting genes, a re- 
sult leading to the eventual extinction of genes oriented to 
the welfare of others. The distinguished sociobiologist E. O. 
Wilson states the problem directly: “How then does altruism 
persist?” (1978, p. 153). An entomologist of note, Wilson 
had no hesitation in claiming that altruism not only persists, 
but also is of paramount significance in the lives of social 
insects. In accord with his sociobiological thesis, he illus- 
trates the presence of altruism in animals as diverse as birds, 
deer, porpoises, and chimpanzees, which share food and 
provide mutual defense—for example, to protect the 
colony’s hives, bees enact behaviors that lead invariably to 
their deaths. 

Two underlying mechanisms have been proposed to ac- 
count for cooperative behaviors such as altruism. One derives 
from the concept of inclusive fitness, briefly described in pre- 
ceding paragraphs; E. O. Wilson (1978) terms this form of 
cooperative behavior hard-core altruism, by which he means 
that the act is “unilaterally directed” for the benefit of others 
and that the bestower neither expects nor expresses a desire 
for a comparable return. Following the line of reasoning orig- 
inally formulated by Hamilton (1964), J. P. Rushton (1984), 
a controversial Canadian researcher who has carried out illu- 
minating r-K studies of human behavior, explicates this 
mechanism as follows: 


Individuals behave so as to maximize their inclusive fitness 
rather than only their individual fitness; they maximize the pro- 
duction of successful offspring by both themselves and their rel- 
atives. ... Social ants, for example, are one of the most altruistic 
species so far discovered. The self-sacrificing, sterile worker and 
soldier ants . . . share 75% of their genes with their sisters and so 
by devoting their entire existence to the needs of others . . . they 
help to propagate their own genes. (p. 6) 


The second rationale proposed as the mechanism underly- 
ing other-oriented and cooperative behaviors Wilson terms 
soft-core altruism to represent his belief that the bestower’s 
actions are ultimately self-serving. The original line of rea- 
soning here stems from Trivers’s (1971) notion of reciprocity, 
a thesis suggesting that genetically based dispositions to 


cooperative behavior can be explained without requiring the 
assumption of kinship relatedness. All that is necessary is that 
the performance of cooperative acts be mutual—that is, result 
in concurrent or subsequent behaviors that are comparably 
beneficial in terms of enhancing the original bestower’s sur- 
vivability, reproductive fertility, or both. 

E. O. Wilson’s (1978) conclusion that the self-other 
dimension is a bedrock of evolutionary theory is worth 
quoting: 


In order to understand this idea more clearly, return with me for 
a moment to the basic theory of evolution. Imagine a spectrum of 
self-serving behavior. At one extreme only the individual is 
meant to benefit, then the nuclear family, next the extended fam- 
ily (including cousins, grandparents, and others who might play 
a role in kin selection), then the band, the tribe, chiefdoms, and 
finally, at the other extreme, the highest sociopolitical units. 
(p. 158) 


Intriguing data and ideas have been proposed by several re- 
searchers seeking to identify specific substrates that may re- 
late to the other-oriented polarities. In what has been termed 
the affiliation-attachment drive, Everly (1988), for example, 
provides evidence favoring an anatomical role for the cingu- 
late gyrus. Referring to the work of Henry and Stephens 
(1977), MacLean (1985), and Steklis and Kling (1985), 
Everly concludes that the ablation of the cingulate elimi- 
nates both affiliative and grooming behaviors. The proximal 
physiology of this drive has been hypothesized as including 
serotonergic, noradrenergic, and opoid neurotransmission 
systems (Everly, 1988; Redmond, Maas, & Kling, 1971). 
MacLean (1985) has argued that the affiliative drive may be 
phylogenically coded in the limbic system and may under- 
gird the concept of family in primates. The drive toward 
other-oriented behaviors, such as attachment, nurturing, 
affection, reliability, and collaborative play, has been re- 
ferred to as the “cement of society” by Henry and Stevens 
(1977). 

Let us move now to the realm of psychological and social 
proposals. Dorothy Conrad (1952) specified a straightfor- 
ward list of constructive behaviors that manifest “reproduc- 
tive nurturance” in the interpersonal sphere. She records 
them as follows: 


Has positive affective relationship: The person who is able to re- 
late affectively to even one person demonstrates that he is poten- 
tially able to relate to other persons and to society. 

Promotes another’s welfare: Affective relationships make it 
possible for the person to enlarge his world and to act for the 
benefit of another, even though that person may profit only 
remotely. 
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Works with another for mutual benefit: The person is largely 
formed through social interaction. Perhaps he is most completely 
a person when he participates in a mutually beneficial relation- 
ship. (pp. 456-457) 


More eloquent proposals of a similar prosocial character 
have been formulated by the noted psychologists Maslow, 
Allport, and Fromm. 

According to Maslow, after humans’ basic safety and se- 
curity needs are met, they next turn to satisfy the belonging 
and love needs. Here we establish intimate and caring rela- 
tionships with significant others in which it is just as impor- 
tant to give love as it is to receive it. Noting the difficulty in 
satisfying these needs in our unstable and changing modern 
world, Maslow sees the basis here for the immense popular- 
ity of communes and family therapy. These settings are ways 
to escape the isolation and loneliness that result from our fail- 
ures to achieve love and belonging. 

One of Allport’s criteria of the mature personality, which 
he terms a warm relating of self to others, refers to the capa- 
bility of displaying intimacy and love for a parent, child, 
spouse, or close friend. Here the person manifests an authen- 
tic oneness with the other and a deep concern for his or her 
welfare. Beyond one’s intimate family and friends, there is an 
extension of warmth in the mature person to humankind at 
large, an understanding of the human condition, and a kinship 
with all peoples. 

To Fromm, humans are aware of the growing loss of their 
ties with nature as well as with each other, feeling increas- 
ingly separate and alone. Fromm believes humans must pur- 
sue new ties with others to replace those that have been lost 
or can no longer be depended upon. To counter the loss of 
communion with nature, he feels that health requires that we 
fulfill our need by a brotherliness with mankind and a sense 
of involvement, concern, and relatedness with the world. And 
with those with whom ties have been maintained or reestab- 
lished, humans must fulfill their other-oriented needs by 
being vitally concerned with their well-being as well as fos- 
tering their growth and productivity. 

In a lovely coda to a paper on the role of evolution in 
human behavior, Freedman and Roe (1958) wrote: 


Since his neolithic days, in spite of his murders and wars, his 
robberies and rapes, man has become a man-binding and a time- 
binding creature. He has maintained the biological continuity of 
his family and the social continuity of aggregates of families. He 
has related his own life experiences with the social traditions of 
those who have preceded him, and has anticipated those of his 
progeny. He has accumulated and transmitted his acquired goods 
and values through his family and through his organizations. He 
has become bound to other men by feelings of identity and by 
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shared emotions, by what clinicians call empathy. His sexual 
nature may yet lead him to widening ambits of human affection, 
his acquisitive propensities to an optimum balance of work and 
leisure, and his aggressive drives to heightened social efficiency 
through attacks on perils common to all men. (p. 457) 


The pathological consequences of a failure to embrace the 
polarity criterion of others are seen most clearly in the per- 
sonality maladaptations termed antisocial and narcissistic 
disorders. Both personalities exhibit an imbalance in their 
replication strategy; in this case, however, there is a primary 
reliance on self rather than others. They have learned that 
reproductive success as well as maximum pleasure and min- 
imum pain is achieved by turning exclusively to themselves. 
The tendency to focus on self follows two major lines of 
development. 

In the narcissistic personality maladaptive style, develop- 
ment reflects the acquisition of a self-image of superior worth. 
Providing self-rewards is highly gratifying if one values one- 
self or possesses either a real or inflated sense of self-worth. 
Displaying manifest confidence, arrogance, and an exploitive 
egocentricity in social contexts, this individual believes he or 
she already has all that is important—him- or herself. 

Narcissistic individuals are noted for their egotistical self- 
involvement, experiencing primary pleasure simply by pas- 
sively being or attending to themselves. Early experience 
has taught them to overvalue their self-worth; this confidence 
and superiority may be founded on false premises, however— 
it may be unsustainable by real or mature achievements. 
Nevertheless, they blithely assume that others will recognize 
their special-ness. Hence they maintain an air of arrogant self- 
assurance, and without much thought or even conscious in- 
tent, benignly exploit others to their own advantage. Although 
the tributes of others are both welcome and encouraged, their 
air of snobbish and pretentious superiority requires little con- 
firmation either through genuine accomplishment or social 
approval. Their sublime confidence that things will work out 
well provides them with little incentive to engage in the reci- 
procal give and take of social life. 

Those clinically designated as antisocial personalities 
counter the indifference or the expectation of pain from 
others; this is done by actively engaging in duplicitous or 
illegal behaviors in which they seek to exploit others for self- 
gain. Skeptical regarding the motives of others, they desire 
autonomy and wish revenge for what are felt as past injus- 
tices. Many are irresponsible and impulsive, behaviors they 
see as justified because they judge others to be unreliable and 
disloyal. Insensitivity and ruthlessness with others are the 
primary means they have learned to head off abuse and 
victimization. 


In contrast to the narcissistic form of maladaptation, the 
antisocial pattern of self-orientation develops as a form of 
protection and counteraction. These styles turn to themselves 
first to avoid the depredation they anticipate, and second to 
compensate by furnishing self-generated rewards in their 
stead. Learning that they cannot depend on others, individu- 
als with these personality styles counterbalance loss not only 
by trusting themselves alone, but also by actively seeking 
retribution for what they see as past humiliations. Turning 
to self and seeking actively to gain strength, power, and re- 
venge, they act irresponsibly, exploiting and usurping what 
others possess as just reprisals. Their security is never fully 
assured, however, even when they have aggrandized them- 
selves beyond their lesser origins. 

In both narcissistic and antisocial personality styles, we 
see maladaptations arising from an inability to experience a 
constructive love for others. For the one, there is an excessive 
self-centeredness; for the other, there is the acquisition of a 
compensatory destructiveness driven by a desire for social 
retribution and self-aggrandizement. 


Realizing One’s Potentials: The Self-Actualizing 
Attribute. The converse of other-nurturance is not self- 
propagation, but rather the lack of other-nurturance. Thus, to 
fail to love others constructively does not assure the actualiza- 
tion of one’s potentials. Both may and should exist in normal, 
healthy individuals. Although the dimension of self-other is 
arranged to highlight its polar extremes, it should be evident 
that many if not most behaviors are employed to achieve the 
goals of both self- and kin reproduction. Both ends are often 
simultaneously achieved; at other times one may predomi- 
nate. The behaviors comprising these strategies are driven, 
so to speak, by a blend of activation and affect—that is, com- 
binations arising from intermediary positions reflecting 
both the life enhancement and life preservation polarity of 
pleasure-pain, interwoven with similar intermediary positions 
on the ecological accommodation and ecological modifica- 
tion polarity of activity-passivity. Phrasing replication in 
terms of the abstruse and metaphorical constructs does not ob- 
scure it, but rather sets this third polarity on the deeper foun- 
dations of existence and adaptation, foundations composed of 
the first two polarities previously described. 

At the self-oriented pole, Everly (1988) proposes an 
autonomy-aggression biological substrate that manifests it- 
self in a strong need for control and domination as well as in 
hierarchical status striving. According to MacLean (1986), it 
appears that the amygdaloid complex may play a key role in 
driving organisms into self-oriented behaviors. Early studies 
of animals with ablated amygdalas showed a notable increase 
in their docility (Kluver & Bucy, 1939), just as nonhuman 


primates have exhibited significant decreases in social hier- 
archy status (Pribram, 1962). Although the evidence remains 
somewhat equivocal, norepinephrine and dopamine seem to 
be the prime neurotransmitters of this drive; the testosterone 
hormone appears similarly implicated (Feldman & Quenzar, 
1984). 

Regarding psychological constructs that parallel the no- 
tion of self-actualization, their earliest equivalent was in the 
writings of Spinoza (1677/1986), who viewed development 
as that of becoming what one was intended to be and nothing 
other than that, no matter how exalted the alternative might 
appear to be. 

Carl Jung’s (1961) concept of individuation shares impor- 
tant features with that of actualization in that any deterrent to 
becoming the individual one may have become would be det- 
rimental to life. Any imposed “collective standard is a serious 
check to individuality,” injurious to the vitality of the person, 
a form of “artificial stunting.” 

Perhaps it was my own early mentor, Kurt Goldstein 
(1939), who first coined the concept under review with the 
self-actualization designation. As he phrased it, “There is 
only one motive by which human activity is set going: the ten- 
dency to actualize oneself” (1939, p. 196). 

The early views of Jung and Goldstein have been enriched 
by later theorists, notably Fromm, Perls, Rogers, and Maslow. 

Focusing on what he terms the sense of identity, Fromm 
(1955) spoke of the need to establish oneself as a unique 
individual, a state that places the person apart from others. 
Further—and it is here where Fromm makes a distinct self- 
oriented commitment—the extent to which this sense of 
identity emerges depends on how successful the person is in 
breaking “incestuous ties” to one’s family or clan. Persons 
with well-developed feelings of identity experience a feeling 
of control over their lives rather than a feeling of being con- 
trolled by the lives of others. 

Perls (1969) enlarged on this theme by contrasting self- 
regulation versus external regulation. Normal, healthy persons 
do their own regulating, with no external interference, be it the 
needs and demands of others or the strictures of a social code. 
What we must actualize is the true inner self, not an image we 
have of what our ideal selves should be. That is the “curse of 
the ideal.” To Perls, each must be what he or she really is. 

Following the views of his forerunners, Maslow (1968) 
stated that self-actualization is the supreme development and 
use of all our abilities, ultimately becoming what we have the 
potential to become. Noting that self-actualists often require 
detachment and solitude, Maslow asserted that such persons 
are strongly self-centered and self-directed, make up their 
own minds, and reach their own decisions without the need to 
gain social approval. 
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In like manner, Rogers (1963) posited a single, overreach- 
ing motive for the normal, healthy person—maintaining, ac- 
tualizing, and enhancing one’s potential. The goal is not that 
of maintaining a homeostatic balance or a high degree of ease 
and comfort, but rather to move forward in becoming what is 
intrinsic to self and to enhance further that which one has al- 
ready become. Believing that humans have an innate urge to 
create, Rogers stated that the most creative product of all is 
one’s own self. 

Where do we see failures in the achievement of self- 
actualization, a giving up of self to gain the approbation of 
others? Two maladaptive personality styles can be drawn 
upon to illustrate forms of self-denial. 

Those with dependent personalities have learned that feel- 
ing good, secure, confident, and so on—that is, those feelings 
associated with pleasure or the avoidance of pain—is pro- 
vided almost exclusively in their relationship with others. Be- 
haviorally, these persons display a strong need for external 
support and attention; should they be deprived of affection 
and nurturance, they will experience marked discomfort, if 
not sadness and anxiety. Any number of early experiences 
may set the stage for this other-oriented imbalance. Depen- 
dent individuals often include those who have been exposed 
to an overprotective training regimen and who thereby fail to 
acquire competencies for autonomy and initiative; experienc- 
ing peer failures and low self-esteem leads them to forego at- 
tempts at self-assertion and self-gratification. They learn 
early that they themselves do not readily achieve rewarding 
experiences; these experiences are secured better by leaning 
on others. They learn not only to turn to others as their source 
of nurturance and security, but also to wait passively for oth- 
ers to take the initiative in providing safety and sustenance. 
Clinically, most are characterized as searching for relation- 
ships in which others will reliably furnish affection, protec- 
tion, and leadership. Lacking both initiative and autonomy, 
they assume a dependent role in interpersonal relations, ac- 
cepting what kindness and support they may find and will- 
ingly submitting to the wishes of others in order to maintain 
nurturance and security. 

A less benign but equally problematic centering on the 
wishes of others and the denial of self is seen in what is termed 
clinically as the obsessive-compulsive personality. These per- 
sons display a picture of distinct other-directedness—a con- 
sistency in social compliance and interpersonal respect. Their 
histories usually indicate having been subjected to constraint 
and discipline when they transgressed parental strictures and 
expectations. Beneath the conforming other-oriented veneer, 
they exhibit intense desires to rebel and assert their own self- 
oriented feelings and impulses. They are trapped in an am- 
bivalence; to avoid intimidation and punishment they have 
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learned to deny the validity of their own wishes and emotions 
and instead have adopted as true the values and precepts set 
forth by others. The disparity they sense between their own 
urges and the behaviors they must display to avoid condem- 
nation often leads to omnipresent physical tensions and rigid 
psychological controls. 

Readers who have reached this final paragraph on the 
basic three polarities that undergird all physical forms and or- 
ganic species should have a foundation to move onto our next 
series of polarities, those which are distinctly human—that 
is, these polarities relate to personality attributes found al- 
most exclusively in the human species that set us off from all 
earlier forms of evolution and that pertain to the higher pow- 
ers and adaptive functions of abstraction and their constit- 
uent cognitive modes. 


THE DISTINCTLY HUMAN POLARITIES 
OF EVOLUTION 


This group of personality attributes incorporates the sources 
employed to gather knowledge about the experience of life 
and the manner in which this information is registered and 
transformed. Here, we are looking at styles of cognizing— 
differences (first) in what people attend to in order to 
learn about life, and (second) how they process information: 
what they do to record this knowledge and make it useful to 
themselves. 


Predilections of Abstraction 


The cognitive features of intelligence are judged by me to be 
central elements in personological derivations. Comprising 
the fourth and most recent stage of evolution, they comprise 
the reflective capacity to transcend the immediate and con- 
crete, they interrelate and synthesize the diversity of experi- 
ence, they represent events and processes symbolically, they 
weigh, reason, and anticipate; in essence, they signify a quan- 
tum leap in evolution’s potential for change and adaptation. 

Cognitive differences among individuals and the manner 
in which they are expressed have been much overlooked in 
generating and appraising personality attributes. With an oc- 
casional notable exception or two, little of the recent so- 
called revolution in cognitive science that has profoundly 
affected contemporary psychology has impacted the study of 
personology. Historically, the realms of intellect, aptitude, 
and ability have not been considered to be personality-related 
spheres of study. 

In my view, personology should be broadened to encom- 
pass the whole person, an organically unified and unseg- 
mented totality. Consequently, cognitive dimensions and 


their various styles not only should be included, but also may 
have a significance equal to that of other functions as a source 
of personality attributes (Millon, 1990). Unfortunately, the 
various features comprising cognitive abstraction have only 
rarely been included as components in personality-oriented 
concepts and appraisals. 

Emancipated from the real and present, unanticipated pos- 
sibilities and novel constructions may routinely be created 
cognitively. The capacity to sort, to recompose, to coordinate, 
and to arrange the symbolic representations of experience 
into new configurations is in certain ways analogous to the 
random processes of recombinant replication, but processes 
enabling manipulation of abstractions are more focused and 
intentional. To extend this rhetorical liberty, replication is the 
recombinant mechanism underlying the adaptive progression 
of phylogeny, whereas abstraction is the recombinant mecha- 
nism underlying the adaptive progression of ontogeny. The 
powers of replication are limited, constrained by the finite 
potentials inherent in parental genes. In contrast, experi- 
ences, abstracted and recombined, are infinite. 

Over one lifetime, innumerable events of a random, logi- 
cal, or irrational character transpire, are construed, and are re- 
formulated time and again—some of which prove more and 
others less adaptive than their originating circumstances may 
have called forth. Whereas the actions of most nonhuman 
species derive from successfully evolved genetic programs, 
activating behaviors of a relatively fixed nature suitable for a 
modest range of environmental settings, the capabilities of 
both implicit and intentional abstraction that characterize 
humans give rise to adaptive competencies that are suited to 
radically divergent ecological circumstances, circumstances 
that themselves may be the result of far-reaching acts of sym- 
bolic and technological creativity. 

Although what underlies our self- versus other-oriented 
attributes stems from differential replication strategies, the 
conscious state of knowing self as distinct from others is a 
product of the power of abstraction, the most recent phase of 
evolution’s procession. The reflective process of turning in- 
ward and recognizing self as an object—no less to know one- 
self, and further, to know that one knows—is a uniqueness 
found only among humans. Doubling back on oneself, so to 
speak, creates a new level of reality, consciousness that im- 
bues self and others with properties far richer and more sub- 
tle than those that derive from strategies of reproductive 
propagation and nurturance alone. 

The abstracting mind may mirror outer realities but recon- 
structs them in the process, reflectively transforming them 
into subjective modes of phenomenological reality, making 
external events into a plastic mold subject to creative designs. 
Not only are images of self and others emancipated from 


direct sensory realities, becoming entities possessing a life of 
their own, but contemporaneous time may also lose its im- 
mediacy and impact. The abstracting mind brings the past ef- 
fectively into the present, and its power of anticipation brings 
the future into the present as well. With past and future em- 
bedded in the here and now, humans can encompass at once 
not only the totality of our cosmos, but also its origins and na- 
ture, its evolution, and how they have come to pass. Most im- 
pressive of all are the many visions humans have of life’s 
indeterminate future, where no reality as yet exists. 

Four polarities constitute this distinctly human abstraction 
function. The first two pairs refer to the information sources 
that provide cognitions. One set of contrasting polarities ad- 
dresses the orientation either to look outward, or external-to- 
self, in seeking information, inspiration, and guidance, versus 
the orientation to turn inward, or internal-to-self. The second 
set of abstraction polarities contrasts predilections for either 
direct observational experiences of a tangible, material, and 
concrete nature with those geared more toward intangible, 
ambiguous, and inchoate phenomena. 

The third and fourth set of abstraction polarities relate to 
cognitive processing—that is, the ways in which people eval- 
uate and mentally reconstruct information and experiences 
after they have been apprehended and incorporated. The first 
of these sets of cognitive polarities differentiates processes 
based essentially on ideation, logic, reason, and objectivity 
from those that depend on emotional empathy, personal val- 
ues, sentiment, and subjective judgments. The second set of 
these polarities reflects either a tendency to make new infor- 
mation conform to preconceived knowledge, in the form of 
tradition-bound, standardized, and conventionally structured 
schemas, versus the opposing inclination to bypass precon- 
ceptions by distancing from what is already known and in- 
stead to create innovative ideas in an informal, open-minded, 
spontaneous, individualistic, and often imaginative manner. 

Cognitive functions are consonant with our earlier bioso- 
cial formulations concerning the architecture of human func- 
tioning (Millon, 1990) because we see cognitive processes to 
be an essential component of our fourfold model regarding 
how organisms approach their environments. Beyond the 
driving motivational elements of personality style (as in my 
formulation of the personality disorders), or the factorial 
structure of personality (e.g., as explicated in the Big Five 
model), we seek to conjoin all components of personality 
style by linking and integrating the various expressions and 
functions of personality into an overarching and coherent 
whole. 

Several polar dimensions have been proposed through the 
years as the basis for a schema of cognitive styles. Contrast- 
ing terms such as leveling versus sharpening, narrow versus 
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broad, analytic versus synthetic, constricted versus flexible, 
inductive versus deductive, abstract versus concrete, and 
convergent versus divergent have been used to illustrate the 
stylistic differences among cognitive functions. Although 
each of these pairs contributes to distinctions of importance 
in describing cognitive processes, few were conceptualized 
with personality differences in mind, although some may 
prove productive in that regard. 

As noted above, the model formulated by the author sepa- 
rates cognitive activities into two superordinate functions. The 
first pertains to the contrasting origins from which cognitive 
data are gathered, or what may be termed information sources; 
the second pertains to the methods by which these data are re- 
constructed by the individual, or what we label transforma- 
tional processes. These two functions—the initial gathering 
and subsequent reconstruction of information—are further 
subdivided into two polarities each. As is elaborated later in 
this chapter, the sources of information are separated into 
(a) external versus internal and (b) tangible versus intangible. 
Transformational processes are divided into (a) ideational 
versus emotional and (b) integrative versus imaginative. The 
resulting four personality attributes are by no means exhaus- 
tive. Rather surprisingly, they turn out to be consonant with a 
model formulated in the 1920s by Jung (197 1a). 


Sources of Information 


Information may be seen as the opposite of entropy. What en- 
ergy or nutrients are to physical systems, information is to 
cognitive systems. A physical system sustains itself by suck- 
ing order, so to speak, from its environs, taking in energy or 
nutrients and transforming them to meet tissue needs; a cog- 
nitive system does something similar by sucking information 
from its environs—that is, taking in data and transforming 
them to meet its cognitive needs. In much the same way as 
any other open system, a cognitive structure needs to main- 
tain itself as an integrated and cohesive entity. In the physical 
world, the integrity of a system is achieved by making adap- 
tations that preserve and enhance the physical structure, 
thereby precluding the entropic dissipation of its ordered ele- 
ments. Similarly, a cognitive system achieves its integrity 
through a variety of preserving and enhancing adaptations 
that reduce the likelihood of events that may diminish the 
order and coherence of its knowledge base. 

Moreover, an open cognitive system is purposefully fo- 
cused, as is a physical system. Just as a physical system must 
be selective about its nutrition sources in order to find those 
suitable to meet its tissue needs, so, too, must a cognitive 
system be selective about information sources, choosing 
and processing particular raw inputs according to specific 
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cognitive goals. A cognitive system can no more process ran- 
dom input than a physical system can ingest random material. 
Hence, information (negative entropy) must be acquired se- 
lectively rather than randomly or diffusely; some sources of 
information will be heeded and others ignored or suppressed. 

Coherence may be optimized by adopting and maintaining 
a preferred and regular information source, thereby ensuring a 
consistent confirmatory bias in favor of a cognitive structure’s 
world view and organizational architecture. Conversely, a 
cognitive structure that is exposed to dissonant or contradic- 
tory sources or that heeds diverse or multitudinous sources ul- 
timately may be challenged successfully or may be exhausted 
beyond its ability to maintain coherence. In other words, bur- 
densome processing and discordant sources are likely to re- 
sult in increasing cognitive entropy. A more structured and 
coherent focus that strengthens and confirms prior sources of 
information becomes useful in ensuring optimal cognitive 
survivability. 


External Versus Internal Orientation Polarity: The 
Extraceptive and Intraceptive Attributes. In light of the 
preceding argument, we see two primary stimulative sources 
of information, that which originates external to the self 
and that which originates internally. Whether this polar cog- 
nitive orientation is termed external versus internal, extracep- 
tive versus intraceptive, or extraversing versus introversing, 
each polarity provides a replicable reservoir for cognitive 
information—a selectively narrowed wellspring of knowl- 
edge to which the person will continue to be exposed. 

A few lines paraphrasing Jung, the originator of the 
extraversing-introversing dimension, may be of value in 
highlighting core features of the externally oriented prefer- 
ence. Extraversion, from Jung’s view, was centered in an in- 
terest in the external object noted by a ready acceptance of 
external happenings, a desire to influence and be influenced 
by external events, a need to join in, and the capacity not only 
to endure the bustle and noise of every kind, but actually find 
them enjoyable (Jung, 197 1a). 

Similarly, Jung clearly states a view paralleling ours in 
what we have termed the internal orientation. To Jung, the 
introverted person is “not forthcoming”; he or she “retreats 
before the external object.” Such an individual is aloof from 
external happenings and does not join in. Self-communings 
are a pleasure and the introverted individual experiences his 
or her own world as a safe harbor, a “carefully tended and 
walked-in garden, closed to the public and hidden from pry- 
ing eyes.” The internally oriented person’s own company is 
best. One who is internally oriented feels at home in one’s 
own world, a place where changes are made only by oneself. 
Most significantly, the best work of such individuals is done 


with their own resources, on their own initiative, and in their 
own way (Jung, 1971b). 


Tangible Versus Intangible Disposition Polarity: The 
Realistic and Intuitive Attributes. Information, whether 
its source is internal or external to the self, can be classified 
in numerous ways. A core distinction can be drawn between 
information that is tangible versus that which is intangible. 
By tangible we mean identifiable by human sensory capaci- 
ties, well-defined, distinctive, recognizable, and knowable— 
referring to phenomena that are concrete, factual, material, 
realistic, or self-evident. In contrast, information that is 
termed intangible takes in phenomena that lack an intrinsi- 
cally distinctive order and structural clarity; they are inher- 
ently ambiguous, abstract, insubstantial, vague, mysterious, 
and obscure. Such phenomena usually can be fathomed only 
by means that are unknown, unconscious, and percipient, or 
by glimmerings into their diffuse and elusive nature that are 
materially tenuous or psychical in form. 

The readiness of some individuals to be receptive to infor- 
mation that is well-structured and tangible, and of others to 
receive information that is obscure and intangible, consti- 
tutes, in our view, a fundamental difference in cognitive style 
that is of appreciable personological significance. Although 
Jung’s language is only tangentially formulated in cognitive 
terms, close parallels can be seen between the polarity pre- 
sented here and that offered by Jung in his distinction between 
Sensing and Intuiting. As Jung (1933) wrote decades ago: 


Here we should speak of sensation when sense impressions are 
involved, and of intuition if we are dealing with a kind of per- 
ception which cannot be traced back directly to conscious sen- 
sory experience. Hence, I define sensation as perception via con- 
scious sensory functions, and intuition as perception via the 
unconscious. (pp. 538-539) 


Favoring tangible, structured, and well-defined sources of 
information that call upon one’s five senses will no doubt cor- 
relate with a wide range of associated behaviors, such as 
choosing actions of a pragmatic and realistic nature, prefer- 
ring events in the here and now, and attending to matters call- 
ing for facts and quantitative precision. 

Jung conceived what we would term the tangible disposi- 
tion as the fact-minded men in whom intuition is “driven 
into the background by actual facts.” In contrast, those prefer- 
ring the intangible, unstructured, and ambiguous world of 
information are likely to be inspired by possibilities, by chal- 
lenges, and potentials of an abstract, connotative, and symbolic 
character, as well as by matters that depend on mystery and 
speculation. In Jung’s words, “for these persons, actual reality 


counts only insofar as it harbors possibilities, regardless of the 
way things are in the actual present” (Jung, 1971b, p. 539). 


Transformational Processes 


The first two pairs of cognitive functions were grouped ac- 
cording to attributes that signify choices among the sources 
and styles of gathering information. These next two pairs of 
attribute polarities represent amplification preferences and 
transformational processes, referring to what is done to infor- 
mation after it has been received. Cognitive science has artic- 
ulated a number of concepts related to the registering, 
encoding, and organizing of life experiences. These concepts 
pertain to various questions, such as Through what cognitive 
mode will information be received and amplified—intellective 
or affective? and How shall information be organized; will it 
be assimilated into preformed memory systems or will it be 
recast through imagination into novel schemas? Although 
individuals may be positioned on several other continua or 
polarities—for example, convergent versus divergent, serial 
versus hierarchical, primary versus secondary, verbal versus 
visual—it is the author’s view that the most fruitful cognitive 
distinctions relevant to personality are the pairs selected in 
this and the following section. 


Ideational Versus Emotional Preference Polarity: The 
Intellective and Affective Attributes. Stated simply, there 
are essentially two pathways through which experiences pass 
once recorded by our consciousness or by our senses, if they 
are of sufficient magnitude to activate an encoded response. 
The first pathway accentuates information that is conceptual 
and logical, eliciting a reasoned judgment that signifies in an 
articulate and organized way that the registered experience 
makes sense—that is, it is rationally consistent and coherent. 
The second pathway resonates an emotional response, a sub- 
jective feeling reaction, signaling in a somewhat diffuse and 
global way that the registered event was experienced either as 
affectively neutral, clearly positive, or distinctly negative. 

The ideational pole indicates a preference and elaboration 
of experience in light of reason and logic. Although life 
events may derive from internal or external sources and may 
be of a tangible or intangible nature, the interpretive and 
evaluative process is inclined toward and augments the ob- 
jective and impersonal, as events are amplified by means of 
critical reason and intensified by the application of rational 
and judicious thought. By diminishing affective engage- 
ments—reducing the unruly emotional input of others or the 
upsetting effects of one’s own affective state—the preference 
is to sustain and strengthen a high degree of cognitive logic 
and cohesion. Objective analysis and affective detachment 
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protect against unwanted incursions upon intellectual ratio- 
nality, but often at the price of promoting processes that tend 
to be rigid, overcontrolled, and unyielding. 

In contrast, experiences processed and amplified emotion- 
ally activate subjective states, such as liking versus disliking, 
feeling good versus feeling bad, comfort versus discomfort, 
attracted versus repelled, valuing versus devaluing, and so 
on. Through empathic resonance, the route of enhanced af- 
fectivity inclines the individual to record not so much what 
other people think but rather how they feel. The individual 
who inclines toward the affective attribute uses feeling vibra- 
tions to learn more from the melodic tone that words convey 
than from their content or logic. The usual modality for those 
who exhibit an affective bent is that of a subjective reality, a 
series of more-or-less gut reactions composed of either global 
or differentiated positive or negative moods. For the most 
part, the affective amplification style indicates individuals 
who evince modest introspective analyses, who show an 
open and direct empathic response to others, and who have a 
subconscious susceptibility to the emotional facets of experi- 
ence in as pure a manner as possible. 


Integrating Versus Innovating Bias Polarity: The 
Assimilative and Imaginative Attributes. The second 
cognitive transformational polarity addresses the question of 
whether new information is shaped to fit preformed memory 
schemas (integrated within preexisting cognitive systems), or 
is organized through the imagination to be cast into innova- 
tive and creative forms. Evolutionary theory suggests that the 
best course may be to reinforce (cognitive) systems that have 
proved stable and useful. On the other hand, progress will not 
be made unless promising new possibilities are explored. A 
beneficial tension in evolution clearly exists between conser- 
vation and change, between that of adhering to the habitual 
and that of unleashing the creative. These two contrasting 
cognitive biases demonstrate the two options—integrating 
experiences into already established systems versus explor- 
ing innovative ways to structure them. 

Assimilators are akin in certain features to persons with 
well-structured memory systems to which they routinely at- 
tach new cognitive experiences. Disposed to operate within 
established perspectives, assimilators integrate new informa- 
tion to fit previous points of view, exhibiting thereby a high 
degree of dependability and consistency, if not rigidity, in 
their functioning. Typically, such people are predictable, con- 
ventional, orderly, systematic, decisive, methodical, exact- 
ing, formal, disciplined, conscientious, faithful, loyal, and 
devoted. Hence, in evolutionary terms, the integrating polar- 
ity leads to continuity and tradition, or to the maintenance of 
existing levels of cognitive entropy; this cognitive style 
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promotes an architectural cohesion that remains unchal- 
lenged by variations that could be risky (i.e., potentially di- 
minish established levels of order). 

In contrast, those functioning at the innovating pole are 
characterized by an openness to forming new and imagina- 
tive cognitive constructions of a more-or-less impromptu 
character. They are inclined to search for and enjoy creative 
ideas and solutions, to find novel ways to order information 
and to accumulate negative entropy, so to speak, by stepping 
outside of what is known and given in order to establish a 
new and potentially higher level of cognitive organization. 
Innovators stretch beyond confirmed perspectives, seek to 
broaden interpretations of experience, and are not concerned 
with demonstrating their reliability. The imaginative attribute 
is typically associated with being open-minded, spontaneous, 
extemporaneous, informal, adaptable, flexible, resilient, im- 
pressionable, creative, inventive, and resourceful. 

It is to those who combine these latter two persuasions 
that the present chapter is heartily addressed. 
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During much of its past, psychology represented a culturally 
grounded enterprise that took into account the constitutive 
role of cultural meanings and practices in human develop- 
ment. Yet, as recent historical accounts make clear (Jahoda, 
1993), this attention to culture was muted during the twentieth 
century, with psychology dominated by an idealized physical- 
science model of explanation. This has given rise to the 
enigma that psychologists find it “difficult to keep culture in 
mind,” noted by Cole (1996): 


On the one hand, it is generally agreed that the need and ability to 
live in the human medium of culture is one of the central charac- 
teristics of human beings. On the other hand, it is difficult for many 
academic psychologists to assign culture more than a secondary, 
often superficial role in the constitution of our mental life. (p. 1) 


From this type of perspective, which dominates the field, cul- 
ture is seen as at most affecting the display of individual psy- 
chological processes, but not as impacting qualitatively on 
their form. 

However, although culture thus remains in a peripheral 
role in the contemporary discipline, recent years have seen a 
reemergence of interest in cultural approaches and an in- 
creased recognition of their importance to psychological the- 
ory. As reflected in the interdisciplinary perspective of 
cultural psychology (e.g., Cole, 1990; Greenfield, 1997; J. G. 
Miller, 1997; Shweder, 1990), culture and psychology are 
coming to be understood as mutually constitutive processes. It 
is recognized that human development occurs in historically 
grounded social environments that are structured by cultural 
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meanings and practices. Cultural meanings and practices are 
themselves understood to be dependent on the subjectivity of 
communities of intentional agents. By affecting individuals’ 
understandings and intentions, cultural meanings and prac- 
tices, in turn, are recognized to have a qualitative impact on 
the development of psychological phenomena and to be inte- 
gral to the formulation of basic psychological theory. 

The goal of the present chapter is to highlight some of the 
insights for understanding personality and social psychology 
that emerge from a consideration of the cultural grounding of 
psychological processes. The first section of the chapter con- 
siders factors that have contributed to the downplaying of 
culture in mainstream social psychology and the assumptions 
that guided some of the earliest research in the traditions of 
cross-cultural psychology. In the second section, considera- 
tion is given to key conceptual developments underlying cul- 
tural psychology, recent empirical findings that illustrate the 
existence of cultural variation in basic social psychological 
processes, and challenges for future theory and research. In 
conclusion, consideration is given to the multiple contribu- 
tions of a cultural perspective in psychology. 


APPROACHES TO CULTURE IN MAINSTREAM 
SOCIAL PSYCHOLOGY AND IN EARLY 
CROSS-CULTURAL PSYCHOLOGY 


The present section provides an overview of shifts in the role 
accorded to culture in psychological theory over time, and it 
outlines some of the changing conceptual understandings and 
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disciplinary practices that are affecting these shifts. The first 
section considers factors that are contributing to the tendency 
to assign cultural considerations a relatively peripheral role 
both in social psychology and more generally in the larger 
discipline. The second section provides an overview of some 
of the earliest traditions of cultural research in social psy- 
chology, highlighting respects in which this research, al- 
though groundbreaking in many respects, did not seriously 
challenge this tendency to downplay the importance of cul- 
ture in psychology. Finally, attention turns to the core as- 
sumptions of cultural psychology, assumptions that highlight 
the need to accord culture a more integral role in basic psy- 
chological theory. 


Downplaying of Culture in Mainstream 
Social Psychology 


Signs of the peripheral theoretical role accorded to cultural 
considerations in social psychology may be seen in its being 
downplayed in major social psychological publications. Text- 
books typically either leave the construct of culture theo- 
retically undefined, treat it as the same as the objective 
environment or social ecology, or approach it in an eclectic 
way that lacks conceptual clarity. Likewise, basic theory 
tends to be presented without any reference to cultural con- 
siderations. Culture is treated merely as a factor that influ- 
ences the universality of certain psychological effects but not 
as a process that must be taken into account to explain the 
form of basic psychological phenomena. One example of 
such a stance can be found in Higgins and Kruglanski’s 
(1996) recent handbook on basic principles of social psy- 
chology: The only citations for culture in the index—with 
only one exception—refer to pages within the single chapter 
on cultural psychology by Markus, Kitayama, and Heiman 
(1996), rather than to any of the other 27 chapters of the vol- 
ume. In the following discussion, we argue that this down- 
playing of culture in social psychology reflects to a great 
degree the tendency to conceptualize situations in culture- 
free terms, the embrace of an idealized natural-science model 
of explanation, and the default assumption of cultural homo- 
geneity that dominates the field. 


Culture-Free Approach to Situations 


A key contribution of social psychology—if not its signa- 
ture explanatory feature—is its recognition of the power of 
situations to impact behavior. Such a stance is reflected, for 
example, in a series of classic studies; salient examples in- 
clude the Milgram conformity experiment, which demon- 
strated that to conform with the orders of an experimenter, 


individuals were willing to inflict a harmful electric shock on 
a learner (Milgram, 1963), as well as the prison experiment of 
Zimbardo and his colleagues (Haney, Banks, & Zimbardo, 
1973), which demonstrated that individuals who had been 
thrust into the role relationships of guards and prisoners in a 
simulated prison behaved in ways that reflected these posi- 
tions, with the guards behaving abusively and the prisoners 
becoming passive. It also may be seen in recent lines of in- 
quiry on such topics as individuals’ limited conscious access 
to their cognitive processes, priming effects, and the mere ex- 
posure effect (Bargh, 1996; Bornstein, Kale, & Cornell, 
1990; Zimbardo, Banks, Haney, & Jaffe, 1973). Social psy- 
chological work of this type has shown that contexts affect 
behavior in ways that do not depend on conscious mediation 
and that may even violate individuals’ conscious expectations 
and motivational inclinations. 

Supplementing this focus on the power of situations to af- 
fect behavior, it has also been documented that individual dif- 
ferences influence the meaning accorded to situations. This 
attention to individual differences is evident not only in work 
on personality processes but also in the attention given to 
cognitive and motivational schemas as sources of individual 
variability in behavior. Individual difference dimensions, 
however, typically are accorded a secondary role to situa- 
tional influences within social psychological theory. They are 
believed to affect the display of certain basic psychological 
dimensions, but they are not often implicated in normative 
models of psychological phenomena. To give a representa- 
tive example of such a stance, the theory of communal and 
exchange relationships has been forwarded to distinguish 
qualitatively between relationships that are based on need 
versus those based on exchange considerations (Mills & 
Clark, 1982). In this model, individual differences are in- 
voked only in a descriptive sense (i.e., to distinguish between 
persons who are more or less likely to adopt each type of ori- 
entation) and not in a theoretical sense (that is, to identify dis- 
tinctive approaches to relationships beyond those specified in 
the original conceptual model). 

The crucial point is that the approach to situations that 
dominates social psychological inquiry treats contexts as 
presenting one most veridical structure that can be known 
through inductive or deductive information processing. No 
consideration is given to the possibility that culture is neces- 
sarily implicated in the definition of situations or that cul- 
tural presuppositions constitute prerequisites of what is 
considered objective knowledge. It is assumed that variability 
in judgment arises from differences in the information avail- 
able to individuals or from differences in their information- 
processing abilities, resulting in certain judgments’ being 
more or less cognitively adequate or veridical than others 
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(Nisbett & Ross, 1980). Evidence that individuals from differ- 
ent cultural backgrounds maintain contrasting systems of 
belief, value, or meaning—and that they interpret situations in 
contrasting ways—tends to be assimilated to an individual- 
difference dimension. It is viewed as implying that individual 
differences in attitudes, understandings, or available informa- 
tion may relate to cultural group membership, but not as 
implying that there is a need to give any independent weight to 
cultural meanings and practices per se in explanation. 

In maintaining the present realist approach to situations 
and in adopting explanatory frameworks focused on factors in 
the situation and in the person, cultural considerations are 
downplayed in theoretical importance. It is assumed that cul- 
tural information may substitute for or shortcut individual in- 
formation processing: The individual comes to learn about 
the world indirectly through acquiring the knowledge dissem- 
inated in the culture. As such, culture is viewed as providing 
information redundant with that which individuals could ob- 
tain by themselves through direct cognitive processing. Wells 
(1981), for example, maintains that enculturation processes 
are nonessential to individual knowledge acquisition: 


It is difficult for anyone who has raised a child to deny the per- 
vasive influence of socialized processing that surely surfaces as 
causal schemata originate through secondary sources such as 
parents . . . Even though socialized processing may be an impor- 
tant determinant of knowledge about causal forces at one level, it 
nevertheless begs the question. How is it that the parents knew 
an answer? The issue is circular. That is precisely the reason that 
one must consider a more basic factor-namely original process- 
ing. (p. 313) 


From the present type of perspective, cultural knowledge is 
seen as necessary neither to account for the nature of individ- 
ual knowledge nor to evaluate its adequacy. 


Natural Science Ideals of Explanation 


The tendency to downplay the importance of culture in social 
psychological theory also derives from the field’s embrace 
of an idealized physical-science model of explanation. Al- 
though social psychology makes use of multiple normative 
models of scientific inquiry, it has typically treated physical 
science models of scientific inquiry as the ideal approach. 
This has affected both the goals and methods of inquiry in 
ways that have tended to marginalize cultural approaches. 

In terms of explanatory goals, the foremost aim of psycho- 
logical explanation has been to identify universal laws of 
behavior. Adopting the criteria of parsimony and of predic- 
tive power as the hallmarks of a successful explanation, 


psychological inquiry has been conceptualized as involving 
the identification of deep structural explanatory mechanisms 
that (it is assumed) underlie overt behavior. Higgins and 
Kruglanski (1996) outline this vision for social psychological 
inquiry: 


A discovery of lawful principles governing a realm of phenom- 
ena is a fundamental objective of scientific research ... A useful 
scientific analysis needs to probe beneath the surface. In other 
words, it needs to get away from the ‘phenotypic’ manifestations 
and strive to unearth the ‘genotypes’ that may lurk beneath. . . . 
We believe in the scientific pursuit of the nonobvious. But less in 
the sense of uncovering new and surprising phenomena than 
in the sense of probing beneath surface similarities and differ- 
ences to discover deep underlying structures. (p. vii) 


From this perspective, the assumption is made that funda- 
mental psychological processes are timeless, ahistorical, and 
culturally invariant, with the principles of explanation in the 
social sciences no different from those in the natural or phys- 
ical sciences. 

From the present physical-science view of explanation, 
cultural considerations tend to be regarded as noise; they are 
consequently held constant in order to focus on identifying 
underlying processes. Malpass (1988) articulates this type of 
position: 


Cultural differences are trivial because they are at the wrong 
level of abstraction, and stand as ‘medium’ rather than ‘thing’ in 
relation to the objects of study. The readily observable differ- 
ences among cultural groups are probably superficial, and repre- 
sent little if any differences at the level of psychological 
processes. (p. 31) 


According to this perspective, an explanation that identifies a 
process as dependent on culturally specific assumptions is re- 
garded as deficient. To discover that a phenomenon is cultur- 
ally bound is to suggest that the phenomenon has not as yet 
been fully understood and that it is not yet possible to formu- 
late a universal explanatory theory that achieves the desired 
goals of being both parsimonious and highly general. 
Another consequence of the present physical-science 
model of explanation is that social psychology has tended to 
privilege laboratory-based methods of inquiry and to be dis- 
missive of what is perceived to be the inherent lack of 
methodological control of cultural research. Skepticism sur- 
rounds the issue of whether sufficient comparability can be 
achieved in assessments made in different cultural contexts to 
permit valid cross-cultural comparisons. Equally serious con- 
cerns are raised that methodological weaknesses are inherent 
in the qualitative methods that are frequently involved in 
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assessment of cultural meanings and practices. In particular, 
because such measures are at times based on analyses under- 
taken by single ethnographers or similar methods, measures 
used in cultural assessment are seen as characterized by lim- 
ited reliability and validity, as well as by heavy reliance on 
interpretive techniques. 

It is notable that the adoption within social psychology of 
a physical-science ideal of explanation also promotes disci- 
plinary insularity. Although there is considerable openness to 
the integration of biologically based conceptual models and 
methodologies—a trend seen in the growing interest in 
neuroscience—there is little or no interest in integrating the 
theoretical insights and empirical findings from other social 
science fields, such as anthropology. Rather, the body of 
knowledge developed within anthropology becomes difficult 
for social psychologists to assimilate. Thus, for example, 
psychologists typically treat the findings of anthropological 
research as merely descriptive or anecdotal, with little atten- 
tion even given to such findings as a source of hypotheses 
that might be subject to further testing through controlled so- 
cial psychological procedures. A situation is then created in 
which the findings of cultural variability in human behavior 
(which have been widely documented within anthropology) 
as well as anthropological tools of interpretive methodologi- 
cal inquiry tend to be given little or no attention in social psy- 
chological inquiry. 


Default Assumption of Cultural Heterogeneity 


Finally, the downplaying of the importance of cultural con- 
siderations in social psychology also stems from the tendency 
to assume a universalistic cultural context in recruitment of 
research participants and in formulation of research ques- 
tions. This type of stance has led to skewed population sam- 
pling in research. As critics (Reid, 1994) have charged, the 
field has proceeded as though the cultural context for human 
development is homogeneous; consequently, research has 
adopted stances that treat middle-class European-American 
research populations as the default or unmarked subject of 
research: 


Culture ...has been assumed to be homogenous, that is, based 
on a standard set of values and expectations primarily held by 
White and middle-class populations. . . . For example, in devel- 
opmental psychology, children means White children (McLoyd, 
1990); in psychology of women, women generally refers to 
White women (Reid, 1988). When we mean other than White, it 
is specified. (p. 525) 


In this regard, slightly over a decade ago, it was observed that 
fewer than 10% of all hypothesis testing research undertaken 


in social psychology involved samples drawn from two or 
more cultures (Pepitone & Triandis, 1987). Likewise, a re- 
view conducted of more than 14,000 empirical articles in 
psychology published between 1970 and 1989 yielded fewer 
than 4% centering on African Americans (Graham, 1992). 

However, it is not only these skewed sampling practices but 
also the resulting skewed knowledge base brought to bear in 
inquiry that contributes to the downplaying of the importance 
of cultural considerations. Commonly, research hypotheses 
are based on investigators’ translations of observations from 
their own experiences into testable research hypotheses. In 
doing this, however, researchers from non-middle-class 
European-American backgrounds frequently find themselves 
having to suppress intuitions or concerns that arise from their 
own cultural experiences. As reflected in the following ac- 
count by a leading indigenous Chinese psychologist (Yang, 
1997), the present type of stance may give rise to a certain 
sense of alienation among individuals who do not share the so- 
called mainstream cultural assumptions that presently domi- 
nate the field: 


I found the reason why doing Westernized psychological re- 
search with Chinese subjects was no longer satisfying or reward- 
ing to me. When an American psychologist, for example, was 
engaged in research, he or she could spontaneously let his or her 
American cultural and philosophical orientations and ways of 
thinking be freely and effectively reflected in choosing a re- 
search question, defining a concept, constructing a theory and 
designing a method. On the other hand, when a Chinese psy- 
chologist in Taiwan was conducting research, his or her strong 
training by overlearning the knowledge and methodology of 
American psychology tended to prevent his or her Chinese val- 
ues, ideas, concepts and ways of thinking from being adequately 
reflected in the successive stages of the research process. (p. 65) 


It has been suggested, in this regard, that to broaden psycho- 
logical inquiry to be sensitive to aspects of self emphasized in 
Chinese culture, greater attention would need to be paid to 
such presently understudied concerns as filial piety, impres- 
sion management, relationship harmony, and protection of 
face (Hsu, 1963, 1985; Yang, 1988; Yang & Ho, 1988). Tak- 
ing issues of this type into account, researchers of moral de- 
velopment, for example, have challenged the Kohlbergian 
claim that a concern with human rights fully captures the end 
point of moral development (Kohlberg, 1969, 1971); such re- 
searchers have uncovered evidence to suggest that within 
Chinese cultural populations, the end point of moral develop- 
ment places greater emphasis on Ch’ing (human affection or 
sentiment) as well as on the Confucian value of jen (love, 
human-heartedness, benevolence, and sympathy; Ma, 1988, 
1989). 
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As a consequence of its tendency to privilege considera- 
tions emphasized in European-American cultural contexts, 
psychology in many cases has focused on research concerns 
that have a somewhat parochial character, as Moscovici 
(1972) has argued in appraising the contributions of social 


psychology: 


. . . The real advance made by American social psychology 
was ...in the fact that it took for its theme of research and for 
the content of its theories the issues of its own society. Its merit 
was as much in its techniques as in translating the problems of 
American society into sociopsychological terms and in making 
them an object of scientific inquiry. (p. 19) 


In proceeding with a set of concepts that are based on a rela- 
tively narrow set of cultural experiences, psychological re- 
search then has tended to formulate theories and research 
questions that lack adequate cultural inclusiveness and instead 
are based on the experiences of highly select populations. 


Summary 


Despite its concern with social aspects of experience and 
with units of analysis, such as groups, that are larger than in- 
dividuals, social psychological inquiry has tended to down- 
play cultural factors. This downplaying, as we have seen, 
reflects in part the field’s tendency to give weight both to sit- 
uational and individual difference considerations, while ac- 
cording no independent explanatory force to cultural factors. 
Equally, it reflects the field’s embrace of natural-science 
models of explanation, which emphasize generality as the 
hallmark of a successful explanation and controlled experi- 
mentation as the most adequate approach to scientific inquiry. 
Finally, in both its sampling practices and in its consideration 
of research questions, social psychology has privileged a 
middle-class European-American outlook that gives only 
limited attention to the perspectives and concerns of diverse 
cultural and subcultural populations. 


Early Research in Cross-Cultural Psychology 


Although cultural considerations have tended to be accorded 
little importance in social psychological theory, there exists a 
long-standing tradition of research in cross-cultural psychol- 
ogy that has consistently focused attention on them. The 
scope of work in cross-cultural psychology is reflected in the 
vast body of empirical research that has been conducted. 
Empirical work from this perspective is extensive enough 
to fill the six-volume first edition of the Handbook of Cross- 
Cultural Psychology (Triandis & Lambert, 1980), as well as 


numerous textbooks and review chapters (e.g., Berry, Poor- 
tinga, Segall, & Dasen, 1992; Brislin, 1983). 

Research in cross-cultural psychology shares many of the 
conceptual presuppositions of mainstream psychology— 
which explains, at least in part, why it has not fundamentally 
posed a challenge to the mainstream discipline (see discussion 
in Shweder, 1990; J. G. Miller, 2001a). These assumptions in- 
volve a view of culture as an independent variable affecting 
psychological processes understood as a dependent variable. 
From such a perspective, culture is seen as affecting the dis- 
play or level of development of psychological processes, but 
not their basic form—a stance similar to the assumption in 
mainstream social psychology that culture has no impact on 
fundamental psychological phenomena. Research in cross- 
cultural psychology also assumes an adaptive approach to cul- 
ture that is consonant with the view of the environment 
emphasized in mainstream psychology. Naturally occurring 
ecological environments are viewed as presenting objective 
affordances and constraints to which both individual behavior 
and cultural forms are adapted. 

A major thrust of work in cross-cultural psychology has 
been to test the universality of psychological theories under 
conditions in which there is greater environmental variation 
than is present in the cultural context in which the theories 
were originally formulated. Brief consideration of early 
cross-cultural research in the traditions of culture and person- 
ality, culture and cognition, and individualism-collectivism 
highlights both the groundbreaking nature of this work as 
well as the limited extent to which it challenges the core the- 
oretical presuppositions of the mainstream discipline. 


Culture and Personality 


The research tradition of culture and personality constituted 
an interdisciplinary perspective that generated great interest 
and inspired extensive research throughout the middle years 
of the twentieth century (e.g., LeVine, 1973; Shweder, 1979a, 
1979b; Wallace, 1961; J. W. Whiting & Child, 1953; B. B. 
Whiting & Whiting, 1975). Although many of the classic as- 
sumptions of this perspective were subject to challenge, and 
although interest in this viewpoint diminished after the 
1980s, work in culture and personality has served as an im- 
portant foundation for later work on culture and the develop- 
ment of self. 

Some of the earliest work in the tradition of culture and 
personality adopted a critical case methodology to test the 
generality of psychological theories. For example, in a clas- 
sic example of this type of approach, Malinowski tested the 
universality of the Oedipus complex against case materials 
from the Trobriand Islands (1959). In contrast to the Freudian 
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assumption that the father is both the disciplinarian and the 
mother’s lover, in this society, the mother’s brother, rather 
than the father, assumed the role of disciplinarian. Based on 
his analysis, Malinowski concluded that there was no evi- 
dence for the occurrence of the Oedipus complex under these 
societal conditions. Likewise, in another early example of 
this type of approach, Margaret Mead provided evidence that 
adolescence does not invariably involve the patterns of psy- 
chosocial conflict that are observed in Western populations 
and that were once assumed in psychological theory to be 
universal (1928, 1939). 

Later work in culture and personality developed models 
that portrayed culture as an amalgam of parts that conformed 
to the dominant pattern of individual personality possessed 
by members of the culture. Such an assumption may be seen 
reflected, for example, in the stance adopted by Benedict as 
she portrayed culture and personality as highly integrated 
entities: “A culture, like an individual, is a more or less con- 
sistent pattern of thought and action” (1932, p. 42). Applying 
this model to an analysis of Japan, Benedict (1946) traced 
broad consistencies that characterized Japanese values, social 
institutions, national policy, and interpersonal relations. 
Similar types of assumptions characterized the national- 
character studies that were conducted—research that fre- 
quently involved studying culture at a distance by relying on 
sources such as literature, art, and history (Adorno, Frenkel- 
Brunswick, Levinson, & Sanford, 1950; Gorer, 1955; Gorer 
& Rickman, 1962). For example, in examining why Nazism 
was embraced in Germany, researchers identified an assumed 
“authoritarian” personality that they maintained was charac- 
teristic of the German psyche and that they saw as contribut- 
ing to the emphasis on obedience to authority observed in 
Germany under Nazi rule (Fromm, 1941). 

Still a third thrust of work in culture and personality for- 
warded a personality-integration-of-culture model (Kardiner, 
1945; B. B. Whiting & Whiting, 1975). From this viewpoint, 
individual personality structure was regarded as adapted to 
cultural meanings and practices that in turn were regarded as 
adapted to the demands of particular ecological settings. It 
was assumed from this perspective that individuals come 
over time to be socialized to behave in ways that fit what is re- 
garded as the dominant psychological orientation of adults in 
the culture. As reflected in research that made use of the 
ethnographic reports compiled in the Human Relations Area 
Files (HRAF; J. W. Whiting & Child, 1953), studies empiri- 
cally tested assumed causal relationships between features of 
the natural ecology, modes of social organization, child so- 
cialization, and expressive aspects of culture, such as reli- 
gious beliefs. In a groundbreaking program of research that 
stands as one of the most influential contributions of this 


school of thought, the Six Culture study tested these relations 
in an investigation that involved conducting behavioral ob- 
servations of parenting and child behavior in everyday con- 
texts in a worldwide sampling of cultures (J. W. Whiting & 
Child, 1975). As one example of the many findings from the 
Six Culture project, it was demonstrated that cultures with 
rich natural ecologies give rise to societies with complex 
social structures, which, in turn, lead to the development of 
egoistic personality dispositions among members of the cul- 
tures and to cultural meanings and practices that emphasize 
competitiveness. 

In terms of criticisms, research in the tradition of culture 
and personality was subject to challenge in terms of the theo- 
ries of personality and of culture that it embodied (Shweder, 
1979a, 1979b). Concerns were raised regarding the determin- 
ism of treating culture merely as a concomitant of individual 
personality, as well as regarding what was viewed as its 
overly socialized conception of the person—a conception 
that treated the individual as merely passively conforming to 
prevailing norms. Additionally, it was argued that work in 
culture and personality overestimated the thematic nature of 
cultural forms, as well as failed to take into account the lim- 
ited longitudinal stability and cross-situational consistency of 
personality. For example, evidence suggested that what had 
been interpreted as a difference in personality between 
cultural populations in fact could be explained in normative 
terms—as individuals responding to the behavioral expecta- 
tions of different everyday cultural settings (Shweder, 1975). 
Thus, the observation was later made that one of the most 
important influences of culture on individual development 
is that it provides contrasting socialization experiences rather 
than affects individual personalities. For example, the de- 
gree to which children in different cultures emphasize com- 
petitive versus cooperative behavior appears closely linked to 
whether children spend their days in the competitive atmos- 
phere of formal school settings versus the more prosocial 
atmosphere of sibling caregiving activities (B. B. Whiting & 
Edwards, 1988). 

In terms of enduring contributions, work on culture and 
personality succeeded in highlighting the importance of un- 
derstanding the mutual influence of ecological, psychologi- 
cal, and cultural processes. Methodologically rich, research 
in this tradition not only demonstrated the importance of in- 
tegrating both ethnographic and quantitative approaches in 
psychological investigation, but also called attention to the 
value of observing behavior in naturalistic contexts and of 
capturing the dynamics of everyday cultural activities and 
practices (e.g., Ford, 1967; Honigmann, 1954; LeVine, 1973; 
Spindler, 1980; Spiro, 1958, 1965, 1982; Wallace, 1961; J. W. 
Whiting & Child, 1953; B. B. Whiting & Whiting, 1975). 
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However, although the study of culture and personality left 
a rich and highly influential legacy with many investigators 
associated with this tradition at the forefront of contemporary 
work in cultural psychology, work in culture and personality 
did not directly move into the issues of culture and basic psy- 
chological theory that are being addressed in contemporary 
research in cultural psychology. Rather, most work in culture 
and personality assumed psychological universalism or what 
theorists have characterized as the “postulate of psychic 
unity” (e.g., Shweder, 1990). Personality theories were 
treated as having universal validity and thus as applicable in 
unchanged form in diverse cultural populations. Little consid- 
eration was given to respects in which these theories (e.g., 
psychoanalysis) might themselves be culturally bound. 


Individualism-Collectivism 


Work on individualism-collectivism represents one of the 
most influential and long-standing traditions of research in 
cross-cultural psychology. Associated particularly with the 
early theoretical work of investigators such as Hofstede and 
Triandis (Hofstede, 1980; Triandis, 1972, 1980, 1988), this 
perspective has been applied to explain variation in a wide 
range of behavioral domains on a worldwide scale. Thus, 
the constructs of individualism-collectivism have been in- 
voked in explaining such diverse phenomena as values 
(Hofstede, 1980; S. H. Schwartz, 1994), cognitive differen- 
tiation (Witkin & Berry, 1975), and modernity (Inkeles, 
1974). Embracing the explanatory goals of predictive power 
and parsimony as well as the quantitative methodological 
approaches of the mainstream discipline, the primary focus 
of work on individualism-collectivism has been to forward a 
universal framework that predicts the nature of both cultural 
forms and individual psychological experience. 
Individualism and collectivism are conceptualized as syn- 
dromes of beliefs and attitudes that distinguish different cul- 
tural populations. Collectivism is seen as encompassing such 
core ideas as an emphasis on the views, needs, and goals of 
one’s in-group as having priority over one’s own personal 
views, needs, and goals, and a readiness to cooperate with 
in-group members. In contrast, individualism is seen as en- 
tailing such core ideas as that of individuals as ends in them- 
selves who should realize their own selves and cultivate their 
own judgment. In collectivist cultures, in-groups are assumed 
to influence a broad range of behaviors, with individuals ex- 
periencing pressure to conform to in-group norms or leave 
the groups. In contrast, in individualistic cultures, in-groups 
are seen as providing only limited norms, with individuals 
readily able to enter and exit in-groups: The relationship of 
individuals with their in-groups is of limited intensity. 


Further distinctions are made in this broad dichotomy to 
capture dimensions of variation between different individual- 
istic and collectivist cultures (e.g., Triandis, 1989, 1996). 
Thus, for example, cultures are seen as differing in terms of 
which in-groups are important (e.g., family vs. country), the 
particular collectivist values emphasized (e.g., harmony vs. 
dignity), and the ease with which individuals can join in- 
groups and deviate from their norms (e.g., tightness vs. loose- 
ness of norms; Triandis, 1988). In addition to the global 
constructs of individualism-collectivism, additional con- 
structs are invoked to explain individual differences. Thus, 
the constructs of idiocentrism and allocentrism have been 
proposed as the psychological manifestations at the level of 
individual self-definitions, beliefs, and attitudes of individu- 
alism and collectivism. It is assumed that individuals in all 
cultures maintain both idiocentric and allocentric aspects of 
their selves. Cultural differences at the psychological level, 
then, are seen as reflecting the differential sampling of idio- 
centric as compared with allocentric features of self in di- 
verse sociocultural contexts (Triandis, 1990, 1996). 

In terms of explaining the cultural syndromes of individu- 
alism and collectivism, research has shown that factors such 
as affluence, exposure to mass media, modernization, mobil- 
ity, movement from rural to urban settings, and industrializa- 
tion are linked to societal shifts from collectivism toward 
individualism. In turn, a wide range of psychological conse- 
quences are seen as linked to such shifts, with individualism, 
as compared with collectivism, associated with such out- 
comes as higher self-esteem and subjective well-being (e.g., 
Diener & Diener, 1995; Diener, Diener, & Diener, 1995), 
values such as being curious and broad-minded as compared 
with emphasizing family security and respect for tradition 
(S. H. Schwartz, 1994), as well as direct and frank communi- 
cation styles, as compared with relatively indirect communi- 
cation styles that emphasize context and concern for the 
feelings of the other (Gudykunst, Yoon, & Nishida, 1987; 
Kim, Sharkey, & Singelis, 1994; Triandis, 1994). 

The prototypical research conducted by investigators in the 
tradition of individualism-collectivism involves multiculture 
survey or questionnaire research. This work is concerned with 
developing ecological models of culture that can be invoked 
to explain the distribution of individualism-collectivism and 
of related psychological characteristics on a worldwide scale 
(for review, e.g., see Berry et al., 1992). 

In recent years, researchers have shown increased interest 
in the constructs of individualism and collectivism as a con- 
sequence of these constructs being linked to the distinction 
drawn by Markus and Kitayama (1991) between independent 
versus interdependent modes of self-construal. In introduc- 
ing the contrast between independent versus interdependent 
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modes of self-construal, Markus and Kitayama did not adopt 
all of the assumptions of the individualism-collectivism 
framework as developed by early cross-cultural psycholo- 
gists. In contrast to such theorists, for example, they were 
concerned with the cultural psychological agenda of identi- 
fying insights for basic psychological theory of cultural 
variation (e.g., identifying new culturally based forms of mo- 
tivation), rather than with the cross-cultural agenda of apply- 
ing existing psychological theories in diverse cultural 
contexts (e.g., identifying cultural variation in the emphasis 
placed on internal vs. external locus of control, as specified 
by Rotter’s framework). They tended to eschew the use of 
scale measures of individualism-collectivism; they also did 
not draw some of the global contrasts made within much 
work within this framework, such as devaluation of the self in 
collectivism or of relationships in individualism (see discus- 
sion in Kitayama, in press; J. G. Miller, 2002). However, in 
part as a reflection of the interest in the distinction between 
independent versus interdependent self-construals introduced 
by Markus and Kitayama (1991), the number of investigators 
concerned with individualism and collectivism has grown in 
recent years, with many investigators drawing on this frame- 
work to further the cultural psychological agenda of broaden- 
ing basic psychological theory (e.g., Greenfield & Cocking, 
1994; Greenfield & Suzuki, 1998), and other investigators in 
social psychology drawing on the framework to further the 
original agenda of theorists such as Triandis to develop a uni- 
versal, ecologically based framework to explain psychologi- 
cal variation on a worldwide scale (e.g., Oyserman, Coon, & 
Kemmelmeier, 2002). 

In terms of criticisms, the tradition of cross-cultural re- 
search on individualism is limited in its emphasis on testing 
the generality of existing psychological theories in diverse 
cultural contexts, and in its inattention to examining the de- 
gree to which such theories themselves may be culturally 
bound and take somewhat contrasting forms in different cul- 
tural contexts. This stance represents perhaps the most central 
reason that mainstream psychologists have tended to view the 
findings of research on individualism-collectivism as primar- 
ily descriptive in nature rather than to view them as contribut- 
ing to basic psychological theory (e.g., Shweder, 1990). The 
framework of individualism-collectivism has also been sub- 
ject to criticism for its global view of culture: Much work in 
this tradition fails to account for subtleties in cultural mean- 
ings and practices, and it has also been criticized for the some- 
what stereotypical nature of its portrayal of these two cultural 
systems (e.g., Dien, 1999). Thus, for example, as numerous 
theorists have noted (e.g., Markus & Kitayama, 1991; J. G. 
Miller, 1994, 2002; Rothbaum, Pott, Azuma, Miyake, & 


Weisz, 2000), much work on individualism-collectivism has 
failed to recognize that concerns with self have importance 
in collectivist cultures rather than only in individualistic 
cultures—although they may take somewhat contrasting 
forms in the two cultural contexts, just as concerns with rela- 
tionships have importance but may take different forms in the 
two cultural contexts. Finally, methodological criticisms have 
been directed at the widespread use of attitudinal scale mea- 
sures in work in this tradition (e.g., Kitayama, 2002), with the- 
orists noting the many problems associated with the limited 
ability of individuals to report on the orientations emphasized 
in their culture and with the inattention to everyday cultural 
practices, artifacts, and routines that has characterized much 
work in this tradition with its reliance on attitudinal indexes of 
culture. 

The individualism-collectivism framework has made 
major and enduring contributions to understanding culture 
and society in ecological terms. Work in this tradition has 
been of great value in providing insight into processes of 
modernization and cultural change, and it has assisted in 
modeling how both factors in the physical environment and 
social structural considerations affect psychological out- 
comes. The broad framework of individualism-collectivism 
has also proven useful heuristically as a source of initial 
research hypotheses, with this distinction embraced—at least 
in a limited way—not only by investigators concerned with 
the more universalistic agenda of cross-cultural psychology, 
but also by some theorists identified more explicitly with 
cultural psychology (e.g., Greenfield & Suzuki, 1998). 


Culture and Cognitive Development 


Early work on culture and cognitive development was theo- 
retically diverse and international in character, drawing on 
Piagetian as well as Vygotskiian viewpoints among others. 
Within Piagetian viewpoints, cross-cultural research was un- 
dertaken to test the presumed universality of cognitive devel- 
opmental theory (Dasen, 1972; Dasen & Heron, 1981). This 
work involved administering standard Piagetian cognitive 
tests in different cultures after translating the tests and mak- 
ing minor modifications to ensure their ecological validity. 
Likewise, in the domain of moral development, Kohlbergian 
measures of moral judgment were administered in a large 
number of cultural settings after only minor changes in re- 
search protocols were made, such as substituting local names 
for those originally in the text (eg, Edwards, 1986; 
Kohlberg, 1969; Snarey, 1985). The findings on Piagetian 
tasks suggested that in certain African settings, cognitive de- 
velopment proceeds at a slower rate than that observed in 
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Geneva, with the highest level of formal operations generally 
not obtained. Likewise, cross-cultural Kohlbergian research 
indicated that populations not exposed to higher levels of 
education do not reach the highest (postconventional) stage 
of moral judgment. Results of this type were generally inter- 
preted as reflecting the cognitive richness of the environment 
that resulted in more advanced cognitive development in cer- 
tain cultures over others. They were also interpreted as sup- 
porting the universality of cognitive developmental theory. It 
was concluded that culture is nonessential in development, in 
that the sequence and end point of developmental change are 
culturally invariant (e.g., Piaget, 1973). 

Inspired by Vygotsky and other Soviet investigators (e.g., 
Vygotsky, 1929, 1934/1987; 1978; Luria, 1928, 1976), theo- 
rists in the early sociocultural tradition of cross-cultural re- 
search on cognitive development proceeded by undertaking 
experiments in diverse cultural settings. However, in contrast 
to cognitive developmental viewpoints, they assumed that 
cognitive development has a formative influence on the emer- 
gence of basic psychological processes. Rather than view- 
ing development as proceeding independently of cultural 
learning, cultural learning was assumed to be necessary for 
development to proceed. Vygotskiian theory and related so- 
ciocultural approaches emphasized the importance of tool use 
in extending cognitive capacities. From this perspective, cul- 
tural transmission was assumed to be essential, with cognitive 
development involving the internalization of the tools pro- 
vided by the culture. Among the key cultural tools assumed to 
transform minds were literacy and formal schooling, through 
their assumed effects of providing individuals exposure to ab- 
stract symbolic resources and giving rise to modes of reason- 
ing that are relatively decontextualized and not directly tied to 
practical activity (e.g., Goody, 1968). In viewing cultural 
processes as a source of patterning of thought, work in the so- 
ciocultural tradition shared many assumptions with and may 
be considered part of cultural psychology. However, at least in 
its early years, research in this tradition focused on establish- 
ing the universality of basic cognitive processes; this linked it 
closely to other contemporary traditions of cross-cultural 
cognitive developmental research. 

The earliest traditions of cross-cultural experimental re- 
search undertaken by sociocultural theorists resembled those 
of Piagetian researchers in both their methods and their find- 
ings. After making only minor modifications, experimental 
tests were administered to diverse cultural populations. These 
populations were selected to provide a contrast in the cultural 
processes thought to influence cognitive development, such 
as literacy and schooling (e.g., Bruner, Olver, & Greenfield, 
1966; Cole, Gay, Glick, & Sharp, 1971). Results revealed 


that individuals who were illiterate or who lacked formal ed- 
ucation scored lower in cognitive development, failing to 
show such features as abstract conceptual development or 
propositional reasoning, which appeared as end points of 
cognitive development in Western industrialized contexts. 
Such findings supported a “primitive versus modern mind” 
interpretation of cultural differences, in which it was as- 
sumed that the cognitive development of certain populations 
remains arrested at lower developmental levels. This type of 
argument may be seen, for example, in the conclusion drawn 
by Greenfield and Bruner (1969) in drawing links between 
such observed cross-cultural differences and related differ- 
ences found in research contrasting cognition among main- 
stream and minority communities within the United States: 


... As Werner (1948) pointed out, ‘development among primi- 
tive people is characterized on the one hand by precocity and, on 
the other, by a relatively early arrest of the process of intellectual 
growth.’ His remark is telling with respect to the difference we 
find between school children and those who have not been 
to school. The latter stabilize earlier and do not go on to new lev- 
els of operation. The same ‘early arrest’ characterizes the dif- 
ferences between ‘culturally deprived’ and other American 
children. 

. .. Some environments ‘push’ cognitive growth better, ear- 
lier, and longer than others. . . . Less demanding societies—less 
demanding intellectually—do not produce so much symbolic 
embedding and elaboration of first ways of looking and thinking. 
(p. 654) 


From this perspective, the impact of culture on thought was 
assumed to be highly general, with individuals fully internal- 
izing the tools provided by their culture and that resulting in 
generalized cultural differences in modes of thought. 

Later experimental research in the sociocultural tradition 
challenged these early conclusions about global differences 
in thought and about the transformative impact of cultural 
tools on minds. Programs of cross-cultural research were un- 
dertaken that focused on unpacking the complex cognitive 
processes that are tapped in standard cognitive tests and in 
assessing these components under diverse circumstances 
(Cole & Scribner, 1974). Thus, for example, rather than using 
the multiple objects that tended to be employed in Piagetian 
seriation tasks, with their extensive memory demands, re- 
searchers employed fewer objects in memory procedures. 
Also, processes such as memory were assessed in the context 
of socially meaningful material, such as stories, rather than 
merely in decontextualized ways, such as through the presen- 
tation of words. These and similar modifications showed that 
cognitive performance varied depending on features of the 
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task situation and that cultural differences did not remain sta- 
ble. For example, in experimental research, it was shown that 
whereas Liberian schoolchildren are superior to unschooled 
Mano rice farmers in abstract classification of geometric 
shapes, the farmers tended to display more abstract levels of 
classification than shown by the school children on a rice- 
sorting task (Irwin & McLaughlin, 1970). 

Notably, in this early tradition, experimental research fo- 
cused on isolating the impact on thought of literacy and 
schooling, as two of the dimensions believed to be most influ- 
ential in affecting cognitive development. In one landmark 
program of such research, Scribner and Cole (1981) conducted 
research among the Vai tribal community as a way of assessing 
the impact of literacy on thought independently of the effects 
of schooling. Whereas in most societies, literacy covaries with 
schooling, among the Vai certain individuals became literate 
through working as priests without attending school. Results 
of the Scribner and Cole (1981) investigation revealed that lit- 
eracy had no independent impact on thought beyond the ef- 
fects of schooling. In turn, the many programs of research 
focused on evaluating the cognitive consequences of school- 
ing revealed that formal schooling enhanced performance on 
tests of cognitive achievement, but suggested that they had 
highly limited generality in everyday domains of thought out- 
side of school contexts (Sharp, Cole, & Lave, 1979). 

In sum, early research on culture and cognition set a 
strong foundation for contemporary cognitive work in cul- 
tural psychology. Whereas its early findings suggested that 
culture had the effect of arresting the rate of cognitive devel- 
opment or the highest levels of cognitive development 
attained, this finding became qualified as conclusions pointed 
to the need for a more contextually based view of cognition. 
The early image of global cultural differences in thought, 
linked to an image of a primitive versus modern mind, gave 
way to a view of common basic cognitive competencies. 

Early work on culture and thought left many significant 
legacies that remain influential in the field. There was a 
recognition of the need to treat cognition as contextually de- 
pendent rather than highly global. Equally, it was demon- 
strated that experimental tasks do not provide pure measures 
of cognitive ability. Rather, research revealed that greater 
cognitive competence tends to be evident when individuals 
respond to experimental tasks that are more motivationally 
engaging or when individuals are observed interacting in the 
contexts of everyday activities. However, at least in its early 
period, a strong agenda had not yet been developed for cul- 
tural psychology. As the anthropologist T. Schwartz (1981) 
once commented, work in this tradition arrived at a conclu- 
sion of universal cognitive competencies that, although it 
represented a welcome advance from the early emphasis on 


global cultural differences in thought, seemed to be proving 
something that was already assumed by many anthropolo- 
gists who held a view of individuals as competent in fulfilling 
the cognitive demands of their culture. The field had not yet 
reached the point of articulating a positive agenda of charac- 
terizing how culture affected cognition. It was this kind of 
stance that emerged as sociocultural work, and work on cul- 
ture and cognition began to turn more explicitly to cultural 
psychology. 


Summary 


In sum, early research in cross-cultural psychology laid im- 
portant groundwork for contemporary research in the newly 
reemerging framework of cultural psychology. In terms of 
major empirical findings, this early work challenged the idea 
that cultural differences map onto personality differences of 
individual members of a culture, and pointed instead to the 
role of normative practices in underlying observed differ- 
ences in individual behavior. It also challenged claims of 
global differences in cognitive capacity linked to moderniz- 
ing influences, and instead identified modernizing influences 
as having localized effects on cognitive capacities. It is im- 
portant to note, however, that although in many respects it 
was a precursor to much contemporary work in cultural psy- 
chology, early work in these traditions of cross-cultural psy- 
chology tended to remain in a relatively peripheral role in the 
discipline and not to impact fundamentally on psychological 
theory. Thus, in particular, work on culture and personality 
never challenged the universality of psychological theories 
themselves, such as psychoanalysis, but merely applied them 
in understanding levels of personality development or dis- 
play of assumed personality traits in different cultures. 
Equally, work on individualism and collectivism was con- 
cerned with developing parameters that affected the level of 
development of particularly psychological attributes, but not 
the nature of the attributes themselves. Thus, for example, the 
prediction was made that self-esteem would be emphasized 
more in individualistic than in collectivist cultures (e.g., 
Triandis, 1989), but culture was not assumed to affect quali- 
tatively the nature of self-processes or the relevance of self- 
esteem as a dimension of self in different cultural contexts 
(e.g., Heine, Lehman, Markus, & Kitayama, 1999). Finally, 
early comparative research in the sociocultural tradition 
approached cognitive processes as culturally dependent, but 
(at least in these earlier years) tended not to go beyond a con- 
textually based view of cognition and claims of universal 
cognitive competencies in its implications for psychological 
theory. In the next section, consideration is given to some of 
the theoretical insights that underlay the turn from these 


earlier traditions of cultural research to a more explicit cul- 
tural psychological stance. 


INSIGHTS AND CHALLENGES 
OF CULTURAL PSYCHOLOGY 


Cultural psychology represents an eclectic interdisciplinary 
perspective that has many roots. In many (but not all) cases, 
investigators associated with some of these traditions of re- 
search in cross-cultural psychology moved toward a cultural 
psychological outlook in response to the perceived limita- 
tions of some of the conceptual frameworks and goals of their 
earlier research. Thus, for example, many leading investiga- 
tors associated with culture and personality, such as individ- 
uals who worked on the Six Culture project (B. B. Whiting & 
Whiting, 1975), as well as those associated with early work 
in the Vygotskiian tradition on culture and thought, are at 
the forefront of contemporary work in cultural psychology. 
Equally, however, research in cultural psychology has drawn 
from disciplinary perspectives outside psychology. Thus, 
within psychological and cognitive anthropology, many in- 
vestigators moved in a cultural psychological direction both 
from a concern that some of the early theories of culture and 
personality were parochial and needed to be formulated in 
more culturally grounded terms and from a sense that to un- 
derstand culture requires attention to psychological and not 
merely anthropological considerations (e.g., Lutz & White, 
1986; T. Schwartz, White, & Lutz, 1992; Shore, 1996; 
Strauss & Quinn, 1997). Thus, for example, arguments were 
made that to avoid an oversocialized conception of the person 
as merely passively conforming to cultural expectations re- 
quired taking into account the subjectivity of intentional 
agents (e.g., Strauss, 1992). Equally, in another major re- 
search tradition, interest developed in cultural work within 
sociolinguistics. Thus, in work on language learning, it was 
recognized that individuals come to acquire not only the code 
of their language but also the meaning systems of their cul- 
ture through everyday language use (e.g., Heath, 1983; 
P. Miller, 1986; Ochs & Schieffelin, 1984). Likewise, it came 
to be understood that everyday discourse contexts serve as a 
key context of cultural transmission. 


Key Conceptual Premises 


The perspective of cultural psychology is defined concep- 
tually by its view of culture and psychology as mutually con- 
stitutive phenomena. From this perspective, cultural processes 
are seen as presupposing the existence of communities of in- 
tentional agents who contribute meanings and form to cultural 
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beliefs, values, and practices. Equally, psychological func- 
tioning is seen as dependent on cultural mediation, as individ- 
uals participate in and come to acquire as well as create and 
transform the shared meaning systems of the cultural commu- 
nities in which they participate. It is this monistic assump- 
tion of psychological and cultural processes as mutually 
dependent—not the type of methodology adopted—that is 
central to cultural psychology. Thus, for example, whether 
an approach employs qualitative versus quantitative methods 
or comparative versus single cultural analysis does not mark 
whether the approach may be considered as within the tradi- 
tion of cultural as compared with cross-cultural psychology. 


Active Contribution of Meanings to Experience 


A core assumption underlying cultural psychology is linked 
to the insight of the Cognitive Revolution regarding the im- 
portance of meanings in mediating behavior (Bruner, 1990). 
It came to be understood that individuals go beyond the 
information given as they contribute meanings to experi- 
ence, with these meanings in turn influencing individuals’ 
affective, cognitive, and behavioral reactions. The cultural 
implications of this cognitive shift were not appreciated 
immediately within psychology. Rather, as Bruner (1990) 
observes in presenting a brief history of the field, there was a 
tendency for many years to emphasize the autonomous 
self-construction of knowledge—independently of cultural 
transmission. The cultural implications of the Cognitive Rev- 
olution were also not apparent for many years because of the 
ascendance of information-processing accounts of cognition, 
which stress the automatic processing of information rather 
than the more active and creative processes of meaning- 
making. Nonetheless, although this image of an active con- 
structivist agent for many years was not linked with cultural 
viewpoints, it formed an important theoretical basis for cul- 
tural psychology. The recognition that an act of interpretation 
mediates between the stimulus and the response established a 
theoretical basis upon which investigators could draw as they 
began to appreciate the cultural aspects of meanings and these 
meanings’ impact on thought and behavior. 


Symbolic Views of Culture 


The development within anthropology of symbolic views of 
culture (Geertz, 1973; Sahlins, 1976; Shweder & LeVine, 
1984) also contributed to the emergence of cultural psychol- 
ogy in that it highlighted the need to go beyond the prevailing 
tendency to treat culture merely in ecological terms as an as- 
pect of the objective environment. Ecological views of culture 
are critically important in calling attention to the adaptive 
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implications of features of the context (Bronfenbrenner, 
1979). However, they also are limited in treating the context 
exclusively in objective terms, as presenting affordances and 
constraints that are functional in nature. In such frameworks, 
which have tended to be adopted in both mainstream and 
cross-cultural psychology, culture is seen as nonessential to 
interpretation or construction of reality. In contrast, within 
symbolic approaches, cultural systems are understood as bear- 
ing an indeterminate or open relationship to objective con- 
straints rather than being fully determined by objective 
adaptive contingencies. Within symbolic approaches to cul- 
ture, it is recognized that cultural meanings serve not merely 
to represent reality, as in knowledge systems, or to assume a 
directive function, as in systems of social norms. Rather, they 
are seen as also assuming constitutive or reality-creating roles. 
In this latter role, cultural meanings serve to create social re- 
alities, whose existence rests partly on these cultural defini- 
tions (Shweder, 1984). This includes not only cases in which 
culturally based social definitions are integral to establishing 
particular social institutions and practices (e.g., marriage, 
graduation, etc.) but also cases in which such definitions form 
a key role in creating psychological realities. Thus, it is in- 
creasingly recognized that aspects of psychological function- 
ing (e.g., emotions) depend, in part, for their existence on 
cultural distinctions embodied in natural language categories, 
discourse, and everyday practices. For example, the Japanese 
emotional experience of amae (Doi, 1973; Yamaguchi, 2001) 
presupposes not only the concepts reflected in this label but 
also norms and practices that support and promote it. As an 
emotional state, amae involves a positive feeling of depend- 
ing on another’s benevolence. At the level of social practices, 
amae is evident not only in caregiver-child interactions in 
early infancy (Doi, 1973, 1992), but also in the everyday in- 
teractions of adults, who are able to presume that their inap- 
propriate behavior will be accepted by their counterparts in 
close relationships (Yamaguchi, 2001). 

The significance of a symbolic view of culture for the de- 
velopment of cultural psychology was in its complementing 
the attention to meaning-making heralded by the cognitive 
revolution. It became clear that not only were meanings in 
part socially constructed and publicly based, but they also 
could not be purely derived merely by inductive or deductive 
processing of objective information. Culture, then, in this 
way became an additional essential factor in psychological 
explanation, beyond merely a focus on objective features of 
the context and subjective features of the person. 


Incompleteness Thesis 


Finally, and most critically, the theoretical grounding of 
cultural psychology emerged from the realization of the 


necessary role of culture in completion of the self, an insight 
that has been termed the incompleteness thesis (Geertz, 1973; 
T. Schwartz, 1992). This stance does not assume the absence 
of innate capacities or downplay the impact of biological in- 
fluences as a source of patterning of individual psychological 
processes. However, without making the assumption that 
psychological development is totally open in direction, with 
no biological influences either on its initial patterning or on 
its subsequent developmental course, this stance calls atten- 
tion to the essential role of culture in the emergence of 
higher-order psychological processes. Individuals are viewed 
not only as developing in culturally specific environments 
and utilizing culturally specific tools, but also as carrying 
with them, in their language and meanings systems, cultur- 
ally based assumptions through which they interpret experi- 
ence. Although there has been a tendency within psychology 
to treat this culturally specific input as noise that should be 
filtered out or controlled in order to uncover basic features of 
psychological functioning, the present considerations suggest 
that it is omnipresent and cannot be held constant or elimi- 
nated. Rather, it is understood that the culturally specific 
meanings and practices that are essential for the emergence 
of higher-order psychological processes invariably introduce 
a certain cultural-historical specificity to psychological func- 
tioning, as Geertz (1973) once noted: 


We are... incomplete or unfinished animals who complete or 
finish ourselves through culture—and not through culture in gen- 
eral but through highly particular forms of it. (p. 49) 


From the present perspective, it is assumed that whereas an 
involuntary response may proceed without cultural media- 
tion, culture is necessary for the emergence of higher-order 
psychological processes. Wertsch (1995) articulates this 
point: 


Cultural, institutional, and historical forces are ‘imported’ into 
individuals’ actions by virtue of using cultural tools, on the one 
hand, and sociocultural settings are created and recreated 
through individuals’ use of mediational means, on the other. The 
resulting picture is one in which, because of the role cultural 
tools play in mediated action, it is virtually impossible for us to 
act in a way that is not socioculturally situated. Nearly all human 
action is mediated action, the only exceptions being found per- 
haps at very early stages of ontogenesis and in natural responses 
such as reacting involuntarily to an unexpected loud noise. 
(p. 160) 


Thus, for example, whereas involuntary physiological reac- 
tions may be elicited by situational events, whether they 
become interpreted and experienced in emotional terms 
depends in part on such input as culturally based theories 


regarding the nature, causes, and consequences of emotions, 
cultural routines for responding to emotions, natural lan- 
guage categories for defining emotions, and a range of other 
sociocultural processes. 

This assumption of the interdependence of psychological 
and cultural processes represents the central idea of cultural 
psychology. Notably, the term cultural psychology was se- 
lected by theorists to convey this central insight that psycho- 
logical processes need to be understood as always grounded 
in particular socio-cultural-historical contexts that influence 
their form and patterning, just as cultural communities de- 
pend for their existence on particular communities of inten- 
tional agents. The present considerations then lead to the 
expectation that qualitative differences in modes of psy- 
chological functioning will be observed among individuals 
from cultural communities characterized by contrasting self- 
related sociocultural meanings and practices. 


Summary 


Among the key conceptual insights giving rise to cultural 
psychology were the emergence of a view of the individual as 
actively contributing meanings to experience and an under- 
standing of culture as a symbolic system of meanings and 
practices that cannot be explained exclusively in functional 
terms as mapping onto objective adaptive constraints. Crucial 
to the field’s development was that it also came to be recog- 
nized that higher-order psychological processes depend for 
their emergence on individuals’ participation in particular 
sociocultural contexts, and thus that culture is fundamental 
to the development of self. 


Select Overview of Empirical Research 
in Cultural Psychology 


The present section examines representative examples of em- 
pirical studies that embody this core insight regarding the 
cultural grounding of psychological processes, an insight that 
is central to the many traditions of work in cultural psychol- 
ogy (e.g., Cole, 1990, 1996; Markus et al., 1996; J. G. Miller, 
1997; Shweder, 1990; Shweder et al., 1998). Although the 
overview presented here is necessarily highly selective and 
incomplete, it serves to illustrate ways in which cultural re- 
search is offering new process explanations of psychological 
phenomena as well as identifying fundamental variability in 
the forms that psychological processes assume. 


Sociocultural Traditions of Research 


The discussion here makes reference to findings from a di- 
verse range of related viewpoints that have derived from the 
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work of such major cultural theorists as Vygotsky (1978, 
198 1a, 1981b), Leontiev (1979a, 1979b), Luria (1979, 1981), 
Bakhtin (1986), and Bourdieu (1977) among others; their 
work is reflected in the many contemporary traditions of re- 
search in sociocultural psychology (e.g., Cole, 1988, 1990; 
Rogoff, 1990; Valsiner, 1988, 1989; Wertsch, 1979, 1991). 
Central to theoretical work within this tradition is an empha- 
sis on the mediated nature of cognition. Human behavior is 
seen as dependent on cultural tools or on other mediational 
means, with language recognized as one of the most central 
of these cultural supports. Embodying a broad lens, sociocul- 
tural approaches focus on understanding human activity at 
phylogenetic, historical, ontogenetic, and microgenetic lev- 
els, with cultural practices and activities viewed in terms of 
their place in larger sociopolitical contexts. 

Considerable research in this area focuses on document- 
ing how interaction with cultural tools and participation in 
everyday cultural activities leads to powerful domain- 
specific changes in thought. In work on everyday cognition 
(see review in Schliemann, Carraher, & Ceci, 1997), it has 
been shown, in fact, that everyday experiences can produce 
changes that represent an advance on those produced by 
schooling. For example, Scribner (1984) documented that 
individuals who work as preloaders in a milk factory and 
have less formal education than do white-collar workers are 
able to solve a simulated loading task more rapidly than do 
white-collar workers through using a more efficient percep- 
tual solution strategy as contrasted with a slower enumerative 
approach. Likewise, in a growing body of research on exper- 
tise, it has been revealed, for example, that compared with 
novice adult chess players, child chess experts use more 
complex clustering strategies in organization and retrieval of 
chess information; they are also more proficient in their 
memory for chess pieces (Chi, Glaser, & Farr, 1998). Similar 
effects have equally been shown to occur in the solving of 
math problems among expert versus novice abacus users 
(Stigler, 1984). 

It is important to note that sociocultural research is also 
providing new process models of the nature of everyday cog- 
nition. For example, recent research on situated cognition has 
challenged the view of learning as a distinct activity or as an 
end in itself set off from daily life and has emphasized its em- 
beddedness in everyday activities and social contexts (Lave, 
1988, 1993; Lave & Wenger, 1991). Research has revealed, 
for example, that in contrast to the forms of instruction that 
occur in formal school settings, learning in everyday situa- 
tions is more oriented toward practical problems. In part as a 
result, individuals tend to be more motivationally involved in 
tasks and spontaneously to search for and generate more flex- 
ible task solutions in everyday situations than they do in 
formal school contexts. 
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Sociocultural research is also offering new answers to 
long-standing questions in psychological development. For 
example, work by Cole and his colleagues (Cole & Enge- 
stroem, 1995) has offered a novel process explanation of one 
of the central theoretical problems of cognitive development 
and language learning—explaining how individuals can ob- 
tain a more powerful conceptual structure if they do not al- 
ready in some way possess that structure, or how qualitative 
and not merely quantitative developmental change may 
occur. In research conducted on the teaching of reading, it has 
been demonstrated that a range of mediational means, such as 
simplified reading materials, expert guidance, and so on, are 
available in everyday socialization contexts that support 
learning to read. Thus, it is noted that many of the structures 
entailed in the achievement of competence in reading exist 
between persons before they appear as individual competen- 
cies that may be manifest without this level of cultural 
support. Equally, in another example, evidence has been ob- 
tained to suggest that changes in children’s forms of social 
participation explain some of the marked advances in cogni- 
tive and social functioning that have been linked to the 5- to 
7-year-old age shift among the schooled populations that 
have been subject to most study by cognitive developmental 
psychologists (Rogoff, 1996). 


Cultural Social Psychological Traditions 
of Cognitive Research 


Cultural social psychological work on cognition has a more 
recent history, tracing its origins most directly to early chal- 
lenges to the universality of certain well-established attribu- 
tional phenomena. It is giving rise to a rapidly growing 
experimental literature that points to qualitative cultural vari- 
ation in basic modes of cognitive processing. 

In some of the early groundbreaking work in this tradition, 
Shweder and Bourne (1984) challenged the completeness of 
contemporary social psychological theories of social attribu- 
tion. It was documented that, as compared with European- 
Americans, Oriyan Hindu Indians place significantly greater 
emphasis in person description on actions versus abstract 
traits, with their person descriptions more frequently making 
reference to the context. Thus, for example, their investiga- 
tion revealed that whereas European-Americans are more 
likely to describe a friend by saying she is friendly, Oriyan 
Indians are more likely to describe the friend by saying she 
brings cakes to my family on festival days. This type of cul- 
tural difference, it was observed, was not explicable in terms 
of the types of objective ecological or individual psychologi- 
cal factors that had been emphasized in previous studies, such 
as variation in schooling, literacy, socioeconomic status, 


linguistic resources, or capacities for abstract thought. Rather, 
the results appeared explicable only when taking into account 
cultural factors. In particular, the trends were demonstrated to 
reflect the contrasting cultural conceptions of the person and 
related sociocultural practices emphasized in Hindu Indian 
versus European-American cultural communities. 

Subsequent cross-cultural developmental research on so- 
cial attribution demonstrated that these types of cultural con- 
siderations give rise to cultural variation in the paths and 
endpoints of development (J. G. Miller, 1984, 1987). It was 
documented that whereas European-American children show 
an age increase in their reference to traits (e.g., she is aggres- 
sive) but no age-related change in their reference to contextual 
considerations, Hindu Indian children show an age increase in 
their references to the social context (e.g., there are bad rela- 
tions between our families) but no age increase in their refer- 
ences to traits. More recently, this type of work has been 
further extended to understanding the development of indi- 
viduals’ conceptions of mind, with cultural work calling into 
question claims that theory of mind understandings develop 
spontaneously toward an end point of trait psychology—and 
providing evidence that they proceed in directions that reflect 
the contrasting epistemological assumptions of local cultural 
communities (Lillard, 1998). 

In other lines of work on social attribution and cognition, 
culturally based social psychological research is calling into 
question the universality of various attributional and cognitive 
tendencies long assumed to be basic to all psychological func- 
tioning, such as motives to maintain self-consistency or to 
emphasize dispositional over situational information. Thus, 
for example, it has been demonstrated that Japanese college 
students tend to maintain weaker beliefs in attitude-behavior 
consistency than do Australian college students (Kashima, 
Siegal, Tanaka, & Kashima, 1992), while being less prone 
than are Canadian college students to show cognitive disso- 
nance biases—that is, tendencies to distort social perceptions 
to make them more congruent with behavior (Heine & Leh- 
man, 1997). Also, relative to European-Americans, various 
East Asian populations have been documented to display 
greater sensitivity to situational information in object percep- 
tion and less vulnerability to the fundamental attributional 
error (Ji, Peng, & Nisbett, 2000), a tendency to treat behaviors 
as correspondent with dispositions. 

New lines of research in this area are also linking cultural 
views of the self and related cultural practices to variation in 
fundamental styles of cognitive processing, such as tenden- 
cies to privilege analytic versus dialectical epistemological 
stances. In one illustration of such a cultural difference, ex- 
perimental research has demonstrated that American under- 
graduates tend to treat information in a polarized manner, as 


seen in their considering scientific evidence as more plausi- 
ble when it is presented alone rather than in conjunction with 
contradictory information (Peng & Nisbett, 1999). In con- 
trast, Chinese undergraduates tend to process information in 
ways that involve greater acceptance of opposing viewpoints, 
as seen in their considering scientific evidence as more plau- 
sible when it is presented in conjunction with contradictory 
information rather than alone. Work of this type calls into 
question the primacy of analytic modes of thought in work in 
cognitive science, highlighting the salience of fundamentally 
different styles of cognitive processing in various East Asian 
cultural populations. 


Self-Processes 


In the area of the self-concept, psychological research is 
challenging the long-standing assumption that individuals 
spontaneously engage in self-maintenance strategies that 
are oriented toward self-enhancement, and that self-esteem 
is universally fundamental to psychological well-being. 
Open-ended attributional research on self-description, for 
example, has documented that whereas the open-ended self- 
descriptions of American adults emphasize positive attri- 
butes (Herzog, Franks, Markus, & Holmberg, 1998), those 
of Japanese adults emphasize either weakness or the absence 
of negative self-characteristics (e.g., m poor at math, I’m 
not selfish). Research has also documented that whereas the 
scores of Americans on measures of self-esteem tend to be 
higher than the scale midpoints—an indication of a tendency 
toward self-enhancement—those of Japanese persons tend 
to be at or slightly below the scale midpoint, an indication of 
a tendency to view the self as similar to others (Diener & 
Diener, 1995). 

One of the most far-reaching implications of this type of 
research is that it calls into question the centrality of self- 
esteem in psychological functioning in various collectivist 
cultural communities, and it suggests that other types of self- 
processes may be more central in everyday adaptation in such 
contexts. In this regard, cross-national survey research has 
shown that self-esteem is more closely associated with life 
satisfaction in individualist than in collectivist cultures 
(Diener & Diener, 1995). In contrast, it is documented that a 
concern with maintaining relationship harmony shows a 
stronger relationship with life satisfaction in collectivist than 
in individualist cultures (Gabrenya & Hwang, 1996). These 
contrasting patterns of interrelationship are further docu- 
mented to distinguish everyday socialization practices and to 
have important adaptive consequences. Thus, for example, 
Chinese as well as Japanese mothers tend to be more self- 
critical of their children’s academic performance than are 
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American mothers (Crystal & Stevenson, 1991), with this 
stance implicated in the tendencies of Chinese and Japanese 
versus American mothers to place greater emphasis on their 
children’s expending effort toward self-improvement and 
having children who show superior levels of academic 
achievement (Stevenson & Lee, 1990). 

Cultural research on the self is also challenging basic psy- 
chological theory in the domain of self-consistency. Social 
psychological theory has long assumed that individuals are in- 
herently motivated to maintain a consistent view of the self 
and that such consistency is integral to psychological well- 
being. This stance is evident not only in classic theories of cog- 
nitive dissonance (Festinger, 1957), but also in more recent 
work on attribution. For example, work on self-verification 
has shown that individuals tend to prefer information that is 
consistent rather than inconsistent about themselves (Swann, 
Wenzlaff, Krull, & Pelham, 1992), as well as that autobio- 
graphical memories are structured in ways that preserve a 
consistent sense of self (Ross, 1989). Equally, work on psy- 
chological health has suggested that having an integrated 
and consistent view of self has adaptive value (Jourard, 1965; 
Suh, 2000). 

A growing body of attributional research in Asian cul- 
tures, however, is suggesting that in these cultures the self 
tends to be experienced as more fluid than is typically ob- 
served in U.S. populations, with sensitivity to context valued. 
Work on self-description has demonstrated, for example, that 
the self-descriptions of Japanese but not of Americans tend to 
vary as a function of the presence of others (Kanagawa, 
Cross, & Markus, 2001). Likewise, experimental research 
has documented that cognitive dissonance effects tend not to 
be observed among Japanese as compared with Canadian 
populations (Heine & Lehman, 1997; Heine & Morikawa, 
2000), and that consistency across situations shows a much 
weaker relationship to psychological well-being among 
Korean as compared with American populations (Suh, 2000). 


Emotions 


Emotions provide a particularly challenging area for cultural 
research because they are phenomena that involve not merely 
perceptions but also behavioral action tendencies and so- 
matic reactions. They thus entail a biological grounding even 
as they also involve essential cultural components. Notably, 
as suggested in the following discussion, culture affects the 
expression of emotions and their form, as well as their role in 
mental health outcomes. 

One important influence of cultural processes on emotion 
occurs in the degree of an emotion’s elaboration or suppres- 
sion. It has been documented that cultural meanings and 
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practices affect the degree to which particular emotions are 
hypercognized (in the sense that they are highly differentiated 
and implicated in many everyday cultural concepts and prac- 
tices) versus hypocognized (in that there is little cognitive or 
behavioral elaboration of them; Levy, 1984). Even universal 
emotions, it has been observed, play contrasting roles in indi- 
vidual experience in different cultural settings. For example, 
whereas in all cultures both socially engaged feelings (e.g., 
friendliness, connection) and socially disengaged feelings 
(e.g., pride, feelings of superiority) may exist, among the 
Japanese only socially engaged feelings are linked with 
general positive feelings, whereas among Americans both 
types of emotions have positive links (Kitayama, Markus, & 
Kurokawa, 2000). 

Cross-cultural differences have also been observed in emo- 
tion categories as well as in individuals’ appraisals of emotions. 
Thus, variation in emotion concepts has been documented not 
only in the case of culturally specific categories of emotion, 
such as the concept of amae among the Japanese (Russell & 
Yik, 1996; Wierzbicka, 1992), but also among such assumed 
basic emotions as anger and sadness (Russell, 1991, 1994). It 
has been shown that Turkish adults make systematically dif- 
ferent appraisals of common emotional experiences than do 
Dutch adults, whose cultural background is more individualist 
(Mesquita, 2001). Thus, as compared with Dutch adults’ ap- 
praisals, Turkish adults tend to categorize emotions as more 
grounded in assessments of social worth, as more reflective of 
reality than of the inner subjective states of the individual, and 
as located more within the self-other relationship than confined 
within the subjectivity of the individual. 

Notably, work on culture and emotions is also providing 
evidence of the open relationship that exists between physio- 
logical and somatic reactions and emotional experiences. For 
example, research has revealed that although Minangkabu 
and American men show the same patterns of autonomic 
nervous system arousal to voluntary posing of prototypical 
emotion facial expressions, they differ in their emotional 
experiences (Levenson, Ekman, Heider, & Friesen, 1992). 
Whereas the Americans tend to interpret their arousal in this 
type of situation in emotional terms, Minangkabu tend not to 
experience an emotion in such cases, because it violates their 
culturally based assumptions that social relations constitute 
an essential element in emotional experience. 

Finally, important cultural influences on the mental health 
consequences of affective arousal are also being documented. 
For example, various somatic experiences—such as fatigue, 
loss of appetite, or agitation—that are given a psychological 
interpretation as emotions by European-Americans tend not 
to be interpreted in emotional terms but rather as purely 


physiological events among individuals from various Asian, 
South American, and African cultural backgrounds (Shweder, 
Much, Mahapatra, & Park, 1997). It is notable that such 
events tend to be explained as originating in problems of 
interpersonal relationships, thus requiring some form of 
nonpsychological form of intervention for their amelioration 
(Rosaldo, 1984; White, 1994). 


Motivation 


Whereas early cross-cultural research was informed exclu- 
sively by existing theoretical models, such as Rotter’s frame- 
work of internal versus external locus of control (Rotter, 
1966), recent work is suggesting that motivation may assume 
socially shared forms. This kind of focus, for example, is re- 
flected in the construct of secondary control, which has been 
identified among Japanese populations, in which individuals 
are seen as demonstrating agency via striving to adjust to sit- 
uational demands (Morling, 2000; Morling, Kitayama, & 
Miyamoto, 2000; Weisz et al., 1984). Equally, work in India 
has also pointed to the existence of joint forms of control, in 
which the agent and the family or other social group are 
experienced as together agentic in bringing forth certain 
outcomes (Sinha, 1990). 

In another related area of work on motivation, research is 
highlighting the positive affective associations linked with 
fulfillment of role-related responsibilities. This type of docu- 
mentation notably challenges what has been the assumption 
informing much psychological theory—that behavior is ex- 
perienced as most agentic when it is freely chosen rather than 
socially constrained and that social expectations are invari- 
ably experienced as impositions on individual freedom of 
choice. For example, behavioral research on intrinsic motiva- 
tion has documented that Asian-American children experi- 
ence higher intrinsic motivation for an anagrams task that has 
been selected for them by their mothers than for one that they 
have freely chosen (Iyengar & Lepper, 1999). In contrast, it is 
shown that European-American children experience greater 
intrinsic motivation when they have selected such a task for 
themselves. 

Further support for this view that agency is compatible 
with meeting role expectations may be seen in attributional 
research, which has documented that Indian adults indicate 
that they would want to help as much and derive as much 
satisfaction in helping when acting to fulfill norms of rec- 
iprocity as when acting in the absence of such normative 
expectations (J. G. Miller & Bersoff, 1994). Such a trend 
contrasts with that observed among Americans, who assume 
that greater satisfaction is associated with more freely chosen 


helping. These kinds of results challenge prevailing models 
of communal relationships, which assume that a concern with 
obligation detracts from a concern with being responsive to 
the others’ needs (Mills & Clark, 1982). They also challenge 
models of self-determination, which assume that internaliza- 
tion involves a greater sense of perceived autonomy (Deci & 
Ryan, 1985). Rather, it appears that in certain collectivist cul- 
tures individuals may experience their behavior as demanded 
by role requirements, while also experiencing themselves 
as strongly endorsing, choosing to engage in, and deriving 
satisfaction from the behavior. 

In turn, work in the area of morality, relationships, and at- 
tachment highlights the need to expand current conceptual- 
izations of motivation. For example, research in the domain 
of morality with both Hindu Indian populations (Shweder, 
Mahapatra, & Miller, 1990) as well as with orthodox reli- 
gious communities within the United States (Jensen, 1997) 
has documented forms of morality based on concerns with 
divinity that are not encompassed in existing psychological 
theories of morality, with their exclusive stress on issues of 
justice, individual rights, and community (e.g., Kohlberg, 
1971; Turiel, 1983). Furthermore, work on moralities of com- 
munity have documented the highly individualistic cultural 
assumptions that inform Gilligan’s morality of caring frame- 
work (Gilligan, 1982), with its emphasis on the voluntaristic 
nature of interpersonal commitments. Cross-cultural work 
conducted on the morality of caring among Hindu Indian 
populations and cross-cultural work conducted utilizing 
Kohlbergian methodology have uncovered the existence of 
forms of duty-based moralities of caring that although fully 
moral in character, differ qualitatively in key respects from 
those explained within Gilligan’s framework (J. G. Miller, 
1994, 2001b; Snarey & Keljo, 1991). 

In terms of relationship research, a growing cross-cultural 
literature on attachment is suggesting that some of the ob- 
served variation in distribution of secure versus nonsecure 
forms of attachment arises at least in part from contrasting 
cultural values related to attachment, rather than from certain 
cultural subgroups’ having less adaptive styles of attachment 
than others. For example, research conducted among Puerto 
Rican families suggests that some of the greater tendency of 
children to show highly dependent forms of attachment re- 
flects the contrasting meanings that they place on interdepen- 
dent behavior (Harwood, Miller, & Irizarry, 1995). Thus, an 
analysis of open-ended responses of mothers revealed that 
compared with European-American mothers, Puerto Rican 
mothers viewed dependent behavior relatively positively as 
evidence of the child’s relatedness to the mother. Suggesting 
that present dimensions of attachment may not be fully 
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capturing salient concerns for Puerto Rican mothers, this 
work further demonstrated that Puerto Rican mothers spon- 
taneously emphasized other concerns—such as display of 
respect and of tranquility—that are not tapped by present at- 
tachment formulations. 

In other research, recent work on attachment among 
Japanese populations highlights the greater emphasis on 
indulgence of the infant’s dependency and on affectively 
based rather than informationally oriented communication in 
Japanese versus American families (Rothbaum, Weisz, Pott, 
Miyake, & Morelli, 2000). In contrast to the predictions of at- 
tachment theory, however, such forms of parenting are not 
linked with maladaptive outcomes; rather, these parenting 
styles have positive adaptive implications, in fitting in with 
the cultural value placed on amae, an orientation that in- 
volves presuming upon another’s dependency and plays an 
important role in close relationships throughout the life cycle. 
Such research has pointed out that the common finding that 
Japanese attachment more frequently takes what are consid- 
ered as insecure or overly dependent forms reflects biases in 
present conceptions of attachment, which fail to take into 
account the concerns with interdependence in the Japanese 
context. Furthermore, it is noted that methodologically, the at- 
tachment research paradigm presents a separation context that 
is much rarer and thus much more stressful for Japanese than 
for American infants. Equally, it is suggested that (rather than 
treat the individual as the unit of attachment) to fully capture 
Japanese attachment-related concerns would require treat- 
ing the individual-caregiver unit rather than the indi- 
vidual alone as the object of attachment assessment, with a 
focus on how well individuals can anticipate each other’s 
responses. 


Summary 


Work in cultural psychology is not only documenting cultural 
variability in psychological outcomes, but is also focused on 
uncovering respects in which this variation has theoretical 
implications in pointing to the implicit cultural underpinnings 
of existing psychological effects, as well as respects in which 
psychological theory needs to be conceptually expanded to 
account for culturally diverse modes of psychological func- 
tioning. We have seen specifically that cultural work is high- 
lighting the culturally mediated nature of cognition through 
individuals’ participation in everyday cultural practices and 
use of culturally specific tools; such work has also uncovered 
the existence of contrasting culturally based cognitive styles, 
as well as extensive cultural variation in basic psychological 
processes involving the self, emotions, and motivation. 
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Challenges 


Whereas there has been a dramatic increase in interest in cul- 
tural research in recent years, there nonetheless remains a 
sense in which cultural perspectives remain in a marginal 
position in the discipline. This may be seen in the stance 
adopted for cultural considerations most frequently—to be 
treated in a diversity sense, as relevant in explaining excep- 
tions from what are assumed to be the general or default 
patterns—and for psychological theory and psychological 
generalizations commonly to be formulated without refer- 
ence to cultural considerations. Concerns have also been 
raised about the quality of existing cultural research. In this 
regard, for example, criticisms have been made of the predic- 
tive power of recent work in social psychology based on the 
individualism-collectivism paradigm (e.g., Oyserman et al., 
2002; Matsumoto, 1999). Charges have also been made that 
at least some contemporary cultural research is somewhat 
simplistic, if not stereotypical, and fails to capture the sub- 
tlety of particular cultural outlooks or to forward sophisti- 
cated contextually sensitive accounts of psychological 
functioning (J. G. Miller, 2002). Consideration here is given 
to ways to overcome some of these limitations and of how to 
further the promise and potential of cultural psychology to 
broaden and enrich basic psychological theory. 


Process-Oriented Views of Culture 


Social psychological traditions of cultural research in particu- 
lar have been influenced by the views of culture held in the tra- 
dition of individualism-collectivism. This link has occurred 
largely because of the tremendous influence of the distinction 
introduced by Markus and Kitayama (1991) between interde- 
pendent versus independent self-construals. As introduced, 
this distinction embodied a set of dichotomous contrasts that 
were presented as characterizing a wide range of cultures, 
with the independent view of self characteristic of North 
American as well as many Western European cultural popula- 
tions and the interdependent view of self characteristic of 
many Asian and African cultures. Thus, for example, whereas 
the independent self was defined as “separate from social con- 
text, bounded, unitary, stable, and focused on internal private 
features (abilities, thoughts, feelings)”, the interdependent 
self was defined in polar opposite ways as “connected with so- 
cial context, flexible, variable, and focused on external public 
features (status, roles, relationships)” (Markus & Kitayama, 
1991, p. 230). 

When presenting this global dichotomy, Markus and 
Kitayama (1991) cautioned about drawing direct links be- 
tween the type of general cultural schemas that they were 


identifying and individual self-representations. In this regard, 
for example, they noted respects in which individual self- 
concepts reflect a range of factors, including “gender, race, 
religion, social class, and one’s particular social and devel- 
opmental history” (p. 230). They also stressed that both 
independent and interdependent orientations toward self are 
found in all societies, although these orientations take some- 
what culturally specific forms. However, many social psycho- 
logical investigators adopted the independent-interdependent 
self distinction in a nonnuanced manner that has ended up 
being somewhat stereotypical and simplistic in its characteri- 
zation of culture and overly global in its views of how culture 
influences psychological phenomena. 


Variation Between and Within Cultural Communities 


In future research, it is critical to attend to the variation 
within different collectivist and individualist cultures and to 
the frequent overlap between cultural groups. Also, greater 
attention needs to be given to variation within culture that 
may be linked to social class, ethnicity, and experiences of 
discrimination or oppression. 

In this regard, recent cultural research that has focused on 
varieties of individualism and collectivism has been valuable 
in that it points to psychological consequences linked to such 
variation. For example, research has suggested that Japanese 
individuals tend to approach social relations by focusing on 
the peer group, whereas Chinese individuals tend to adopt 
more of an authority-directed stance (Dien, 1999). It has also 
been documented that regional variation occurs in forms of 
individualism within the United States, such as the concerns 
with a culture of honor found in southern and western parts of 
the United States (Nisbett & Cohen, 1996). Notably, socio- 
linguistic and ethnographic research has also documented 
that within lower-class and working-class communities 
within the United States, there tends to be what has been 
characterized as a “hard defensive” type of individualism, 
which stresses adoption of abilities to cope in harsh everyday 
environments, in contrast with the “soft” individualism, 
which stresses the cultivation of individual uniqueness and 
gratification within middle-class contexts (Kusserow, 1999). 


Attention to Cultural Practices 


A limitation of current work on culture has also been the 
tendency to conceptualize culture purely in ideational terms. 
This type of stance is reflected in the reliance on scale 
measures of individualism-collectivism that have tended to 
portray cultures as systems of value orientations. Current 
conceptualizations have also been problematic in treating 


cultural meanings as individual-difference attitudinal or per- 
sonality variables—a stance that fails to recognize the multi- 
ple motives and personality factors that may be satisfied by 
given cultural practices, resulting in the lack of a one-to-one 
relationship between personality and culture. 

In future research, it is important to recognize the com- 
plexity of cultural meanings. This means acknowledging cul- 
ture not merely as knowledge about experience or as norms 
but also as constitutive propositions that serve to define and 
create social realities. It is equally critical to view cultural 
meanings as embodied in material artifacts, social institu- 
tions, and cultural tools, as well as expressed and communi- 
cated in everyday activities and practices. It is important that 
this type of stance is being recognized in the recent emphasis 
on the construct of cultural “selfways” or “custom com- 
plexes” that treat culture as including ideational and process- 
oriented elements that are mutually supportive (Greenfield, 
1997; Markus, Mullally, & Kitayama, 1997; Shweder et al., 
1998). It is important that the present type of concern also ex- 
pands current understandings of culture in highlighting the 
frequently implicit and covert nature of cultural meanings, 
with many cultural commitments experienced by agents as 
facets of nature rather than of culture per se—a stance that 
contributes to their motivational force for individuals. 

Finally, in future research, there is a need to integrate both 
symbolic and ecological views of culture. Symbolic views 
call attention to the arbitrary nature of cultural meanings and 
the extent to which they rest on nonrational commitments, 
rather than purely on functional considerations of utility. In 
turn, ecological approaches call attention to the material as- 
pects of sociocultural systems, pointing to the need to take 
into account material constraints, resources, and issues of 
power and control in understanding sociocultural processes. 
In this regard, it is important to understand respects in which 
cultural and ecological effects are mutually influential. Thus, 
for example, research has shown not only that Puerto Rican 
mothers differ qualitatively in their views of attachment from 
European-American mothers, but also that both common and 
culturally specific effects of social class are observed in each 
case (Harwood et al., 1995). 


Culturally Nuanced Models of Cultural Influences 


One of the limitations of existing views of cultural influences 
on psychological processes has been the tendency to treat cul- 
tural differences as mapping onto personality differences. 
Ironically, this was one of the problematic aspects of early 
work in the tradition of culture and personality. As noted ear- 
lier, theorists criticized this work as presenting an overly so- 
cialized conception of the person as merely conforming to 
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existing social norms and requirements. It also was criticized 
for positing an isomorphism between personality and individ- 
ual motivation, and for failing to recognize the open-ended 
relationship between them. Notably, another problematic as- 
pect of contemporary treatment of cultural influences has 
been the tendency to view cultural influences on psychologi- 
cal processes as highly generalized rather than as context- 
ually dependent. This also appears related to a tendency to 
adopt a dispositional view of cultural effects as giving rise to 
global orientations that generalize across contexts or as uni- 
form and noncontextually mediated forms of perceptual 
biases. 

To develop more nuanced views of cultural influences on 
psychological functioning, it is critical, then, to attend both to 
individual differences and to cultural influences rather than to 
assume that individual differences map directly onto cultural 
differences. This involves recognizing the variation in indi- 
vidual attitudinal and personality measures within culture. It 
also involves taking into account that culture frequently has 
its impact on psychological processes through affecting indi- 
viduals’ participation in normative contexts—with their var- 
ied normative requirements—rather than through affecting 
enduring psychological individual-difference variables. 

Notably, to develop contextually sensitive views of cul- 
tural influences on psychological functioning requires taking 
into account the variation observed across contexts. Thus, for 
example, it cannot be assumed that because a concern with 
social relations and with a more interdependent view of self 
has been seen in collectivist cultures, individuals from col- 
lectivist cultures always give more weight to contextual ef- 
fects than do individuals from individualist cultures. Rather, 
it must be recognized that culture influences the meanings 
given to contexts, and—depending on these meanings—there 
will be occasions in which individuals from collectivist cul- 
tures may show less variation in their judgments across con- 
texts than do individuals from individualist cultures; or in 
some situations, observed cultural differences may even 
reverse (e.g., Cousins, 1989). 


International and Interdisciplinary Approaches 
to Scholarship 


In order to formulate approaches to culture that are dynamic 
and nuanced, it is essential for researchers to gain an under- 
standing of the meanings and practices emphasized in the 
particular cultural communities in which they work. Such an 
understanding can be promoted through a range of processes, 
including collaboration with individuals from the culture, 
spending time in residence in the culture, learning the local 
language, or any combination of these. It is also likely that 
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research that is informed by in-depth understandings of dif- 
ferent cultural communities will become more common in 
psychology in the future. As the field becomes increasingly 
international and culturally diverse, investigators will be able 
to bring to their research cultural sensitivities and concerns 
contrasting with those presently dominating the discipline. 

There is equally a need for future research on culture to 
become increasingly interdisciplinary, with investigators tak- 
ing into account the conceptual and methodological insights 
of anthropological and sociolinguistic research traditions and 
avoiding the present insularity that results from ignoring or 
dismissing work from different disciplinary viewpoints. This 
neglect can yield findings considered to have relatively little 
importance from the perspective of the other traditions. How- 
ever, given the overlap in concerns across these research 
traditions and given their contrasting strengths, greater inter- 
disciplinary exchange can only serve to enhance progress in 
the field. 


Summary 


To enhance the quality of existing cultural research, it is im- 
portant for investigators to go beyond dichotomous frame- 
works for understanding cultural differences, such as the 
global dimensions of individualism-collectivism. These types 
of frameworks fail to capture the complexity of individual 
cultural systems, portraying cultures in ways that are overly 
static, uniform, and isolated. Effort must be made to develop 
more nuanced views of culture through attending to everyday 
cultural activities and practices as well as to symbolic culture 
and ecological dimensions of contexts. Additionally, attention 
must be given both to individual differences and to cultural 
influences—the assumption should not be made that individ- 
ual differences map directly onto cultural variation. Finally, 
the sensitivity of cultural research stands to be enhanced 
through researchers’ working to gain a greater understanding 
of the specific cultural communities which they study. 


CONCLUSION 


In conclusion, the present examination of culture in social 
psychological theory highlights the importance of recogniz- 
ing that culture is part of human experience and needs to be 
an explicit part of psychological theories that purport to pre- 
dict, explain, and understand that experience. What work in 
culture aims to achieve, and what it has already accomplished 
in many respects, is more than to lead investigators to treat 
psychological findings and processes as limited in generality. 
Rather than leading to an extreme relativism that precludes 


comparison, work in this area holds the promise of leading to 
the formulation of models of human experience that are in- 
creasingly culturally inclusive. By calling attention to the 
cultural meanings and practices that form the implicit context 
for existing psychological effects, and by broadening present 
conceptions of the possibilities of human psychological func- 
tioning, work in cultural psychology is contributing new con- 
structs, research questions, and theoretical insights to expand 
and enrich basic psychological theory. 
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Until recently, the study of personality was handicapped 
by the lack of a systematic taxonomy of constructs to repre- 
sent individual differences. A confusing array of con- 
structs and measures was available, and different measures of 
purportedly the same construct often showed little correspon- 
dence. This diversity hindered the development of a system- 
atic understanding of individual differences. Recently, the 
situation began to change with emerging agreement about 
some of the major dimensions of personality. Broad traits 
such as neuroticism-stability, extraversion-introversion, and 
psychoticism-constraint are identified in most analyses of 
personality traits and part of most descriptive systems. There 
is also agreement about the way personality is organized. 
Models based on trait concepts assume that traits differ along 
a dimension of breadth or generalization and that traits are hi- 
erarchically organized, with global traits such as neuroticism 
subdividing into a set of more specific traits such as anxious- 
ness and dependence (Goldberg, 1993; Hampson, John, & 
Goldberg, 1986). 

Within this framework, attention has focused particularly 
on the five major factors as a parsimonious taxonomy of per- 
sonality traits (Goldberg, 1990). Lexical analyses of the nat- 
ural language of personality description (Digman, 1990; 
Goldberg, 1990) and subsequent psychometric studies of 
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personality inventories (Costa & McCrae, 1992) have con- 
verged in identifying five broad factors typically labeled 
extraversion or surgency, agreeableness, conscientiousness, 
emotional stability versus neuroticism, and intellect, culture, 
or openness. It is widely assumed that this structure is trans- 
forming our understanding of personality and that the higher- 
order structure of personality is becoming more clearly 
delineated. Enthusiasm for the emergent structure, although 
understandable because it promises to bring coherence to a 
field characterized more by conceptual and theoretical debate 
than by substantive findings, tends to minimize confusions 
that still exist regarding the number and content of higher- 
order domains (Zuckerman, 1991, 1995, 1999; Zuckerman, 
Kulhman, Joireman, Teta, & Kraft, 1993) and nature of the 
assumed hierarchical arrangement of traits. 

These problems remain unresolved despite numerous 
attempts to explicate personality structure, partly because the 
methods used incorporate subjective elements regarding 
choice of analytic strategies and data interpretation, and 
partly because personality concepts are inherently fuzzy, a 
factor that contributes to interpretive problems. In this chap- 
ter, we examine the contribution that behavioral genetic 
approaches can make to explicating the structure of personal- 
ity and resolving issues of the number and content of 


60 Genetic Basis of Personality Structure 


domains. The argument we advance is that an approach that 
contributes to understanding of the causes of trait covariation 
(as opposed to approaches that simply offer descriptions of 
trait covariation) offer an important perspective on these 
intractable taxonomic problems. 


DOMAIN DEFINITION: UNRESOLVED PROBLEMS 
WITH PHENOTYPIC STRUCTURE 


Number of Domains 


Despite the dominance of the five-factor approach, disagree- 
ment still exists on the number of dimensions required to 
represent the higher-order structure of personality. Almagor, 
Tellegen, and Waller (1995), for example, suggested that 
five factors do not capture all dimensions of the natural lan- 
guage of personality because lexical analyses excluded terms 
that were evaluative or described temporary states such as 
mood. When they used an unrestricted set of terms, seven 
factors were identified. Five factors—Positive Emotionality, 
Negative Emotionality, Dependability, Agreeableness, Nega- 
tive Emotionality, and Conscientiousness—corresponded to 
the five-factor dimensions of Extraversion, Neuroticism, 
Conscientiousness, Agreeableness, and Openness (nega- 
tively), respectively. The remaining factors were evaluative 
dimensions, Positive Valence and Negative Valence, which 
are not represented in the five-factor model. They concluded 
that the seven-factor model provides a better representation 
of lexical descriptions of personality. McCrae and John 
(1992) and Widiger (1993) refuted this conclusion, claiming 
that positive and negative valence factors could be assumed 
under the five factors. 

Whereas Almagor, Tellegen, and Waller (1995) main- 
tained that the five-factor model is too parsimonious, Eysenck 
(1991) suggested that it is not parsimonious enough. He ar- 
gued that the five domains differ in abstractness and that the 
five dimensions could be accommodated within his three- 
factor model of Psychoticism, Extraversion, and Neuroticism 
because the Openness and Agreeableness domains are merely 
facets of Psychoticism. Studies examining the relationship 
between NEO-PI-R (Neurosis Extraversion Openness- 
Personality Inventory-Revised) and EPQ-R (Eysenck Person- 
ality Questionnaire-Revised), however, suggest that although 
the two scales overlap they assess unique aspects of personal- 
ity (Avia et al., 1995; Draycott & Kline, 1995). These prob- 
lems occur because the five factors, although assumed to be 
orthogonal, in fact intercorrelate. For example, correlations 
between NEO-PI-R Neuroticism and Conscientiousness 


domains and Extraversion and Openness to Experience do- 
mains are —.53 and .40 respectively (Costa & McCrae, 1992). 
These values raise the important issue of what degree of over- 
lap or covariation between domains is tolerable. Whether 
these values are interpreted as unimportant or substantial de- 
pends largely on the investigator’s theoretical perspective. 


Domain Definition 


A related issue is lack of agreement on the lower-order traits 
that define each domain. Identification of an optimal set of 
lower-order traits has proved difficult (Costa & McCrae, 
1998). Questions about whether a facet belongs to a proposed 
domain are raised when it consistently correlates with facets 
comprising another domain. For example, although Costa 
and McCrae (1992) report a moderate correlation of —.25 
between total domain scores for Neuroticism and Agreeable- 
ness, the correlations between the Neuroticism facet Angry 
Hostility and Agreeableness facets Trust, Altruism, and Com- 
pliance are —.42, —.34, and —.49, respectively, and the cor- 
relation between Angry Hostility and the total Agreeableness 
domain score is —.47. Similarly, the correlation between the 
total Neuroticism domain score and the Agreeableness facet 
Trust is —.37. How this overlap is interpreted often forms the 
basis of many authors’ claims as to why their model provides 
the “correct” description of personality. As with the intercor- 
relations among domains, the interpretations placed on the 
findings are largely arbitrary. 

This problem is also revealed by factor analyses of facet 
scales. Although factor loadings may conform to simple 
structure and the hypothesized five-factor pattern, some facets 
may have an appreciably lower loading than do the other 
facets defining a domain. This occurs with the NEO-PI-R 
Neuroticism facet of Impulsiveness. The correlations be- 
tween Impulsiveness and the other Neuroticism facets range 
from .31 to .40 (Costa & McCrae, 1992). The median inter- 
correlation is .35, whereas the median intercorrelation among 
the other facets is .57. Findings such as these raise questions 
about the definition of domains and the possibility that addi- 
tional domains are required to provide a comprehensive 
taxonomy. 

The issue of establishing a coherent set of traits for each 
domain is related to the interpretation of each domain. Even 
within the five-factor approach there are differences in the in- 
terpretation of some domains, especially the domain that 
Costa and McCrae label Openness to Experience. They em- 
phasize such defining characteristics as artistic, curious, orig- 
inal, and having wide interests (McCrae & Costa, 1985a, 
1985b). In the NEO-PI-R, the factor is defined by ideas 


(curious), fantasy (imagination), aesthetics (artistic), actions 
(wide interests), feelings (excitable), and values (unconven- 
tional). Others consider the domain to represent culture or 
intellect (Digman, 1990; Saucier & Goldberg, 1996). John 
and Srivastava (1999) maintained that the culture label 
(Passini & Norman, 1966) is not supported by evidence that 
traits referring to culture such as civilized, polished, digni- 
fied, foresighted, and logical load more highly on the consci- 
entiousness factor. This leaves the alternative interpretation 
of intellect (Digman & Inouye, 1986; Goldberg, 1990). How- 
ever, John and Srivastava (1999) concluded that the evidence 
supports the Costa and McCrae interpretation and that intel- 
lect is merely a component of a broader openness factor. This 
interpretation is supported by studies of the relationship 
between the domain and measures of cognitive ability. For 
example, the openness-intellect factor (Understanding, Sen- 
tience, Change, and Autonomy) based on the Personality 
Research Form (Jackson, 1984) correlates highly with mea- 
sures of crystallized intelligence (e.g., verbal subscales) but 
less with measures of fluid ability (arithmetic and perfor- 
mance subscales; Ashton, Lee, Vernon, & Jang, 1999). 

There are other, less easily resolved confusions about the 
definition and facet structure of other domains. For example, 
Conscientiousness according to Costa and McCrae (1995) 
consists of a single factor defined by competence, order, du- 
tifulness, achievement striving, self-discipline, and deliber- 
ateness. Paunonen and Jackson (1996), however, question the 
unity of conscientiousness: “... The domain is best thought 
of as three separate, but somewhat overlapping, dimensions 
related to being (a) methodical and orderly, (b) dependable 
and reliable, and (c) ambitious and driven to succeed. More- 
over, the amount of overlap among these three facets may not 
be high enough to justify their inclusion in an overall Consci- 
entiousness measure” (p. 55). 

The cluster of traits labeled impulsive-sensation seeking 
poses an even greater problem. Earlier, we discussed prob- 
lems with the placement of impulsiveness within the five- 
factor model. The controversy, however, is deeper. For Zuck- 
erman (1991, 1994), impulsivity and sensation seeking define 
a separate higher-order factor within an alternative five-factor 
structure. The factor resembles Eysenck’s psychoticism and 
Tellegen’s (1985) constraint. There appears, therefore, to be 
strong support for this domain. The five-factor model of 
Costa and McCrae, however, divides this factor into impul- 
sivity and sensation seeking and assigns them to different 
domains. Impulsivity is considered part of neuroticism, an 
interpretation that is not shared by other conceptions of neu- 
roticism, whereas sensation seeking is assigned to extraver- 
sion. This leads to similar problems with extraversion. 
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Extraversion is defined by subsets of traits that differ across 
models. These subsets include such traits as sociability or af- 
filiation, agency, activation, impulsive-sensation seeking, 
positive emotions, and optimism (Depue & Collins, 1999; 
Watson & Clark, 1997). Depue and Collins (1999) pointed 
out that most accounts of extraversion postulate two central 
features, an interpersonal engagement component consisting 
of affiliation or sociability and agency, and an impulsivity 
component that includes sensation seeking. They suggested 
that impulsive-sensation seeking arises from the interaction 
of extraversion and a second independent trait represented by 
Tellegen’s (1985) constraint. This proposal differs, however, 
from Eysenck’s model that places impulsivity in the psy- 
choticism domain and Costa and McCrae’s proposal that it 
belongs to neuroticism. It also differs from Gray’s (1973, 
1987; Pickering & Gray, 1999) model that considers impul- 
sivity as assessed by questionnaire to be a blend of Eysenck’s 
higher-order dimensions of extraversion and psychoticism. It 
appears, therefore, that there are major unresolved defini- 
tional problems with most domains that compromise claims 
that the five-factor model provides a basic assessment frame- 
work (McCrae & Costa, 1986). 

The existence of such basic uncertainty about the taxon- 
omy of personality traits would seem to suggest that state- 
ments that the structure of personality is becoming delineated 
might be a little premature. Uncertainty about the relation- 
ships among traits is a major obstacle to constructing a theory 
of individual differences and clarification of these issues is 
essential for the field to advance. The ordering of traits within 
each domain forms the basis for developing theoretical ex- 
planations by defining relationships that require explanation. 
In effect, a descriptive taxonomy shapes subsequent research 
and theory development. 


Approaches to Domain Definition 


In response to these challenges, especially Paunonen and 
Jackson’s (1996) critique of conscientiousness, Costa and 
McCrae (1998) outlined six methodological approaches that 
can be used to demonstrate the unity of any domain: (a) item 
content analysis, (b) definitions of psychological opposites, 
(c) examination of empirical correlates, (d) interpreting sec- 
ondary and tertiary factor loadings, (e) identification of 
equivalents in specialized languages and (f) case studies. 
Costa and McCrae (1998) applied these approaches to show 
that the Conscientiousness domain was unitary in nature. The 
limitation of these proposals is their reliance on an array of 
criteria that incorporate a subjective element. The proposal 
relies on a convergence of evidence across sets of traditional 
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phenotypic and psychometric analyses. However, numerous 
psychometric studies have not resolved these problems, rais- 
ing the possibility that studies of phenotypes alone may not 
be sufficient. 

The problem with phenotypic analyses is their reliance on 
constructs that are by their nature fuzzy and imprecise. This 
is illustrated by the confusion noted about the components of 
extraversion (Depue & Collins, 1999; Watson & Clark, 
1997). Conceptions of extraversion include sociability or af- 
filiation (includes agreeableness, affiliation, social recogni- 
tion, gregariousness, warmth, and social closeness), agency 
(surgency, assertion, endurance, persistence, achievement, 
social dominance, ascendancy, ambitiousness), activation 
(liveliness, talkativeness, energy level, activity level, activity 
level), impulsive-sensation seeking (impulsivity, sensation 
seeking, excitement seeking, novelty seeking, boldness, risk 
taking, unreliability, disorderliness, adventurousness, thrill 
and adventure seeking, monotony avoidance, boredom sus- 
ceptibility), positive emotions (positive affect, elatedness, 
enthusiasm, exuberance, cheerfulness, merriness, joviality), 
and optimism (Depue & Collins, 1999). 

This list reveals the problems faced by attempts to delin- 
eate phenotypic structure. Not only does the content of extra- 
version differ across models, but the definition of each basic 
or lower-order trait may also differ across models and mea- 
sures. Moreover, the meaning of putatively distinct traits 
overlaps so that facet traits defining a given domain shade 
into each other and into facet traits defining other domains. 
This fuzziness is probably an inevitable consequence of 
using natural language concepts that evolved to capture so- 
cially significant behaviors that are multidetermined. It adds 
to concerns that the taxonomies of phenotypic traits may not 
represent natural cleavages in the way behavior is organized 
nor reflect underlying etiological structures. 

This fuzziness contributes to the considerable variability 
in personality phenotypes so that minor variations in mea- 
sures and samples influence the number and contents of fac- 
tors. The problem is compounded by the fact that many 
decisions about methodology and analytic strategies have an 
arbitrary component. More objective criteria are needed to 
guide decisions on the number of higher-order domains and 
the location of lower-order or basic traits within domains and 
to define a systematic set of basic traits. Phenotypic analyses 
are concerned primarily with describing trait covariation. 
This evokes the oft-voiced criticism of the five-factor 
approach—it is descriptive rather than explanatory. The 
basic problem of why traits are related to each other is not 
considered. An understanding of etiology of trait covariance, 
especially genetic etiology, would provide a conceptual foun- 
dation for current models. At each level of the trait hierarchy, 


traits and behaviors, including test items, could be grouped 
according to a shared etiology. Etiology would provide an 
additional criterion to supplement the usual psychometric cri- 
teria such as proposed by Costa and McCrae (1997) to guide 
decisions on the number and content of domains. Identifica- 
tion of a robust model of personality structure would be facil- 
itated by evidence that a given phenotypic structure reflects 
the genetic architecture of personality traits. Unfortunately 
there are few studies of the genetic architecture underlying 
multiple personality traits compared to studies of phenotypic 
structure. Evidence that a given phenotypic structure paral- 
lels genotypic structure would support the validity and gener- 
alizability of the structure. 


HERITABILITY 


The foundation for an etiological understanding of personal- 
ity structure and for a behavioral genetic approach is pro- 
vided by evidence that genetic influences account for 
approximately 40-60% of the variance for virtually all per- 
sonality traits, with most of the remaining variance being ex- 
plained by nonshared environmental effects (Bouchard, 
1999; Loehlin & Nicholls, 1976; Plomin, Chipeur, & 
Loehlin, 1990). The broad traits of extraversion and neuroti- 
cism have received most attention. The data from several 
twin studies yield heritability estimates of approximately 
60% for extraversion and 50% for neuroticism. Loehlin 
(1992) also examined multiple personality scales organized 
according to the five-factor framework. Estimates of about 
40% heritability were obtained for each domain. Subsequent 
studies using the NEO-PI-R yielded heritability estimates of 
41% for neuroticism, 53% for extraversion, 41% for agree- 
ableness, and 40% for conscientiousness (Jang, Livesley, 
Vernon, & Jackson, 1996; see also Bergeman et al., 1993; 
Jang, McCrae, Angleitner, Riemann, & Livesley, 1998). Non- 
additive genetic effects accounted for 61% the variance in 
openness to experience. 

Although the evidence points to a significant genetic com- 
ponent to personality traits, it has been suggested that traits 
could be divided into temperament traits that have a substan- 
tial heritable component and character traits that are largely 
environmental in origin. If this is the case and environmental 
factors give rise to distinct traits, the role of genetic criteria in 
clarifying trait structure would be limited. The evidence does 
not, however, support the proposal. Putatively charactero- 
logical traits such as openness to experience are as herita- 
ble as so-called temperament traits. Moreover, molecular 
genetic studies have found significant allelic associations be- 
tween so-called character traits such as cooperativeness and 


self-directedness as assessed using the Temperament and 
Character Inventory and the 5-HTTLPR allele (Hamer, 
Greenberg, Sabol, & Murphy, 1999). 

To date, a self-report measure of personality that has no 
genetic influence has not been identified (Plomin & Caspi, 
1998). The qualification should be added that heritability stud- 
ies have relied largely on self-report measures—alternative 
methods of assessment may yield different results. However, 
this was not the case with the few studies using other methods 
(Heath, Neale, Kessler, Eaves, & Kendler, 1992; Riemann, 
Angleitner, & Strelau, 1997). Riemann and colleagues (1997), 
for example, reported a twin study conducted in Germany and 
Poland that compared assessments of the five factors using 
self-report questionnaires with peer ratings. Estimates of her- 
itability based on self-report were similar to those reported by 
other studies. The peer ratings also showed evidence of heri- 
tability, although estimates were lower than those obtained 
from self-reports. Multivariate genetic analyses showed that 
the same genetic factors contributed to self-report and peer 
ratings. These results suggest that findings of a heritable com- 
ponent to all self-report measures are likely to generalize to 
other methods of measurement. 

Evidence of heritability alone, however, is not sufficient to 
justify the use of behavioral genetic criteria to clarify trait 
structure. It is possible that environmental factors that ac- 
count for about 50% of the variance have a substantial effect 
on trait covariation. If this were the case, the finding that 
traits are genetically related would be of less value in clarify- 
ing personality structure. The evidence, however, suggests 
that the phenotypic structure of traits closely parallels the un- 
derlying genetic architecture (Livesley, Jang, & Vernon, 
1998; Loehlin, 1987)—a point that is discussed in detail later 
in this chapter. 

It should be noted, however, that information about heri- 
tability merely explains the variance in a single trait as op- 
posed to the covariance between traits. Such information has 
limited value in explicating personality structure. As Turk- 
heimer (1998) argued, all individual differences in behavior 
are heritable and “...the very ubiquity of these findings 
make them a poor basis for reformulating scientists’ concep- 
tions of human behavior” (p. 782). Nevertheless, information 
on heritability forms the foundation for understanding of the 
etiology of personality. The major contribution of behavior 
genetics to understanding personality structure, however, 
comes from multivariate genetic analyses that elucidate the 
genetic structure underlying multiple traits (Carey & DiLalla, 
1994). Multivariate analyses extend univariate analysis of the 
genetic and environmental influences on a trait to evaluate 
genetic and environmental components of the covariation be- 
tween two or more traits (DeFries & Fulker, 1986). It is this 
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extension that promises to contribute to personality theory by 
explicating the etiological basis for trait covariance by evalu- 
ating the degree to which different traits are influenced by the 
same genetic and environmental factors. This issue is central 
to resolving some of the problems of personality description 
and structure. 


THE ETIOLOGICAL BASIS OF COVARIANCE 


The phenotypic covariation between two traits may be due to 
pleiotropy—that is, the degree to which the traits share a 
common genetic influence, environmental effects common to 
both traits, or both. The degree to which two variables have 
genetic and environmental effects in common is indexed by 
genetic (rg) and environmental correlation coefficients (rg). 
These statistics are interpreted as any other correlation coef- 
ficient and they may be subjected to other statistical proce- 
dures such as factor analysis (Crawford & DeFries, 1978). 
Genetic and environmental correlation coefficients are read- 
ily estimated from data obtained from monozygotic (MZ) 
and dizygotic (DZ) twin pairs. 

The calculation of the genetic correlation is similar to that 
used to estimate the heritability of a single variable. A higher 
within-pair correlation for MZ twins than for DZ twins sug- 
gests the presence of genetic influences because the greater 
similarity is directly attributable to the twofold increase in 
genetic similarity in MZ versus DZ twins. In the multivariate 
case, a common genetic influence is suggested when the MZ 
cross-correlation (the correlation between one twin’s score 
on one of the variables and the other twin’s score on the other 
variable) exceeds the DZ cross-correlation. 

The phenotypic correlation (r,) between two variables 
(traits), x and y, is expressed by the following equation: 


Tp = (hy - hy - rg) + (er ey re) (3.1) 


where the observed or phenotypic correlation, (r,), is the 
sum of the extent to which the same genetic (r,) and/or envi- 
ronmental factors (r.) influence each variable, weighted by 
the overall influence of genetic and environmental causes on 
each variable (hx, hy, ex, ey, respectively). The terms h and 
e are the square roots of heritability and environmental effect 
(h? and e°) for variables x and y, respectively. 

It should be noted that a genetic correlation describes 
statistical pleiotropism—that is, the extent to which allelic 
effects on trait predict allelic effects on the other trait. As 
Carey (1987) pointed out, statistical pleiotropism is not to be 
confused with biological pleiotropism in which two variables 
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share the same loci. Unlike statistical pleiotropism, biological 
pleiotropism unequivocally links actual genes to behavior. 


PHENOTYPIC STRUCTURE AND GENETIC 
ARCHITECTURE OF PERSONALITY 


A critical issue for understanding the etiological structure of 
personality and for the use of multivariate genetic analyses to 
clarify personality structure is the degree to which the pheno- 
typic organization of traits reflects an underlying biological 
structure as opposed to the influence of environmental fac- 
tors. The evidence indicates that the phenotypic structure of 
traits closely resembles the underlying genetic architecture 
and to a lesser degree environmental structure. The evidence 
also suggests that environmental factors do not appreciably 
influence trait covariation. These conclusions are based on 
comparisons of the factors extracted from matrices of pheno- 
typic, genetic, and environmental correlations computed 
among traits comprising a given model or measure. 

In one of the earliest studies of this kind, Loehlin (1987) 
analyzed the structure of item clusters from the California 
Psychological Inventory (CPI; Gough, 1989) in samples of 
MZ and DZ twins. Three matrices were derived that repre- 
sented the covariance among different traits due to genetic, 
shared environmental, and nonshared environmental factors. 
When these matrices were examined with factor analysis, 
four factors emerged from analyses of genetic covariance that 
could be interpreted as representing Neuroticism, Extraver- 
sion, Openness, and Conscientiousness (few items related to 
the fifth factor, Agreeableness, are included in the CPI; see 
McCrae, Costa, & Piedmont, 1993). Analysis of shared envi- 
ronmental effects yielded two factors: family problems and 
masculinity-femininity. The former is not an aspect of per- 
sonality per se, and the latter is probably an artifact of the ex- 
clusive use of same-sex twins (Loehlin, 1987). It should be 
noted, however, that shared environmental effects make rela- 
tively little contribution to the variance of personality traits. 
Hence, the important finding is the structure of nonshared en- 
vironmental effects. Analysis of the nonshared environmen- 
tal covariance matrix yielded three interpretable factors that 
resembled Neuroticism, Extraversion, and Conscientious- 
ness. Thus, the structure of nonshared environmental influ- 
ences largely mirrored genetic influences. This is not an 
isolated finding: Livesley et al. (1998) found similar struc- 
tures in genetic and nonshared environmental components of 
traits related to personality disorder. 

Livesley and colleagues (1998) examined the congru- 
ence of genetic and phenotypic factor structures and com- 
pared phenotypic structure across samples of personality 


disordered patients and two samples recruited from the gen- 
eral population. The clinical sample consisted of 602 patients 
with personality disorder. The general population samples 
consisted of 939 volunteer general population participants 
and 686 twin pairs. The twin sample allowed the computation 
of matrices of genetic and environmental correlations that 
could be compared against the phenotypic structures from all 
three samples. Personality was assessed with the Dimen- 
sional Assessment of Personality Pathology (DAPP; Livesley 
& Jackson, in press). This measure assesses 18 traits underly- 
ing personality disorder diagnoses that were identified in pre- 
vious studies using a combination of clinical judgments, 
rational methods, and psychometric procedures (Livesley, 
1986; Livesley, Jackson, & Schroeder, 1992). 

Phenotypic correlations were computed in all three sam- 
ples separately, and genetic and environmental correlations 
were computed on the twin sample. The phenotypic, genetic, 
and environmental correlation matrices were subjected to 
separate principal components analyses with rotation to 
oblimin criteria. Phenotypic structure was similar across all 
samples. Four factors were extracted from all five matrices 
(see Tables 3.1 and 3.2). 

The first factor, Emotional Dysregulation, represents un- 
stable and reactive affects and interpersonal problems. The 
factor resembled neuroticism as measured by the NEO-PI-R 
(Costa & McCrae, 1992; Schroeder, Wormworth, & Livesley, 
1992) or the Eysenck Personality Questionnaire (EPQ; Jang & 
Livesley, 1999) and the DSM-IV diagnosis of borderline per- 
sonality disorder. The second factor, Dissocial Behavior, was 
negatively correlated with NEO-PI-R Agreeableness. It de- 
scribed antisocial traits and resembled the DSM-IV Cluster B 


TABLE 3.1 Rotated Principal Component Factor Loadings: 
DAPP-BQ Dimensions (clinical sample) 


Factor 
Dimension 1 2 3 4 
Submissiveness 0.85 
Cognitive Dysregulation 0.64 
Identity Problems 0.81 
Affective Instability 0.64 
Stimulus Seeking 0.76 
Compulsivity 0.93 
Restricted Expression 0.75 
Callousness 0.81 
Oppositionality 0.64 —0.47 
Intimacy Problems 0.85 
Rejection 0.78 
Anxiousness 0.86 
Conduct Problems 0.74 
Suspiciousness 0.50 
Social Avoidance 0.76 
Narcissism 0.41 
Insecure Attachment 0.70 —0.44 
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TABLE 3.2 Rotated Principal Component Factor Loadings of Additive Genetic and Nonshared 


Environmental Correlations 


Genetic Factors 


Environmental Factors 


Dimension 1 2 3 4 1 2 3 4 
Submissiveness 0.91 0.76 

Cognitive Dysregulation 0.66 0.70 

Identity Problems 0.84 0.68 

Affective Lability 0.69 0.70 

Restricted Expression 0.45 0.78 
Oppositionality 0.74 0.54 

Anxiousness 0.96 0.86 

Suspiciousness 0.61 0.45 

Social Avoidance 0.76 0.69 

Narcissism 0.60 0.47 0.45 

Insecure Attachment 0.64 0.69 

Stimulus Seeking 0.61 0.81 

Callousness 0.88 0.66 

Rejection 0.82 0.65 

Conduct Problems 0.75 0.69 

Restricted Expression 0.67 

Intimacy Problems 0.93 0.75 
Compulsivity 0.93 0.85 
Suspiciousness 0.45 


antisocial personality diagnosis, Eysenck’s Psychoticism, and 
Zuckerman’s Impulsive-Sensation Seeking. The third factor, 
labeled Inhibition, was defined by intimacy problems and re- 
stricted expression of inner experiences and feelings. The 
factor correlated negatively with NEO-PI-R and EPQ Extra- 
version and resembled the DSM-IV avoidant and schizoid 
personality disorders. The fourth factor, Compulsivity clearly 
resembled NEO-PI-R Conscientiousness and DSM-IV 
obsessive-compulsive personality disorder. The loadings de- 
rived from the phenotypic correlation matrices were remark- 
ably similar: Congruence coefficients ranged from .94 to .99. 
The congruency coefficients between the genetic and pheno- 
typic factors on Emotional Dysregulation, Dissocial, Inhibi- 
tion, and Compulsivity were .97, .97, .98, and .95, respectively. 
The congruence between factors extracted from the pheno- 
typic and nonshared environmental matrices were also high at 
.99, .96, .99, and .96, respectively. These data suggest that the 
phenotypic structure of personality and personality disorder 
traits closely reflects the underlying etiological architecture. 

This conclusion is also supported by a study of the pheno- 
typic structure and genetic architecture of the five-factor 
model assessed using the NEO-PI-R in two independent 
samples of twins recruited in Germany and Canada (Jang, 
Livesley, Angleitner, Riemann, & Vernon, in press). Factor 
analysis of the genetic and nonshared environmental covari- 
ance matrices yielded five factors that strongly resembled N, 
E, O, A, and C (Neurotic, Extraversion Openness, Agreeable- 
ness, and Conscientiousness). Congruence coefficients 
computed between the genetic factors and the published 


normative structure were .83, .72, .92, .88, and .70 for N, E, 
O, A, and C, respectively. The congruence of the nonshared 
environmental factors and normative structure was even 
higher at .96, .93, .90, .93, and .97 for N, E, O, A, and C, 
respectively. 

The interesting feature of these results is not only that phe- 
notypic structure resembles genetic structure, but also that the 
structure of environmental effects is similar to the genetic 
structure. Plomin, DeFries, and McClearn (1990) noted that 
across a range of studies, “the structure of genetic influences 
seems to be similar to the structure of [nonshared] environ- 
mental influences” (p. 236). They added that this is surprising: 
“Most of us would probably predict different patterns of ge- 
netic and environmental influences” (p. 236). Recently, how- 
ever, it has been suggested that genetic factors are more 
important than are environmental influences in shaping trait 
structure because the resemblance of the structure of non- 
shared environmental effects to the observed structure of 
traits may be artifactual (McCrae, Jang, Livesley, Riemann, & 
Angleitner, in press). 

Nonshared environmental effects are usually estimated 
as a residual term that may include systematic bias such as 
that introduced by implicit personality theory. Passini and 
Norman (1966) demonstrated this bias by asking students to 
rate the personalities of complete strangers. Although each 
rating was presumably a guess, a clear pattern to the ratings 
was found. Students who assumed that strangers were talka- 
tive also assumed that they were sociable and cheerful. Across 
a range of targets, these associations defined an Extraversion 
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factor. Factor analysis of the ratings yielded the familiar five 
factors. Some researchers concluded from such studies that 
trait structure merely reflects the effects of semantic biases on 
person perceptions (Shweder, 1975). Ratings of strangers 
must contain bias due to implicit personality theory because 
they cannot be influenced by the true personalities of the tar- 
gets. It is also likely that self-reports and ratings of well- 
known targets incorporate a similar bias. For example, two 
observers may agree that a person is sociable but disagree on 
the extent of his or her sociability. The observer assigned a 
higher rating for sociability is also likely to assign a higher 
rating for cheerfulness and talkativeness. Thus, part of the co- 
variance of these traits may be attributable to systematic bi- 
ases in person perception that lead to correlated errors in 
individual judgments. If this is the case, similarities in struc- 
ture between genetic covariance and nonshared environmen- 
tal covariance could reflect the biasing effects of implicit 
personality theory on the latter. 

To test for this bias, self-report twin data were supple- 
mented with cross-observer correlations on the NEO-PI-R. 
This allowed the computation of two matrices of nonshared 
environmental covariance. The first estimated the covariance 
due to implicit personality theory bias alone. Factorial analy- 
sis of this matrix yielded the familiar five factors. Comparison 
with normative structure yielded congruence coefficients of 
.81, .45, .81, .89, and .85 for Neuroticism, Extraversion, 
Openness, Agreeableness, and Conscientiousness, respec- 
tively. The second matrix of nonshared environmental covari- 
ance estimated was free from systematic bias. Factor analysis 
of this “unbiased” matrix with targeted rotations to the nor- 
mative NEO-PI-R factors produced low congruence coeffi- 
cients at .53, .68, .22, .61, and .80 for Neuroticism, 
Extraversion, Openness, Agreeableness, and Conscientious- 
ness, respectively. Subsequent factor analysis of this matrix 
yielded two factors. The first resembled a broad form of Con- 
scientiousness with salient loading of the facets Activity, 
Order, Dutifulness, Achievement Striving, Self-Discipline, 
and (low) Impulsiveness. The second factor was defined by 
the facets Warmth, Gregariousness, Positive Emotions, Open- 
ness to Feelings, Altruism, and Tender-Mindedness. This 
combination of Extraversion and Agreeableness facets resem- 
bles the Love axis of the Interpersonal Circumplex (Wiggins, 
1979). The other interpersonal axis—Dominance—does not 
appear to be influenced by the nonshared environment. As- 
sertiveness did not load on either factor. 

These results suggest that when the conventional estimates 
of nonshared environmental covariances are decomposed 
into implicit personality theory bias and true nonshared ef- 
fects, much of the resemblance to the five-factor structure 
appears attributable to bias. Overall, these studies point to the 


conclusion that genetic factors are largely responsible for the 
observed pattern of trait covariation. 


THE HIERARCHICAL STRUCTURE 
OF PERSONALITY 


Beyond problems with the content of personality taxono- 
mies, there are also uncertainties about the nature of the pro- 
posed hierarchical structure of traits and the relationship 
between higher- and lower-order traits. Factor analytic stud- 
ies provide consistent evidence that specific traits are orga- 
nized into more global entities. Lexical studies also show that 
natural language reflects this structure. Substantial agreement 
exists among individuals in judgments of trait breadth 
(Hampson et al., 1986). Despite this evidence, the nature and 
origins of the hierarchy are unclear. This is clearly a problem 
that requires explanation. 

Fundamental differences exist among models on the way 
the personality hierarchy is conceptualized. The lexical ap- 
proach seems to consider the higher-order domains to be 
lexical categories that impose structure on personality de- 
scriptors by organizing them into clusters that are not neces- 
sarily discrete or equally important (Saucier & Goldberg, 
1996). The lexical structure “provides a framework for de- 
scription, but not necessarily for explanation” (Saucier & 
Goldberg, 1996, p. 24-25). Saucier and Goldberg also as- 
serted that “as a representation of phenotypes based on nat- 
ural language, the Big Five structure is indifferent and thus 
complementary to genotypic representations of causes, moti- 
vations, and internal personality dynamics” (p. 42). The 
higher-order terms do not appear, therefore, to have any sig- 
nificance beyond that of description. 

Traits psychologists, including other five-factor theorists, 
make different assumptions. For Allport (1961), a trait is “a 
neuropsychic structure” (p. 347) and therefore an explana- 
tory concept. Eysenck also adopted this approach: Traits have 
heritable biological basis. Similarly, the five-factor model 
assumes that traits are “endogenous basic tendencies” with 
a substantial heritable component (McCrae & Costa, 1996, 
p. 72). For Eysenck and Costa and for McCrae, traits are 
explanatory as well as descriptive. In contrast to the lexical 
approach, the five-factor model assumes that domains are 
equally important and equal in breadth. 

Assumptions that trait theories make about the psycho- 
biological basis for the higher-order domains initially cre- 
ated uncertainty about the status of the lower-order traits. 
Most research effort has been directed toward understanding 
higher-order factors and little attention has been paid to 
parsing these domains into more specific components. Until 


recently, it was unclear whether the lower-order traits were 
merely facets of the higher-order traits or distinct entities 
with their own etiology. The use of the term facet to de- 
scribe the lower-order traits, a convention adopted by Costa 
and McCrae, implies that they are merely exemplars or 
components of a more fundamental global trait. In this 
sense, the facet traits can be understood in terms of the do- 
main sampling approach used in test construction in which 
facets are merely arbitrary ways to subdivide global traits 
to ensure adequate domain sampling. Identification of gen- 
eral genetic factors that have a broad influence on personal- 
ity phenotypes also raises questions about the significance 
of the lower-order or facet traits—in particular, whether 
these traits are heritable simply because of their association 
with the broader domains or whether they are also subject to 
specific genetic influences. Clarification of this issue is crit- 
ical to constructing an explanatory account of personality 
structure. 


Heritability of Lower-Order Traits 


If lower-order traits are only subcomponents of broader traits, 
all variance in a facet apart from error variance should be ex- 
plained by the variance in the global trait. Recently, however, 
behavioral genetic research has suggested that lower-order 
traits have a distinct heritable component (Jang et al., 1998; 
Livesley et al., 1998). These studies estimated whether lower- 
order traits have a unique genetic basis when the heritable 
component of higher-order traits is removed from them. Jang 
and colleagues (1998) partialled out all of the common vari- 
ance due to each of higher-order Neuroticism, Extraversion, 
Openness, Agreeableness, and Conscientiousness scales from 
the 30 facet scales of the NEO-PI-R. When the residual vari- 
ances on the facets were subjected to heritability analyses, a 
substantial genetic influence remained. Additive genetic ef- 
fects accounted for 25 to 65% of the reliable specific variance, 
with most heritabilities ranging from .20 to .35 (see Table 3.3). 

When these values were corrected for unreliability, the 
values increased to the usual range observed for personality 
traits. The implication is that these traits are not merely facets 
of more general traits, but rather distinct heritable entities. 

A similar approach was used to study the residual heri- 
tability of the 18 traits underlying personality disorder 
(Livesley et al., 1998). Factor scores were computed for the 
four factors described previously. A standardized residual 
score for each scale was computed by regressing the four fac- 
tor scores on each of the 18 basic traits. Monozygotic twin 
correlations were higher that the dizygotic twin correlations 
for all 18 traits. Estimates of the heritability of the residual 
trait scores showed substantial residual heritability for 11 of 
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TABLE 3.3 Heritability Estimates, Retest Reliabilities, and 
Relative Reliabilities of Revised NEO Personality Inventory Residual 
Facet Scores 


Domain and Facet Scale h? e e r, hir, 
Neuroticism 
Anxiety 0.25 — 0.75 0.58 0.43 
Hostility 0.21 — 0.79 0.53 0.40 
Depression 0.25 — 0.75 0.50 0.50 
Self-Consciousness 0.29 — 0.71 0.54 0.54 
Impulsiveness 0.27 — 0.73 0.59 0.46 
Vulnerability 0.26 — 0.74 0.56 0.46 
Extraversion 
Warmth 0.23 — 0.77 0.60 0.38 
Gregariousness 0.28 — 0.72 0.71 0.39 
Assertiveness 0.29 — 0.71 0.72 0.40 
Activity 0.27 — 0.73 0.70 0.39 
Excitement Seeking 0.36 — 0.64 0.69 0.52 
Positive Emotions 0.30 — 0.70 0.63 0.48 
Openness 
Fantasy 0.25 — 0.75 0.60 0.42 
Aesthetics 0.37 — 0.63 0.72 0.51 
Feelings 0.26 — 0.74 0.57 0.46 
Actions 0.34 — 0.66 0.69 0.49 
Ideas 0.33 — 0.67 0.69 0.48 
Values 0.35 — 0.65 0.71 0.49 
Agreeableness 
Trust 0.31 — 0.69 0.62 0.50 
Straightforwardness 0.25 — 0.75 0.56 0.45 
Altuism — 0.20 0.80 0.50 — 
Compliance 0.26 — 0.74 0.54 0.48 
Modesty — 0.26 0.74 0.64 — 
Tendermindedness 0.28 — 0.72 0.64 0.44 
Conscientiousness 
Competence 0.11 — 0.89 0.44 0.25 
Order 0.26 — 0.74 0.69 0.38 
Dutifulness 0.28 — 0.72 0.43 0.65 
Achievement Striving — 0.26 0.74 0.54 — 
Self-Discipline 0.28 — 0.72 0.61 0.46 
Deliberation — 0.18 0.82 0.71 — 


the 18 basic traits that ranged from .26 for Intimacy Prob- 
lems to .48 for Conduct Problems. 

These studies, in contrast to studies of phenotypic struc- 
ture, point to the significance of the lower-order traits. 
Although these traits have tended to be neglected in personal- 
ity research, they appear to be important for understanding 
personality. This suggests that a bottom-up approach to per- 
sonality structure would provide additional information to 
complement that provided by the traditional top-down 
approach of the three- and five-factor models that identify the 
higher-order domains first and then seek to define an appro- 
priate complement of facet traits. Before considering these is- 
sues in greater depth, it is important to recognize a limitation 
of the methods used. The regression method does not model 
genetic effects directly, and the results need to be replicated 
using multivariate genetic analyses. This introduces another 
feature of behavioral genetic analyses that is pertinent to 
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understanding the genetic basis of personality: the use of path- 
ways models to evaluate competing models of personality. 


Independent and Common Pathways Models 


In heritability analyses, components of variance are esti- 
mated by fitting models to the observed covariance matrices. 
In the univariate case, the heritability of a variable is esti- 
mated by comparing the similarity (estimated by Pearson’s r) 
of MZ to DZ twins. In the bivariate case, common genetic 
influences are suggested when the MZ cross-correlation 
exceeds the DZ cross-correlation used to compute the genetic 
correlation, rg. The multivariate extension of this idea is 
found in two general classes of path analytic models that are 
pertinent to personality research: independent and common 
pathways models (see Figures 3.1 and 3.2; McArdle & 
Goldsmith, 1990; Neale & Cardon, 1992). The independent 
pathway model specifies direct links from one or more ge- 
netic and environmental influences common to each variable 
and unique genetic and environmental effects to each vari- 
able. The common pathways model is a more stringent ver- 
sion of the independent pathways model. The primary 
difference between the two models is that the common path- 
ways model postulates that of the covariation in a set of vari- 
ables is mediated by a single latent variable that has its own 
genetic and environmental basis. Both models provide the 
opportunity to examine variance specific to each variable— 
that is, each lower-order trait. Factor analytic studies of 


Independent Pathways Model 


G = additive genetic effects common to all variables 


personality have been concerned with reducing the covari- 
ance between lower-order traits to fewer factors. Residual 
variance specific to each trait is neglected. Biometric path 
models applied to twin data decompose this variance into 
etiological components. This makes it possible to evaluate 
the significance of these specific traits. 

These models offer the opportunity to evaluate the hierar- 
chical structure of personality by comparing the fit of the two 
models to the same data set. The common pathways model 
is the biometric equivalent to the traditional model of ex- 
ploratory factor analysis used to delineate the phenotypic 
structure of traits. As applied to each of the five-factor do- 
mains, the model postulates a single latent factor for each do- 
main that mediates the effects of genetic and environmental 
effects on each lower-order trait. In the case of NEO-PI-R 
Neuroticism, a latent variable of neuroticism is hypothesized 
through which genetic and environmental factors influence 
the six facets of Anxiety, Hostility, Depression, Self- 
Consciousness, Impulsivity, and Vulnerability. In contrast, 
the independent pathways model postulates direct genetic 
and environmental effects on each facet trait. The fits of these 
models provide an opportunity to evaluate different concep- 
tions of personality structure. If the common pathways model 
provides the best fit, the implication is that the hierarchical 
structure of personality arises from the effects of higher-order 
factors that have a genetic and environmental basis. The task 
is then to explain how this entity differs from lower-order 
or facet traits and the role it plays in the formation of the 


E = nonshared environmental effects common to all variables 


g = additive genetic effects unique to each variable 


e = nonshared environmental effects unique to each variables 
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Figure 3.1 Independent pathways model; G = additive genetic effects common to all variables, E = 


nonshared environmental effects common to all variables, g = additive genetic effects unique to each vari- 


able, and e = nonshared environmental effects unique to each variable 


Common Pathways Model 


G = additive genetic effects common to all variables 
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E = nonshared environmental effects common to all variables 


g = additive genetic effects unique to each variable 


e = nonshared environmental effects unique to each variables 
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Figure 3.2 Common pathways model; G = additive genetic effects common to all variables, E = non- 
shared environmental effects common to all variables, g = additive genetic effects unique to each vari- 
able, and e = nonshared environmental effects unique to each variable. 


hierarchy. If the independent pathways model provides the 
best fit, however, the implication is that the higher-order con- 
structs of phenotypic analyses do not reflect the effects of a 
phenotypic entity, but rather the pleiotropic action of the 
genes shared by all lower-order or facet traits that define 
the domain. Under these circumstances, the task is to expli- 
cate the mechanisms that lead to trait clusters. Regardless of 
which model provides the best fit to the data, a useful feature 
of both models is that the magnitude of the path coefficients 
between each facet scale and the common genetic factor or 
latent variable along with information on the magnitude of 
genetic and environmental influences unique to each facet 
provides the basis for determining which facets should be 
grouped together within the taxonomy. 


Five-Factor Model 


Jang and colleagues (in press) fit common and independent 
pathways models to evaluate the coherence of the five do- 
mains assessed with the NEO-PI-R. The models were applied 
separately to a sample of 253 identical and 207 fraternal 
twin pairs from Canada and 526 identical and 269 fraternal 
pairs from Germany. The two samples made it possible to 
examine the universality of the etiological basis for personal- 
ity structure by investigating whether the same genetic and 


environmental factors influenced personality traits in the two 
samples and whether they had similar effects. 

For each sample, a single-factor common pathways 
model and a series of independent pathway models specify- 
ing variable numbers of genetic and nonshared environmen- 
tal factor were fit to the six facets defining each domain. 
Shared environmental effects were omitted from the models 
because their effects were minimal. For each domain, the 
best fit was obtained with an independent pathways model. 
Table 3.4 illustrates the findings for the Neuroticism do- 
main. An independent pathways model specifying two ge- 
netic factors and two nonshared environmental factors 
provided the most satisfactory explanation of the covariance 
between the six Neuroticism facets in the two samples. In 
both samples, the first genetic factor was marked by the 
Angry Hostility facet and, to a lesser extent, Anxiety. The 
second factor influenced all facets except Angry Hostility 
and Impulsivity. The depression facet had the highest load- 
ing in both samples. 

In addition to demonstrating that the independent path- 
ways model provided the best fit, these findings also suggest 
that the broad domains of personality are nonhomogeneous. 
This raises important questions about the factors that account 
for the apparent hierarchical structure of personality traits 
and the nature and conceptual status of the higher-order 
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TABLE 3.4 Multivariate Genetic Analysis (independent pathways 
model) of the NEO-PI-R Neuroticism Facets on a Sample of German 
and a Sample of Canadian Twins 


Common 
Common Nonshared Variable- 
Genetic Environmental specific 
Factors Factors Factors 
Facet Scale 1 2 1 2 A E 
Canadian Sample 
Anxiety 48 27 20 20 20 56 
Hostility .65 — .29 21 — .67 
Depression .45 A3 59 .26 Ke 38 
Self-Consciousness ` A7 ‚39 42 24 37 37, 
Impulsivity .36 — .78 — .50 — 
Vulnerability AT A0 A0 SA .28 1 
German Sample 
Anxiety .46 34 .29 46 36 Eu 
Hostility .66 — .76 — — — 
Depression AT 45 33 .46 Ke EN 
Self-Consciousness ` 28 A4 22 35 A0 .60 
Impulsivity KI — 19 — .57 77 
Vulnerability A3 42 33 .49 .28 .46 


Note. All parameters are significant at p < .05. 


dimensions. These conclusions are, however, based on a sin- 
gle study using only a single measure of personality. Replica- 
tion is clearly needed, given the results’ significance for 
understanding trait structure. The conclusions are, however, 
similar to those drawn from a study of personality disorder 
traits (Livesley & Jang, 2000). 


Personality Disorder Traits 


Livesley and Jang (2000) investigated the etiological struc- 
ture of personality disorder by fitting independent and com- 
mon pathways models to the 18 lower-order traits of 
personality disorder assessed by administering the DAPP to a 
volunteer sample of 686 twin pairs. Each trait consists of two 
or more specific traits so that a total of 69 specific traits define 
the 18 basic traits. The 18 traits in turn define four higher- 
order factors. Thus the DAPP system incorporates three lev- 
els of construct (higher-order factors, lower-order traits, and 
specific traits) whereas the NEO-PI-R has only two levels 
(domains and facets). This makes it possible to explore the 
genetic architecture of personality in more detail. For exam- 
ple, the basic trait of Anxiousness is defined by four specific 
traits: trait anxiety, guilt proneness, rumination, and indeci- 
siveness. Each basic trait represents a single phenotypic fac- 
tor. If personality is inherited as a few genetic dimensions 
represented by the four higher-order factors, a single genetic 
dimension should underlie each basic trait that is shared by 
other traits constituting the higher-order factor. Evidence of a 
genetic effect specific to each trait would be provided by 


evidence that the 18 basic traits are composed of two or more 
genetic dimensions. 

A one-factor common pathways model did not provide a 
satisfactory fit for any of the 18 basic traits. On the other hand, 
an independent pathways model postulating a single genetic 
dimension explained the covariation among specific traits 
for 12 of the 18 basic trait scales: Anxiousness, Cognitive 
Dysregulation, Compulsivity, Conduct Problems, Identity 
Problems, Insecure Attachment, Intimacy Problems, Opposi- 
tionality, Rejection, Stimulus Seeking, Submissiveness, and 
Suspiciousness. The results of model fitting for illustrative 
scales are provided in Table 3.5. For three of these scales, Inti- 
macy Problems, Rejection, and Stimulus Seeking, the com- 
mon genetic dimension accounted for little of the variance for 
one or more of the specific trait scales, indicating that a spe- 
cific genetic factor influenced these traits. Two genetic dimen- 
sions were found to underlie four scales: Affective Lability, 
Narcissism, Restricted Expression, and Social Avoidance. 
Three common genetic dimensions contributed to Callousness 
(see Table 3.5). 

Multivariate analyses of normal and disordered personal- 
ity traits suggest that multiple genetic and environmental fac- 
tors influence the covariant structure of traits. They also 
confirm the findings of the regression analyses that many 
lower-order traits are influenced by one or more genetic di- 
mensions specific to those traits. Finally, in both sets of 
analyses, the common pathways model did not provide a bet- 
ter fit to the data than did the independent pathways model. 
This suggests that the general genetic dimensions found by 
Livesley and colleagues (1998) and others by factor analyz- 
ing matrices of genetic correlations do not influence each trait 
through a latent phenotypic variable, but rather exert a direct 
influence on each trait. 


IMPLICATIONS FOR PERSONALITY STRUCTURE 


The studies described in the previous section reveal a com- 
plex genetic basis for personality. Multiple genetic dimen- 
sions differing in the breadth of their effects contribute to 
personality phenotypes (Jang et al., 1998; Livesley et al., 
1998; Livesley & Jang, 2000). Some are relatively specific 
dimensions that influence single phenotypic traits, whereas 
others have broader effects influencing multiple phenotypi- 
cally distinct but covarying traits. Consequently, many traits 
appear to be influenced by multiple genes and gene systems. 
Similarly, trait covariation seems to arise from multiple 
genetic effects. Genetic effects on traits appear to be direct 
rather than mediated by higher-order entities. These findings 
require replication. Nevertheless they appear to challenge 
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TABLE 3.5 Illustrative Scales: Multivariate Genetic Analyses of the DAPP-DQ Facet Scales 


Al A2 A3 


El E2 A G. E 


x? = 52.45, df = 54, p = 23 


Rejection 
Rigid Cognitive Style A3 — 38 — .56 
Judgmental .13 — — 46 — 34 — 7 
Interpersonal Hostility 46 — — 31 — 34 — Eu 
Dominance 53 — A0 — 61 


x? = 90.63, df = 84, p = .22 


Restricted Expression 


Self-Disclosure 55 30 — 
Affective Expression .31 .58 — 
Angry Affects .24 .38 — 
Positive Affects .40 .51 — 
Self-Reliance .55 .15 — 


12 49 33 — .49 
21 56 — — AS 
.75 — .49 — — 
— .46 33 — 52 
.17 50 23 — 59 


x? = 154.48, df = 166, p =.73 


Callousness 
Contemptuousness 36 3 42 
Egocentrism 28 36 .28 
Exploitation .26 54 A3 
Irresponsibility 40 83 23 
Lack of Empathy 53 .20 .16 
Remorselessness 42 .16 34 
Sadism 36 — .66 


28 15 A4 — 57 
46 21 .47 — .48 
ER 18 — — ER 
KR 23 A0 — .65 
ER .23 .26 — .65 
— 76 232 — — 

27 14 A0 — 65 


models of personality positing links between specific geneti- 
cally based neurotransmitter systems and specific personality 
traits. They also suggest a different conception of the trait 
hierarchies from that assumed by many trait taxonomies. 


Hierarchical Structure 


Factor analyses of genetic correlations and the modeling 
studies cited in the previous section identified general genetic 
factors that account for trait covariation. The model-fitting 
analyses also confirmed conclusions based on regression 
analyses that lower-order traits are not merely components of 
higher-order traits, but rather are distinct etiological entities. 
It appears that each basic or facet trait is influenced by general 
and specific genetic factors. Genetic dimensions that affect 
multiple traits appear to influence each trait directly rather 
than indirectly through a higher-order phenotypic entity. This 
raises questions about the basis for the hierarchy consistently 
identified by factor analytic studies and the conceptual status 
of higher-order constructs like neuroticism and extraversion 
and their role in theories of individual differences. 

Although the facets delineating each of the five-factor do- 
mains covary due to shared genetic effects, it is not necessary 
to invoke a higher-order latent construct to explain this 
covariation. This raises the possibility that higher-order con- 
structs such as neuroticism merely represent the pleiotropic 
action of genes. If this is the case, neuroticism and other 


higher-order domains are not entities that are distinct from 
the specific traits that delineate them. They are not traits in 
Allport’s sense of distinct phenotypic entities with an under- 
lying biology, but rather heuristic devices that represent clus- 
ters of traits that covary because of a common genetic effect. 
This is consistent with the conception of domains as lexical 
categories (Saucier & Goldberg, 1996). Nevertheless, facet 
traits defining domains such as neuroticism and extraversion 
overlap sufficiently to justify grouping them into an overall 
global measure. 

The model of trait structure implied by these findings dif- 
fers from that of traditional trait theories. With traditional 
models in which lower-order traits are nested within a few 
higher-order factors, it follows that any statement about the 
higher-order factor applies to all subordinate traits. This is 
not the case with the model proposed because each basic trait 
has its own specific etiology. A second difference is that tra- 
ditional hierarchical models seem to assume that trait tax- 
onomies are similar to any classification based on set theory 
principles. At each level in the hierarchy, categories are as- 
sumed to be exhaustive and exclusive (Simpson, 1961). 
Exhaustiveness means that trait categories exist to classify all 
subordinate traits, whereas exclusiveness refers to the princi- 
ple that each subordinate feature can be classified into only 
one superordinate trait. Considerable effort has been ex- 
pended in attempts to delineate a structure with these proper- 
ties. Indeed, this is the reason for debate on number and 
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content of domains. It also explains Costa and McCrae’s in- 
sistence that domains are equal in breadth. If they are not, the 
five-factor model is open to the criticism that the model is not 
sufficiently parsimonious, as argued by Eysenck. This theo- 
retical structure is understandable if trait taxonomies are con- 
ceptualized only as lexical structures. It is possible, however, 
that traits at the biological level are not organized in the sys- 
tematic way proposed by the five-factor model. 

There are no a priori reasons to assume that all basic traits 
must be organized into a hierarchy or that each higher-order 
domain is equally broad and defined by an equal number of 
facets as hypothesized by the five-factor model. An equally 
plausible model is that traits are organized into clusters that 
differ in the number of basic traits that they subsume and that 
the hierarchy is incomplete, with some specific traits showing 
minimal degrees of covariation. This structure is illustrated 
by the findings regarding the structure of the higher-order di- 
mension of compulsivity identified in studies of personality 
disorder traits (Livesley et al., 1998). Pathways models iden- 
tified a single genetic dimension underlying the specific traits 
that define this construct. Factor analyses show that it is con- 
sistently not related to other traits—hence, the three pheno- 
typic traits that delineate compulsivity from separate 
higher-order factors. Compulsivity is, however, a trait nar- 
rower than other higher-order domains. It appears to repre- 
sent a distinct basic or lower-order trait based on a single 
genetic dimension that does not have a hierarchical relation- 
ship with other basic traits. 


Basic-Level Traits: Defining the Basic 
Unit of Personality 


The idea that personality is inherited as a few genetic mod- 
ules with broad effects and a large number of modules with 
more specific effects focuses attention on the significance of 
lower-order or basic traits. These findings are similar to eval- 
uations of hierarchical models of cognitive ability that also 
provide evidence that specific abilities are heritable (Casto, 
DeFries, & Fulkner, 1995; Pedersen, Plomin, Nesselroade, & 
McClearn, 1992). Basic traits do not appear to be specific 
exemplars of the higher-order traits that they define or blends 
of two or more factors (Hofstee, DeRaad, & Goldberg, 1992). 
Rather, they are discrete genetic entities with their own bio- 
logical basis. This suggests that personality models that re- 
duce traits to a few global domains do not reflect the genetic 
architecture of normal or disordered personality. As noted 
earlier, personality research has tended to neglect these traits 
in favor of more global dimensions. Yet evidence of speci- 
ficity of genetic effects suggests that the basic traits are the 
fundamental building blocks of personality that are more 


important for understanding personality than are the global 
constructs that have traditionally been the focus of research 
and explanation. This approach again raises the question of 
how basic traits should be conceptualized and defined, as 
well as which criteria are relevant to defining domains. 

Costa and McCrae (1998) noted the challenges of delin- 
eating a comprehensive set of basic traits. The specificity of 
genetic effects also reveals the challenge involved because of 
the large number of genetic dimensions that are likely to be 
involved. A genetic perspective does, however, provide a de- 
finition of a basic dimension that could facilitate the identifi- 
cation and assessment of these traits. The usual psychometric 
criteria used to develop homogeneous scales could be sup- 
plemented with the genetic criterion that a basic trait scale 
represents a single specific genetic dimension. With this ap- 
proach, items assessing a basic trait would form a genetically 
homogeneous unit as opposed to a factorially homogeneous 
unit. Items could then be selected according to their correla- 
tion with the underlying genetic dimension. Thus items form- 
ing a scale would share the same general and specific genetic 
etiology. With this approach, the goal would be to use behav- 
ioral genetic techniques to bring about definitions of the phe- 
notype that correspond to what Farone, Tsuang, and Tsuang 
(1999) refer to as “genetically crisp categories” (p. 114). 

An example of this approach is provided by a study of 
the genetic structure of the Eysenck Personality Question- 
naire (Heath, Eaves, & Martin, 1989). This instrument has 
three broad scales composed of 21 to 25 items that assess 
Neuroticism, Extraversion, and Psychoticism. Heath and col- 
leagues extracted a common genetic and environmental 
factor for Neuroticism and Extraversion, indicating that these 
items are etiologically homogeneous. In contrast, little evi- 
dence was found for a common genetic factor for the Psy- 
choticism items. Subsequent analyses showed that the items 
formed into two distinct genetic factors: paranoid attitudes 
and hostile behavior. The results of such a systematic evalua- 
tion of item etiology could be used to form etiologically 
homogeneous scales. 

This approach could be used either to develop new scales 
or modify existing scales so that they resemble the underly- 
ing genetic architecture more closely. This could be achieved 
by applying differential weights that index the influence of 
specific genetic and environmental influences on different 
traits. In this way, questions about the phenotypic structure of 
personality are addressed, and scales could be constructed so 
that they do not reflect competing genetic and environmental 
influences. 

The estimation of genetic and environmental factor scores 
is a relatively new and active area of research. Sham et al. 
(2001) recently described a method that permits these genetic 


factor scores to be computed. Their method uses the follow- 
ing equation: 


yayıık (3.2) 


where y = factor score for the common genetic factor, d = 
the factor loadings of each variable on the genetic factor of 
interest (i.e., the column vector of estimated path coefficients 
that represent the correlations between the common genetic 
or environmental factor and the observed measures), U~! = 
correlation matrix between all of the variables (e. the in- 
verse of the correlation matrix of the observed measures), 
and x = each person’s score or response to each of the vari- 
ables (i.e., column vector of observed values on the mea- 
sures). Other methods are also available to compute genetic 
and environmental factor scores (Thomis et al., 2000). 


Domain Content 


As discussed earlier, the facet structure of several five-factor 
domains is still unclear. The same behavioral genetic 
approach used to define and measure basic trait scales could 
also be applied to the delineation of domain content. The 
unity of a domain is demonstrated by evidence that a single 
common genetic factor influences all the facets composing 
the domain. This approach could be used to clarify the loca- 
tion of impulsivity within the higher-order structure. The 
five-factor model locates impulsivity in Neuroticism, 
whereas Eysenck places it within Extraversion. As noted 
earlier, the bivariate correlations of this facet with other 
Neuroticism facets assessed with the NEO-PI-R are lower 
than correlations between other facets. Etiological data could 
be used to relocate impulsivity with other traits with which it 
shares a common etiology. Alternatively the item content 
could be changed based on genetic and environmental etiol- 
ogy so that correlations with the other Neuroticism facets are 
increased (of the loadings on the common factors are in- 
creased). In the case of the DAPP scales, impulsivity is part 
of the phenotypic trait of stimulus seeking along with sensa- 
tion seeking and recklessness. Multivariate genetic analyses 
showed that a single common genetic factor underlies this 
dimension that is defined by sensation seeking and reckless- 
ness (see Table 3.5). Impulsivity has a low loading on the 
factor and a substantial specific heritable component. It 
appears that impulsivity as defined within the DAPP structure 
is a specific heritable entity and not the result of interaction 
between extraversion and constraint or psychoticism as sug- 
gested by Depue and Collins (1999) or extraversion and psy- 
choticism as suggested by Gray (1970, 1973, 1987; Pickering 
& Gray, 1999), although it is consistent with Gray’s argument 
that impulsivity is a fundamental dimension of temperament. 
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The findings of behavioral genetic studies of personality 
structure also have implications for attempts to identify the 
putative genes for personality. Most molecular genetic stud- 
ies of personality use an analytic strategy that correlates a 
total personality trait score such as Neuroticism with varia- 
tions in the candidate allele (Lesch et al., 1996). As the stud- 
ies described show, the total scale score confounds multiple 
genetic and environmental effects and reduces the power to 
detect putative loci. The use of etiological factor scores that 
index the proportions of the personality phenotype directly 
attributable to specific genetic and environmental effects 
(Boomsma, 1996; Sham et al., 2001; Thomis et al., 2000) 
could reduce these confounds. 


UNIVERSALITY OF TRAIT STRUCTURE 


Most models of personality traits including Eysenck’s three- 
factor model (Eysenck & Eysenck 1992), the five-factor 
model, and diagnostic categories of personality disorder 
proposed in the DSM-IV (American Psychiatric Association, 
1994) assume that the taxonomies proposed reflect a univer- 
sal structure. This assumption is also assumed to apply to the 
measures developed to assess these constructs. The only dif- 
ferences that these models of personality (and their mea- 
sures) permit between cultures and other groups (e.g., 
gender) are quantitative in nature; they typically mean differ- 
ences in trait levels or severity. If these assumptions are cor- 
rect, we should find that the etiological architecture of 
personality is also invariant across cultures and other basic 
groupings. We discuss this idea with respect to cross-cultural 
comparisons and the effects of gender. 


Cross-Cultural Comparisons 


Multiple studies show that the observed factorial structure of 
scales such as the NEO-PI-R is stable across cultures. For ex- 
ample, McCrae and Costa (1997) reported that the five-factor 
structure is consistent across samples from the United States, 
Western Europe, and Asia (see also Costa & McCrae, 1992; 
McCrae et al., 2000). The issue of cross-cultural stability also 
applies to etiological structure. Earlier, we described fitting 
an independent pathways model to the six facets defining 
NEO-PI-R domains in independent samples of German and 
Canadian twins. The universality of genetic effects can be 
evaluated by testing the equivalence of the genetic and envi- 
ronmental structures across independent samples. It is possi- 
ble to test whether: (a) the same genetic and environmental 
factors influenced the Canadian and German samples; and 
(b) whether these factors influenced each sample to the same 
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TABLE 3.6 Model-Fitting Statistics 


Canadian Sample 


German Sample 


Model x? p RMSEA x? p RMSEA AIC 

Neuroticism 
D 199.91 .00 .040 64.09 216.56 .00 .039 47.44 
Ke 172.11 .00 .036 79.89 149.82 .07 .019 102.18 
3° 151.12 .00 .029 —88.88 135.86 AS .015 104.14 
4 144.88 .03 .029 83.12 131.14 .13 .016 96.86 
5° 145.12 .03 .030 82.88 130.40 .14 .014 97.60 
6 210.86 .00 .043 61.14 220.57 .00 .038 —51.43 


Note. All models specified additive genetic and nonshared environmental factors unique to each facet. “df = 132, one 
common additive and one common nonshared environmental factor. Caf = 126, one common additive and two common 
nonshared environmental factors. “df = 120, two common additive and two common nonshared environmental factors. 
ddf = 114, two common additive and three common nonshared environmental factors. "dt = 114, three additive and two 
nonshared environmental factors.'df = 136, common pathways model. 


degree. Two tests of equivalency were applied. The first eval- 
uated equivalency of model form by testing the hypothesis 
that the same kind and number of genetic parameters are 
required to explain the data across the two samples. Sample 
differences are hypothesized to be limited to differences in 
the magnitude of the genetic and environmental influence 
exerted on a domain’s facet scales. If equivalence of model 
form was supported across the samples, the next step was to 
evaluate the magnitude of genetic and environmental influ- 
ences across samples. This was accomplished by applying a 
model with the same parameters to both samples. That is, the 
model specified the same number and type of factors in both 
samples and identical and constrained the factor loadings to 
be identical. 

The results of tests of model form and magnitude for 
NEO-PI-R Neuroticism are shown in Table 3.6. The same 
number and types of genetic and environmental influences 
(two additive genetic and two nonshared environmental com- 
mon factors) were identified in both samples, suggesting that 
the structure of neuroticism was similar across the samples. 
When the factor loadings on the common factors from the 
German sample were made to be the same as those on the 
Canadian sample (and vice versa), the model no longer fit 
the data. The results suggested that the primary differences 
between the German and Canadian samples were limited to 
the magnitude rather than kind of genetic and environmental 
effects supporting the claim that the factorial structure of the 
NEO-PI-R facets is universal. 


Gender Differences 


Personality tests are usually constructed to minimize gender- 
based differences by eliminating items whose intercorrela- 
tions with the other items can be attributable to gender and 


eliminating items evoking marked gender differences in 
endorsement. The approach yields scales that are applicable 
to both females and males but it overlooks the possibility of 
gender differences in the etiology. Behavioral genetic meth- 
ods may be used to determine whether the same genetic and 
environmental factors influence personality measure scores 
in males and females and whether the etiological architecture 
underlying the factorial structure of a personality measure is 
the same in males and females. 

The first question can be answered by fitting sex-limitation 
models to personality data (Neale & Cardon, 1992). This is 
accomplished by fitting a simple extension of the usual heri- 
tability model that uses data from same- and opposite-sex 
twin pairs to test whether the same genetic factors operate in 
males and females. In this case, gender differences are limited 
to differences in the magnitude of genetic and environmental 
influences. Another form of sex-limited gene expression oc- 
curs when different genes control the expression of a trait that 
is measured in the same way in males and females. With this 
form of sex-limitation, it is also possible to determine 
whether the same genes are present in both sexes but only ex- 
pressed in one sex. This is evaluated by comparing the simi- 
larities of opposite-sex DZ twin pairs with same-sex DZ 
pairs. Sex-specific genetic influences are suggested when the 
similarity of opposite-sex pairs is significantly less than 
the similarities of male or female DZ pairs. The difference in 
the correlation is attributable to the gender composition of 
each zygosity group. When the same and opposite-sex DZ 
correlations are similar, gender differences are not indicated. 

Only a few studies have investigated sex-limited gene 
expression in normal personality. The most notable is Finkel 
and McGue’s (1997) study that showed that the same genetic 
loci influence 11 out of the 14 scales of Multidimensional 
Personality Questionnaire (MPQ; Tellegen, 1982) in males 
and females. The heritable influences on the remaining three 
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Canadian Sample 


German Sample 


MZ DZ MZ DZ 

NEO-FFI Domain M F M F M-F M F M F M-F 
Neuroticism Al 53 Ke 35 13 .49 Ke 36 20 15 
Extraversion 50 EI 34 30 23 7 OT 34 25 17 
Openness .63 Sl .28 .36 .20 ST 50 A4 .26 .10 
Agreeableness 0 46 14 Ke .26 .43 42 37 10 .10 
Conscientiousness .47 50 KI 38 DI ‚37 46 A0 23 .05 
Sample sizes (pairs) 102 165 61 129 73 104 425 38 163 68 


traits—Alienation, Control, and Absorption—indicated that 
the genetic influences were gender-specific. Jang, Livesley, 
and Vernon (1998) reported some evidence for sex-limited 
gene expression in 18 traits delineating personality disorder 
measured by the DAPP. All dimensions except Submissive- 
ness in males, and Cognitive Dysfunction, Compulsivity, 
Conduct Problems, Suspiciousness, and Self-Harm in females 
were significantly heritable. Sex-by-genotype analyses sug- 
gested that the genetic influences underlying all but four 
DAPP dimensions (Stimulus Seeking, Callousness, Rejection, 
Insecure Attachment) were specific to each gender, whereas 
environmental influences were the same in both genders 
across all dimensions. Furthermore, the four higher-order di- 
mensions derived from the 18 basic traits (Livesley et al., 
1998) were also heritable across sex, and genetic effects were 
in common to both genders; the exception was Dissocial 
Behavior, which was not heritable in females. 

Such evidence of sex-limited effects challenges the as- 
sumed universality of trait taxonomies. However, it could be 
argued that the results based on the DAPP and MPQ are 
atypical. The DAPP is a specialized scale designed primarily 
to assess personality dysfunction. The scale does not cover 
such areas of normal personality as Openness to Experience 
(Jang & Livesley, 1999; Schroeder et al., 1992) because ab- 
normal variants of Openness are not included in clinical 
descriptions of personality disorder. The MPQ, unlike other 
scales, routinely reveals nonadditive genetic effects due to 
dominance (Waller & Shaver, 1994). This suggests that it 
may assess content different from that tested by scales such 
as the NEO-PI-R, which reveals genetic effects that are addi- 
tive (e.g., Jang et al., 1998). 

A more appropriate evaluation of the assumption of uni- 
versality would be to examine sex-limited gene expression 
on a major model of personality such as the five-factor 
model. Evaluation of whether the same genes are present 
across different samples is similar to the evaluation of cross- 
cultural effects. Jang, Livesley, Riemann, and Angleitner 
(in press) applied sex-limitation models to NEO-FFI data 


obtained from the Canada and German twin samples de- 
scribed earlier. Two general models were fit to the data. The 
first specified additive genetic and nonshared environmental 
influences for females and males and a male-specific genetic 
factor. The second tested whether heritable influences com- 
mon to males and females were the same across the two sam- 
ples. Table 3.7 reports the intrapair twin correlations for each 
zygosity group in each sample. The MZ male and MZ female 
correlations exceed their respective DZ correlations, suggest- 
ing the presence of heritable influences on each NEO-FFI 
domain in each sample. Of particular interest is the com- 
parison between the DZ opposite-sex correlations and the 
same-sex DZ correlations. In both samples, the DZ opposite- 
sex correlation for Conscientiousness was near zero, suggest- 
ing the presence of differential gender effects. The final form 
of the best-fitting model is presented in Table 3.8. The results 
suggest that genetic and environmental influences common 
to males and females influence four of the five FFM domains. 
The exception was Conscientiousness, for which gender- 
specific additive genetic influences operate. However, the 
external events and experiences specific to each twin— 
nonshared environmental influences—are common to males 
and females. The results also suggest that the type and mag- 
nitude of genetic and environmental influence were the same 
across the two groups, supporting the notion that the five- 
factor model as assessed by the NEO-FFI is applicable to 
different cultures and genders. 

This study has several limitations. The first is that the sam- 
ple sizes are rather small in both samples, especially male DZ 
twin pairs and opposite-sex pairs. The twin covariances asso- 
ciated with these two zygosity types, especially the opposite- 
sex pairs, are crucial for the validity of the analyses. The 
availability of relatively few twin pairs calls into question 
the stability of the correlations and thus the detection of sex- 
limited genes—as was obtained for Conscientiousness. Sec- 
ond, the study used the NEO-FFI, the short form of the 
NEO-PI-R. The full scale might produce different results 
because long versions of these scales sample domains more 
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TABLE 3.8 Parameter and Standard Error Estimates Produced by the Best-Fitting Sex-Limitation Model 


Parameter N E O A C 

Canada 

hy 86 + .03 .84 + .03 80 + .02 85 + .03 .86 + .03 

ep .82 + .02 84 + .02 88+ OI 83 + .02 82 + .02 

DÉI 20 + .04 84 + .04 80 + .02 89 + .03 — 

en .88 + .03 84 + .03 88+ OI .79 + .03 85 + .03 

nn — — — 83 + .04 
Germany 

hy 84 + .02 .87 + .02 80 + .02 84 + .02 82 + .02 

ep .84 + .01 .81 + .01 88+ OI .84 + .01 86 + .01 

DÉI 84 + .04 .87 + .04 81+ .04 87 + .03 

Gr 84 + .03 81 + .03 87 + .03 81+ .03 81 + .03 

Yn — — — — .87 + .03 


Note. N = Neuroticism; E = Extraversion; O = Openness to Experience; A = Agreeableness; C = Conscientiousness; hy, €p 


m 
genetic effects. 


thoroughly. As such, the present results should be considered 
tentative until replicated on a larger sample using full-scale 
versions, as well as other measures of personality. 

These analyses suggest that although most personality 
traits are influenced by the same genes in both genders (the 
implication being, e.g., DRD4 influences novelty seeking in 
both men and women), this is not true for all traits. The 
previous section suggested several explanations, but it is also 
possible that at the molecular level, different genes (or yet-to- 
be-discovered polymorphisms) differentially influence per- 
sonality across genders. If this is the case, the genetic and 
environmental architecture of some scales may differ by gen- 
der. This could be evaluated by fitting independent and com- 
mon pathways models to data from sister pairs and brother 
pairs separately and constraining the models (in form and 
magnitude) to be equal across gender groups. The sex-limita- 
tion model described previously that uses data from brother- 
sister pairs to test for gender-specific effects can be expanded 
to the multivariate case to further explore gender differences 
in personality. As far as we are aware, few multivariate ge- 
netic analyses of gender differences have been conducted, 
probably because many studies have limited data collection 
to sister pairs or have difficulty obtaining data from brother 
pairs (Lykken, McGue, & Tellegen, 1987). 


ENVIRONMENTAL EFFECTS 


Although our primary concern is with the genetic basis for 
personality structure, any discussion of genetic influence 
would be incomplete without reference to environmental fac- 
tors. Twin studies consistently show that about 50% of the 
variance in personality traits is explained by environmental 
factors and that most of this is accounted for by nonspecific 


hy,» €m = additive genetic and nonshared environmental effects common to males and females; h’,, = male-specific additive 


influences; common environmental influences do not appear 
to contribute to personality variation (Plomin & Daniels, 
1987). This etiological model derived from twin studies is 
confirmed by a large-scale study of Neuroticism by Lake, 
Eaves, Maes, Heath, and Martin (2000) that showed that in- 
dividual differences in neuroticism were not transmitted from 
parent to offspring via the environment but rather by genetic 
factors. The size and unique features of their data set (45,880 
twin pairs and their relatives on two continents) allowed 
them to test models of genetic transmission as well as gene- 
environment correlations. The results suggest that the envi- 
ronment exerts a contemporaneous influence on individual 
differences in neuroticism. That is, its effects are located in 
the current environment as opposed to being preset like ge- 
netic factors that are passed to individuals from their parents. 

Although nonshared environmental factors are important, 
the nature of these variables and the way they affect person- 
ality remain unclear. Despite considerable research effort 
(e.g., Hetherington, Reiss, & Plomin, 1994; Turkheimer & 
Waldron, 2000) using a variety of methods (Baker & Daniels, 
1990; Hetherington et al., 1994; Reiss et al., 1994; Vernon, 
Lee, Harris, & Jang, 1996) the results have been uniformly 
disappointing: Few nonshared influences on personality have 
been identified (Turkheimer & Waldron, 2000). Most studies 
have, however, investigated the effects of the nonshared en- 
vironment on the single variables; few studies have examined 
the effects of the nonshared environment on trait covariance. 
The study by McCrae et al. (in press) and the illustrative mul- 
tivariate genetic analyses of the NEO-PI-R and DAPP pre- 
sented earlier suggest that the nonshared environmental 
factors have an influence on personality structure different 
from that of genetic factors. They do not appear to contribute 
to trait substantially to the trait covariation described by trait 
taxonomies. 


It also appears that the environment does not have an 
effect that is independent of preexisting genetic factors. Re- 
views by Reiss, Neiderhiser, Hetherington, and Plomin 
(2000) and Caspi and Bem (1990) document personality- 
environment interaction and the way the individuals select 
and create their own environment. Genetic factors influence 
the environmental variables that are the focus of attention and 
the situations that the individual selects. For example, some 
kinds of life events are not independent of the individual; 
rather, their occurrence is influenced by such traits as Neu- 
roticism and Extraversion (e.g., Poulton & Andrews, 1992; 
Magnus, Diener, Fujita, & Pavot, 1993). Saudino, Pedersen, 
Lichtenstein, and McClearn (1997) showed that all genetic 
variance on controllable, desirable, and undesirable life 
events in women was common to the genetic influences un- 
derlying EPQ Neuroticism and Extraversion, NEO-FFI 
Openness to Experience (Costa & McCrae, 1985). Genetic 
influences underlying personality scales had little influence 
on uncontrollable life events because this variable was not 
heritable. Kendler and Karkowski-Shuman (1997) showed 
that the genetic risk factors for major depression increased 
the probability of experiencing significant life events in the 
interpersonal and occupational-financial domains, probably 
because individuals play an active role in creating their own 
environments. Heritable factors, such as personality and 
depression, influence the types of environments sought or en- 
countered. Jang, Vernon, and Livesley (2000) report signifi- 
cant genetic correlations between the Family Environment 
Scale (FES: Moos & Moos, 1974) subscale of Cohesiveness 
and DAPP higher-order factors of Emotional Dysregulation 
(—.45), Inhibition (—.39), FES Achievement Orientation and 
DAPP Dissocial Behavior (.38), Inhibition (—.58), and FES 
Intellectual Cultural Orientation and DAPP Emotional Dys- 
regulation (—.34). These results help to explain why mea- 
sures of the environment often have a heritable component: 
They often reflect genetically influenced traits (Saudino et al., 
1997). 

Using the factor score approach described earlier, Thomis 
et al. (2000) computed genetic factor scores for measures of 
muscle strength obtained from a sample of MZ and DZ twins. 
The twins were then subjected to a 10-week muscle strength 
training regimen. The muscle strength genetic factor scores 
explained the greatest proportion of the variance pre- and 
posttraining, indicating that genes are switched on, so to 
speak, in response to stress due to training, thus demonstrat- 
ing the existence of gene-environment interaction. Findings 
such as these suggest that the environmental factors that in- 
fluence traits are partially dependent on preexisting geneti- 
cally based personality traits. For example, a person scoring 
highly on a genetically based trait like sensation seeking will 
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seek out environments conducive to the expression of this 
personality genotype, such as engaging exciting sports. For 
this reason, molecular genetic studies designed to identify the 
genes for personality need to incorporate measures of per- 
sonality that separate the effects of genes and environment on 
the phenotype. 


MOLECULAR GENETICS 


From a genetic perspective, dimensions of individual differ- 
ences in personality are complex traits. That is, multiple genes 
and gene systems and multiple environmental factors influ- 
ence each trait (Plomin, DeFries, McClearn, & McGuffin, 
2000). The emergence of molecular genetics prompted 
considerable optimism about the possibility of identifying 
the genetic component or quantitative trait loci (QTLs) of 
these traits. Such a development would radically change the 
nature of personality research by enabling investigators to 
link behavioral dimensions to underlying molecular genetic 
structures. This would provide a more powerful way to 
resolve trait taxonomic issues that the behavioral genetic 
approaches discussed. The results of such studies have, 
however, been inconsistent, replications have often failed, 
and progress has been slower than expected. 

One of the earliest studies investigated the relationship be- 
tween Novelty Seeking and dopamine D4 or DRD4 receptor 
(Cloninger, Adolfsson, & Svrakic, 1996). Earlier Cloninger 
(1987; Cloninger, Svrakic, & Przybeck, 1993) proposed a 
model of personality that postulated that the expression of 
each personality trait is modulated by a specific genetically 
controlled neurotransmitter system. Specifically, Novelty 
Seeking is controlled by the dopaminergic system, Harm 
Avoidance by the serotonin system, and Reward Dependence 
by norephinephrine. Cloninger and colleagues (1996) re- 
ported a polymorphism of the D4 receptor that accounted for 
about 10% of the variance. Several replications have been 
reported (Benjamin, Greenberg, & Murphy, 1996; Ebstein, 
Novick, & Umansky, 1996; Ebstein, Segman, & Benjamin, 
1997) along with many failed replications (Ebstein, 
Gritsenko, & Nemanov, 1997; Malhotra, Goldman, Ozaki, & 
Breier, 1996; Ono et al., 1997; Pogue-Geile, Ferrell, Deka, 
Debski, & Manuck, 1998; Vandenbergh, Zonderman, Wang, 
Uhl, & Costa, 1997). 

Similarly, several studies have demonstrated a relation- 
ship between the serotonergic system and Harm Avoidance, 
Neuroticism, or related constructs (Hansenne & Ansseau, 
1999; Rinne, Westenberg, den Boer, & van den Brink, 
2000), and significant associations were reported with the 
serotonin transporter gene, 5-HTTLPR (Katsuragi et al., 
1999). However, several studies have failed to replicate 
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these findings (Flory et al., 1999; Gelernter, Kranzler, 
Coccaro, Siever, & New, 1998; Hamer et al., 1999; Herbst, 
Zonderman, McCrae, & Costa, 2000). Gustavsson et al. 
(1999) also failed to replicate these findings using the 
Karolinska Scales of Personality. 

These inconsistencies can be attributed to conceptual and 
measurement issues. The early studies in particular were 
often based on a conceptual model that assumed that person- 
ality is influenced by relatively few genes, each accounting 
for substantial variance. As noted, the evidence does not sup- 
port this approach. There has also been a tendency to assume 
that each trait was linked a specific neurotransmitter system. 
More recently, however, attention has focused on pleiotropic 
effects by investigating the possibility that a given polymor- 
phism influences several traits. Work on the serotonin trans- 
porter gene, for example, suggests that it is not associated 
with a single trait but rather has a pleiotropic relationship 
with Neuroticism and Agreeableness. Studies on humans and 
primates suggest that altered serotonin activity is related to 
negative emotional states such as depression, anxiety, and 
hostility, and to social behaviors such as dominance, aggres- 
sion, and affiliation with peers (Graeff, Guimaraes, De 
Andrade, & Deakin, 1996; Knutson et al., 1998; Murphy 
et al., 1998). Knutson and colleagues (1998) found that ad- 
ministration of the specific serotonin reuptake inhibitor, 
paroxetine, decreased negative affect and increased social af- 
filiation in normal human subjects. Lesch and colleagues 
(1996) reported that individuals carrying the 5-HTTLPR-S 
allele had increased total scores on NEO-PI-R Neuroticism 
and the facets of Anxiety, Angry Hostility, Depression, and 
Impulsiveness. The allele accounted for 3 to 4% of the total 
variance in these scales. Unexpectedly, the allele was also as- 
sociated with a decreased NEO-PI-R Agreeableness score. 
Greenberg et al. (1999) recently replicated these findings. 
Hamer et al. (1999) showed that 5-HTTLPR-S genotypes 
were significantly associated with increased Harm Avoidance 
(which correlates .66 with NEO-PI-R Neuroticism) and de- 
creased Self-Directedness (correlated —.64 with NEO-PI-R 
Neuroticism), Reward Dependence, and Cooperativeness 
(shown to correlate .43 and .66 with NEO-PI-R Agreeable- 
ness). These effects accounted for .80%, 1.98%, .97%, and 
2.60% of the total variance in these scores, respectively. 
Mazzanti et al. (1998), Peirson et al. (2000), and Benjamin 
et al. (2000) have reported replications. 

Measurement problems contributing to inconsistent find- 
ings include the use of measures with less-than-optimal 
psychometric properties and the use of relatively broad per- 
sonality constructs. Comparison of the dopamine-novelty 
seeking and serotonin-neuroticism studies suggests that the 
serotonin-neuroticism literature is less ambiguous than the 


dopamine-novelty seeking literature. These differences 
appear to be related to scale properties. Inconsistent find- 
ings may also be due to the confounding of genetic and en- 
vironmental influences on the phenotypes. As we have tried 
to show, many constructs and scales are etiologically 
heterogeneous. 

Twin studies estimating statistical pleiotropy could con- 
tribute to molecular genetic studies by identifying traits that 
are etiologically homogeneous units and etiologically re- 
lated. Molecular genetic work could then be used to confirm 
these associations by identifying the actual genes that ac- 
count for trait covariance. This would provide the strongest 
basis for revising personality models and allocating traits to 
etiologically related domains. 


CONCLUSIONS 


The thesis of this chapter is that behavioral genetic ap- 
proaches promise to provide an additional perspective that 
may help to resolve some of the more intractable problems in 
delineating and conceptualizing personality structure. The 
evidence reviewed suggests an alternative perspective on the 
trait structure of personality that complements traditional 
conceptions. Although trait theory has largely concentrated 
on mapping personality in terms of broad global traits, the 
evidence suggests that personality is inherited as a large num- 
ber of genetic dimensions that have relatively specific effects 
on personality phenotypes and a smaller number of genetic 
dimensions that have broader effects, perhaps through a 
modulating influence on related dispositions. These dimen- 
sions with broader effects appear to account for some of the 
observed covariation among traits. They do not appear, how- 
ever, to exert these effects through higher-order phenotypic 
structures, but rather through a direct influence on each basic 
trait. We assume that these common features are more likely 
to involve modulating functions or common mechanisms that 
regulate each trait in a given cluster. 

These tentative conclusions suggest the need to reconsider 
traditional models of the hierarchical structure of personality 
in which traits are organized into broad domains due to the ef- 
fects on broad dispositions. Instead, the organization of traits 
into clusters is assumed to arise from the pleiotropic effects of 
genetic dimensions that affect multiple traits. Under these cir- 
cumstances, it is conceivable that not all traits are organized 
into clusters of covarying features, but rather remain relatively 
distinct characteristics. Nor is it inevitable the traits are hierar- 
chically organized in similar ways across domains. That is, it 
is possible that the symmetrical hierarchical structure avidly 
sought by trait theorists and students of psychopathology does 


not reflect the way personality is organized at a genetic level. 
An equally feasible structure would involve considerable dif- 
ferences in complexity across domains. Some domains may 
consist of a relatively large number of traits, whereas others 
may consist of only one or two genetically homogeneous 
traits. These assumptions are consistent with the lexical view 
of Saucier and Goldberg (1996), who argued that the five do- 
mains are merely a convenient way of organizing lower-order 
traits and that there is no inherent reason to assume that do- 
mains are equal in breadth or in pervasiveness. 

Although behavioral genetic analyses show that environ- 
mental factors exert a considerable influence on personality, 
they do not appear to influence the structural relationships 
among traits to any appreciable extent. Instead, environ- 
mental factors appear to exert a more contemporaneous effect 
on trait expression. The nature of these factors and the way 
that they function remain important topics of research. 
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Whether we speak of mice or men, every member of a 
species is the same as other members in many respects but 
different in others. One task of personality psychology is to 
describe the basic behavioral differences and discover their 
origins. Description of personality is usually in terms of ob- 
servable traits, and various models have been proposed to 
classify them. Biology has confronted a similar task in the 
classification of species (taxonomy). Taxonomy has been 
based on phenomenal and functional similarities and differ- 
ences but more recently has been moving in the direction of 
using evolutionary analyses to define species in terms of their 
ancestries. Psychology still depends on phenomenal similari- 
ties and differences. As the genome reveals its secrets, both 
fields will eventually turn to DNA for the classification task. 

There are two basic pathways for the second task, the 
search for the sources of individual differences. These are 
shown in Figure 4.1. One pathway is the biological beginning 
in behavioral genetics. Genes make proteins into neurons, 
and neurons are organized into brain and nervous systems. 


85 


Cortical Arousal and Arousability 98 
Monoamines 99 
Monoamine Oxidase 99 
Hormones 100 
Genetics 100 
Summary 101 
AGGRESSION/HOSTILITY/ANGER/AGREEABLENESS 102 
Cortical Arousal and Arousability 102 
Cardiovascular Arousal and Arousability 103 
Monoamines 104 
Hormones 105 
Genetics 106 
Summary 106 
CONCLUSIONS 107 
REFERENCES 109 


Neurons operate through chemical neurotransmitters and the 
enzymes that govern their production and catabolism, as well 
as through hormones produced in other loci. This is the bio- 
chemical level. Differences in neurochemical makeup result 
in differences in neural activity and reactivity or physiology. 
Physiological differences affect conditionability, both of the 
classical and operant types. Individuals differ in both their 
conditionability and their sensitivities to conditioned stimuli 
associated with reward and punishment. 

The second pathway begins with the largest social unit, 
culture. Cultures are subdivided into specific societies de- 
fined by geography or class groupings defined by wealth, oc- 
cupation, and education. Neighborhood provides the more 
proximal influences on behavior. The family of origin and 
peers transmit the influences of society, albeit with individual 
variations on modal mores, values, and behavior patterns. 
Observational learning combined with social reinforcement 
is the mechanism of influence at the next level. At this point 
there is a convergence of the pathways because the different 
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Figure 4.1 Two pathways to individual differences in personality: the 
biological and the social. 


mechanisms of learning combine to produce behavioral 
traits. These traits are usually specific to certain types of situ- 
ations. Depending on their generality and strength they com- 
bine to form what we call personality traits. 

Both of these pathways have a historical origin in the evo- 
lutionary history of the species. Genetic changes account 
for the origin and changes (over long periods of time) in 
the species. Cultures represent the collective solutions of the 
human species to the basic demands of evolution: survival 
and reproduction. Cultural evolution is more rapid than bio- 
logical evolution. Significant changes can occur within a gen- 
eration, as with the sudden impact of computer technology on 
the current generation. 

This chapter describes the biological pathway up to, but 
not including, conditioning. For each of four dimensions of 
personality I describe theory and research at each level of 
analysis along this pathway starting at the top (physiology). 
At the genetic level I describe primarily the studies of mole- 
cular genetics that link specific genes to traits. The biometric 
genetic studies are covered in the chapter by Livesly, Jang, 
and Vernon in this volume. The molecular studies link genes 
more directly to the neurological and biochemical levels on 
the way up to personality traits. An analysis of this type was 
conducted a decade ago (Zuckerman, 1991). Advances occur 
rapidly in the neurosciences. Ten years is equivalent to at 
least several decades in the social sciences. I have made an 


attempt to survey the changes since my last attempt. In a 
chapter I can hope only to highlight some of these advances 
and will reserve a more thorough review for a revision of my 
1991 book. My approach draws heavily on comparative stud- 
ies of other species as any psychobiological model must do 
(Gosling, 2001; Zuckerman, 1984, 1991), but I cannot do so 
within the constraints of a single chapter. I will limit compar- 
ative studies to those in which there are clear biological 
markers in common between animal and human models. 


TEMPERAMENT AND PERSONALITY TRAITS 


Researchers of temperament in children and behavioral traits 
in other species have typically included certain dimensions 
like emotionality, fearfulness, aggressiveness, approach ver- 
sus withdrawal (in reactions to novel stimuli), general activ- 
ity, playfulness, curiosity, sociability versus solitariness, and 
inhibition versus impulsivity (Strelau, 1998). From the 1950s 
through the 1970s personality trait classification was domi- 
nated by two models: Eysenck’s (1947) three-factor theory 
(extraversion, neuroticism, and psychoticism) and Cattell’s 
(1950) 16-factor model. Eysenck’s (1967) model was biolog- 
ically based with an emphasis on genetics, physiology, and 
conditioning. Gray’s (1982, 1987) model is a bottom-up 
model that starts with behavioral traits in animals and extrap- 
olates to human personality. He places his three behavioral 
dimensions (anxiety, impulsivity, fight-flight) within the axes 
of Eysenck’s dimensions, but not lying on the axes of those 
dimensions or being precise equivalents of them. 

The first five-factor model originated in lexical studies of 
trait-descriptive adjectives in language done in the 1960s 
(Norman, 1963; Tupes & Christal, 1961) with its roots in a 
much earlier study by Fiske (1949). Interest in this model 
reawakened in the 1980s (Digman & Inouye, 1986; Goldberg, 
1990; Hogan, 1982; McCrae & Costa, 1985). Most of these 
studies used adjective rating scales. The translation of the 
model into a questionnaire form (NEO-PI-R; Costa & 
McCrae, 1992a) increased the use of the scales by personality 
investigators. The five factors incorporated in this tests are la- 
beled extraversion, neuroticism, agreeableness, conscientious- 
ness, and openness to experience. The five factors have been 
replicated in studies in many countries although with some 
differences—particularly on the last factor, openness. The en- 
thusiasts for the Big Five insist it is the definitive and final 
word on the structure of personality (Costa & McCrae, 1992b), 
although critics regard this claim as premature (Block, 1995; 
Eysenck, 1992; Zuckerman, 1992). One of the criticisms of the 
model is its atheoretical basis in contrast to Eysenck’s devel- 
opment of his factors from theory as well as empirical factor 


analytic studies of questionnaire content. However, recent 
studies in behavior genetics have used the model, and some of 
the data from earlier studies has been translated into the form 
of these five factors (Loehlin, 1992). 

Two recent models have been derived from biosocial 
theories. Based on factor analyses of scales used in psy- 
chobiological studies of temperament and personality, 
Zuckerman and Kuhlman developed a five-factor model 
dubbed the alternative five (Zuckerman, Kuhlman, & 
Camac, 1988; Zuckerman, Kuhlman, Thornquist, & Kiers, 
1991). This model was translated into a five-factor question- 
naire (Zuckerman-Kuhlman Personality Questionnaire, or 
ZKPQ) on the basis of item and factor analyses (Zuckerman, 
Kuhlman, Joireman, Teta, & Kraft, 1993). The five factors 
are sociability, neuroticism-anxiety, impulsive sensation 
seeking, aggression-hostility, and activity. This model was 
used as the framework for a volume on the psychobiology 
of personality (Zuckerman, 1991). 

Cloninger (1987) developed a personality model for 
both clinical description and classification of personality. 
The theory is biologically based and, like Zuckerman’s, 
uses the monoamine neurotransmitters as fundamental de- 
terminants of personality differences. The factors included 
in the most recent version of his questionnaire include nov- 
elty seeking, harm avoidance, reward dependence, persis- 
tence, cooperativeness, persistence, self-directedness, and 
self-transcendence (Cloninger, Przybeck, Svrakic, & Wetzel, 
1994). Much of the recent psychobiological research in per- 
sonality and psychopathology has used Cloninger’s system 
and questionnaires. 

Builders of personality trait models often give different 
names to what are essentially the same traits. But even if one 
goes by the trait labels alone there are obvious similarities in 
what are considered the basic personality traits. Extraversion 
and neuroticism appear in nearly every system. Of course, 
one cannot take their equivalence for granted until empirical 
studies are done of their correlational relatedness. 

Zuckerman et al. (1993) compared Eysenck’s Big Three, 
Costa and McCrae’s Big Five, and Zuckerman and 
Kuhlman’s Alternative Five in a factor-analytic study. A four- 
factor solution accounted for two thirds of the variance. The 
first factor was clearly extraversion, and the second was neu- 
roticism with representative scales from all three question- 
naires highly loading on their respective factors. The third 
factor consisted of Eysenck’s psychoticism and Zuckerman 
and Kuhlman’s impulsive sensation seeking at one pole and 
the NEO conscientiousness at the other. The fourth factor 
was defined by NEO agreeableness at one pole and ZKPQ 
aggression-hostility at the other. The analysis did not yield a 
fifth factor, possibly because of a lack of representative 
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markers in the three tests. Activity loaded on the extraversion 
factor, and openness loaded on the agreeableness factor. 

Zuckerman and Cloninger (1996) compared the scales of 
the ZKPQ with those of Cloninger’s Temperament and Char- 
acter Inventory (TCI). ZKPQ impulsive sensation seeking 
was highly correlated with TCI novelty seeking (r = .68), 
ZKPQ neuroticism-anxiety with TCI harm-avoidance 
(r = .66), ZKPQ aggression-hostility with TCI cooperative- 
ness (r = —.60), and ZKPQ activity with TCI persistence 
(r = 46). These scales showed convergent and discriminant 
cross validity, but the other scales in both tests had weaker 
correlations and correlated equally with several measures on 
the other scales. In Cloninger’s model there is no specific 
scale for extraversion or sociability. 

The personality systems described thus far have been 
developed using factor analyses of trait dimensions. Many 
personologists have developed typologies on a rational- 
theoretical basis. Freud (1914/1957), Erikson (1963), and 
Maslow (1954) described personality types based on their 
developmental theories, each stressing the adult expressions 
of types derived from earlier stages of development. No 
valid methods of assessment were developed to operational- 
ize these theories, although many clinicians continue to use 
them to describe personality differences among patients or 
others. 

More recently, Millon and Everly (1985) defined eight 
types based on the interactions of four primary sources of re- 
inforcement and two kinds of instrumental behavior patterns 
(active and passive). Some of the resultant types resemble 
different poles of the standard dimensions of personality. 
Sociable and introversive personality types resemble the two 
poles of the extraversion dimension; the inhibited type re- 
sembles neuroticism; and the cooperative types sounds like 
agreeableness. The model was developed as a way of inte- 
grating personality development of psychopathology, partic- 
ularly the personality disorders. It has been described as a 
biosocial theory but has not as yet been widely used in 
psychobiological research. 

The examination of the biosocial bases of personality in 
this chapter will be organized around four basic personality 
factors, derived mostly from factor analytic studies, which 
are the same or quite similar across these studies, have some 
similarity to traits described in studies of temperament and 
animal behavior, and have been used in correlational studies 
of traits and psychobiology in humans. The four traits are 
extraversion/sociability, neuroticism/anxiety, aggression/ 
agreeableness, and impulsivity/sensation seeking/psychoti- 
cism. Although activity is a widely used trait in studies of 
children and animals, it has not been widely used in studies of 
humans except for the pathological extreme of hyperactivity 
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disorder and is recognized as a primary personality trait only 
in the Zuckerman-Kuhlman model. 


EXTRAVERSION/SOCIABILITY 


All models of basic personality, with the exception of 
Cloninger’s, recognize extraversion (E) as a primary and 
basic personality factor, but different models have defined it 
differently. In his earlier model Eysenck regarded E as a com- 
bination of two narrower traits: sociability and impulsivity. 
This amalgam was questioned by Carrigan (1960) and 
Guilford (1975), who claimed that sociability and impulsivity 
were independent traits. Sybil Eysenck and Hans Eysenck 
(1963) initially defended the dual nature of extraversion. 
However, the introduction of psychoticism (P) into a new ver- 
sion of their questionnaire resulted in a drift of impulsivity- 
type items to the P dimension, leaving E defined primarily by 
sociability and activity types of items. Hans and Michael 
Eysenck (1985) finally defined E in terms of the subtraits: so- 
ciable, lively, active, assertive, sensation seeking, carefree, 
dominant, surgent, and venturesome. 

Costa and McCrae (1992a) defined their E superfactor in 
terms of subscale facets: warmth, gregariousness (sociabil- 
ity), activity, excitement seeking (sensation seeking), and 
positive emotions. Neither Eysenck nor Costa and McCrae 
now include impulsivity in the E factor; Eysenck now 
includes it in the N superfactor, and Costa and McCrae place 
it in their neuroticism factor. Both Eysenck and Costa and 
McCrae include activity and sensation seeking as compo- 
nents of their E factors. 

Zuckerman et al. (1993) include only sociability and iso- 
lation intolerance in their sociability superfactor. In the alter- 
native five, impulsivity and sensation seeking form another 
primary factor instead of being subsumed under E, and activ- 
ity comprises another major factor. In spite of these differ- 
ences in the content of the E factor in the three models, the 
questionnaire measures of the factors intercorrelate highly 
and have high loadings on a common factor (Zuckerman 
et al., 1993). 


Cortical Arousal 


Eysenck’s (1967) theory of extraversion has shaped much of 
the psychobiological research on this trait even to the end 
of the century (Strelau & Eysenck, 1987). The model suggests 
that introversion-extraversion is based on arousal characteris- 
tics of the cerebral cortex as regulated by the reticulocortical 
activating system. The extravert’s cortex in waking, nonstim- 
ulating conditions is underaroused relative to his or her 


optimal level of arousal. In these conditions the extravert is 
prone to seek out exciting stimulation in order to increase the 
level of arousal to a level that makes him or her feel and func- 
tion better. The introvert is usually closer to an optimal level 
of arousal in low stimulation conditions and has less need to 
seek additional stimulation to feel better. The introvert may be 
overstimulated at a level of stimulation that is positive for the 
extravert. 

The theory was initially tested with measures of brain ac- 
tivity from the electroencephalogram (EEG). Spectrum 
analyses break the raw EEG into bands characteristic of dif- 
ferent degrees of arousal: sleep (delta), drowsiness (theta), 
relaxed wakefulness (alpha), and alert excitement (beta). 
Alpha has often been regarded as inversely related to arousal 
on the assumption that any interruption of this regular wave 
means an increase in arousal. However, some have used the 
frequency of alpha within the usual band (8-13 Hz) as a 
measure of relative arousal or alpha amplitude as an inverse 
measure of arousal. EEG spectrum characteristics are highly 
if not completely heritable (Lykken, 1982). 

The findings relating extraversion to EEG criteria of 
arousal in various conditions from nonstimulating to mentally 
engaged have been summarized by Gale (1983), O’Gorman 
(1984), and Zuckerman (1991). Gale tried to reconcile the 
wide variety of results with the hypothesis that differences be- 
tween introverts and extraverts appear only in moderately ac- 
tive conditions and not in either low stimulation (eyes closed, 
no stimulation) or activating conditions. Both O’Gorman and 
Zuckerman concluded that neither Eysenck’s broad hypothe- 
sis nor Gale’s narrow hypothesis, limiting the prediction to 
specific experimental conditions, were consistently sup- 
ported by studies. Zuckerman noted that among the best stud- 
ies, those confirming Eysenck’s hypothesis used samples 
with either all female or equal male and female participants, 
whereas those with all male or a preponderance of male par- 
ticipants did not support the hypothesis. 

A large study utilizing the full spectrum range of EEG, 
three levels of activating conditions, measures of impulsivity 
as well as E, and a test of the interaction of personality, 
arousal level, and performance, found only weak evidence 
supporting Eysenck’s hypothesis (Matthews & Amelang, 
1993). Correlations of .16 (about 3% of the variance) were 
found between activation in the low arousal bands (delta and 
theta) and E and one of its components, impulsivity. These 
correlations controlled for the influence of the other two 
Eysenck factors, neuroticism and psychoticism. The sociabil- 
ity component of E was not related to any index of cortical 
arousal. The significant results linking E to low arousal bands 
were found only in the least stimulating condition (reclining, 
eyes closed). The fact that the differences were not found in 


alpha or beta bands but were found only in the most relaxed 
condition suggests that the weak correlation may have been 
due to impulsive extraverts’ getting drowsy or actually falling 
asleep. Regardless of interpretation, the low level of relation- 
ship between personality and arousal in this study could ex- 
plain the inconsistency of previous studies testing the 
hypothesis: They simply did not have enough power to detect 
the relationship with any reliability. 

Consistent with Eysenck’s model was the finding that 
while performing six tasks extraverts tended to perform 
worse than introverts at higher levels of alpha (indicating 
lower levels of arousal). Only the alpha band, however, sup- 
ported the hypothesis of better performance of introverts 
at lower levels of arousal. Brain imaging using positron- 
emission tomography (PET) and cerebral blood flow (CBF) 
have an advantage over EEG because they assess subcortical 
as well as cortical activation and analyze activity in particu- 
lar structures or brain loci. The problem with studies using 
these new techniques is that because of the expense, low num- 
bers of subjects are used and many brain areas are analyzed, 
increasing the possibilities of both Type I and Type II errors. 
Replication across studies is one solution to the problem. 

Mathew, Weinman, and Barr (1984) found negative corre- 
lations between E and CBF indices of activation in all corti- 
cal areas in both hemispheres, supporting Eysenck’s 
hypothesis of higher cortical arousal in introverts than in ex- 
traverts. All of their participants were female. Stenberg, 
Wendt, and Risberg (1993) also found an overall negative 
correlation (r = —.37), but this was a function of the high 
correlation among the female participants; the correlation 
among the males was close to zero. As with the EEG data, 
confirmation of the hypothesis was more common in female 
than in male samples. 

Some studies have found hemispheric differences in the 
relationships between E and activation, but these have not 
been consistent (Johnson et al., 1999; Stenberg et al., 1993). 
Studies of subcortical areas of brain have also yielded little in 
the way of consistent findings except for one: E is associated 
positively with activation of the anterior cingulate area 
(Ebmeier et al., 1994; Haier, Sokolski, Katz, & Buchsbaum, 
1987; Johnson et al., 1999). The cingulum is the major path- 
way between the frontal cortex and the limbic system and has 
been theoretically associated with neuroticism and anxiety 
rather than E (Zuckerman, 1991). 

The results in the two brain imaging studies described, un- 
like the EEG studies, tend to support Eysenck’s hypothesis of 
a relationship (albeit a weak one) between E and cortical 
arousal. There is no clue in his theory, however, why the find- 
ing is supported more in females than in males or why sub- 
cortical differences in the cingulum, the executive structure 
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of the limbic brain, should be associated with extraversion. In 
Eysenck’s model limbic arousability is associated with neu- 
roticism, and any association with E would be with introver- 
sion rather than extraversion. 

General arousal may be too broad a construct to be associ- 
ated with personality. Arousal is highly dependent on diurnal 
variation and general stimulation levels. Arousal as a trait 
would represent the state of the nervous system at a given 
time under a given set of conditions. In contrast, arousability 
is the typical immediate reaction of some part of the nervous 
system to a stimulus with specified characteristics. Eysenck’s 
(1967) optimal level of stimulation model says that introverts 
are more arousable at low to moderate intensities of stimula- 
tion, but at higher intensities extraverts are more responsive. 
Introverts have strong reactive inhibition mechanisms that 
dampen response to high intensities. Strelau (1987), in a 
model based on neo-Pavlovian theories, states that persons 
with strong nervous systems are relatively insensitive to 
stimuli at lower intensities but can process and react to stim- 
uli at higher intensities. For weak nervous system types the 
opposite is true: They are highly sensitive to low intensities 
but show inhibition of response at high intensities. 


Cortical Arousability 


Cortical arousability is usually assessed with the cortical 
evoked potential (EP). A brief stimulus, such as a tone or 
flash of light, is presented a number of times, and the EEG is 
digitized at a fixed rate, that is time locked to stimulus deliv- 
ery time and averaged across trials for a given participant. 
This process averages out the “noise” and produces a clear 
waveform representing the typical reaction of that subject to 
the specific stimulus over a 500-ms period. Although laten- 
cies of response vary somewhat for individuals, for most one 
can identify particular peaks of positivity and negativity. For 
instance, a peak of positive potential at about 100 ms after the 
stimulus (P1) represents the first impact of the intensity char- 
acteristics of stimuli on the cortical centers. Earlier peaks 
represent stimulus processing at subcortical centers. The peak 
at 300 ms after the stimulus (P3) is influenced by novelty, sur- 
prise, or unexpectedness of the stimulus and thus represents a 
higher level of cortical processing in that the stimulus must be 
compared with previous stimuli. 

Stelmack (1990) reviewed the relationship between E and 
cortical EPs. As might be expected, the results depend on the 
characteristics of the stimuli used to evoke the EPs as well as 
the reactor’s age and personality characteristics. For instance, 
Stelmack said that introverts have greater amplitude EPs in 
response to low-frequency tones, but there are no differences 
between introverts and extraverts for high-frequency tones. 
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If the stimulus attribute had been intensity, these kinds of re- 
sults might be compatible with Eysenck’s theory of increased 
sensitivity of introverts to low-intensity stimuli. But the evo- 
lutionary type of explanation offered by Stelmack for the 
greater survival significance of low-frequency sounds is not 
convincing. 

Recent studies have focused on the P300 EP component, 
many using the “odd-ball” paradigm in which the participant 
listens with eyes closed to a sequence of tones in which one 
tone is presented frequently and another one (the oddball) 
rarely. The rare tone is the signal for some task. These are 
usually vigilance tasks on which extraverts’ performances 
and EP reactions are expected to decline more rapidly than 
those for introverts. However, when the task is made less 
montonous or response requirements are high, the differ- 
ences may disappear or even be reversed with larger EP am- 
plitudes in extraverts (Stenberg, 1994). 

The intensity of the stimulus is another factor in the I-E 
difference. Brocke, Tasche, and Beauducel (1997) found that 
introverts showed larger P3 reactions to a 40-db stimulus, 
whereas extraverts showed a larger amplitude of P3 in re- 
sponse to a 60-db stimulus. Introverts’ EP amplitudes de- 
creased going from 40 db to 60 db, whereas extraverts 
increased going from the less intense to the more intense 
stimulus. These effects were a function of the impulsivity 
component rather than the sociability component of the E 
scale used in the study. The results of studies that vary the 
experimental conditions suggest that attention and inhibition 
may be the basic mechanisms governing the nature of the 
relationship between E and cortical EPs. Responses at 
the brain-stem level are probably less susceptible to these 
mechanisms, and Eysenck’s theory does involve the brain 
stem and other points along the reticulocortical arousal 
system in I and E. 

Stelmack and Wilson (1982) found that extraverts had 
longer latencies for the EP subcortical wave V (inferior 
colliculus) for stimulus intensity levels up to but not includ- 
ing 90 db. The direction of the finding was confirmed in a 
second experiment (Stelmack, Campbell, & Bell, 1993) and 
in a study by Bullock and Gilliland (1993). Different doses of 
caffeine and levels of task demand were used in the latter 
study, but the differences between extraverts and introverts 
held across all levels of caffeine and task demand. The results 
support Eysenck’s theory more strongly than those using cor- 
tical EPs, which seem more susceptible to stimulus, task, and 
background arousal factors. A study by Pivik, Stelmack, and 
Bylsma (1988), however, suggested that Eysenck’s arousal- 
inhibition hypothesis may not be broad enough. These re- 
searchers measured the excitability of a spinal motoneuronal 
reflex in the leg and found that extraverts showed reduced 


motoneuronal excitability as measured by reflex recovery 
functions. These results show that the inhibitory properties of 
the nervous system related to E may extend well below the 
reticulocortical level. 

Another line of EP research is based on Gray’s (1982, 
1987) model of personality. Gray proposed that impulsivity, a 
dimension close to extraversion, is related to sensitivity to 
signals (conditioned stimuli) of reward whereas anxiety, 
close to neuroticism, is related to sensitivity to signals of pun- 
ishment. This model suggests that the learned biological 
significance of stimuli, in addition to the intensity of stimula- 
tion, governs the strength of reaction to them. 

Bartussek, Diedrich, Naumann, and Collet’s (1993) results 
supported the theory by showing a stronger EP response (P2, 
N2) of extraverts than introverts to tones associated with 
reward (winning money) but no differences in tones associ- 
ated with punishment (losing money). In a later experiment, 
however, extraverts showed larger P3 EP amplitudes to stim- 
uli associated with both reward and punishment compared to 
neutral stimuli (Bartussek, Becker, Diedrich, Naumann, & 
Maier, 1996). 

DePascalis and his colleagues also presented findings sup- 
porting Gray’s theory. In one study they used a questionnaire 
scale developed more directly from Gray’s theory measuring 
the approach tendency (DePascalis, Fiore, & Sparita, 1996). 
Although they found no effect for E itself, the participants 
scoring high on the approach scale had higher EP (P6) ampli- 
tudes in response to stimuli (words) associated with winning 
than to those associated with losing, and the reverse was true 
for low-approach motive subjects. 

Eysenck’s and Gray’s theories have also been tested using 
peripheral autonomic measures of activity like the electroder- 
mal activity (EDA), or skin conductance (SC), heart rate 
(HR), and blood pressure (BP). These are only indirect mea- 
sures of cortical activity and reactivity because they occur in 
the autonomic nervous system (ANS) and are controlled by 
limbic system centers, which in Eysenck’s model are associ- 
ated more closely with neuroticism than with E. The results in 
relation to E are similar to those obtained with more direct 
cortical measures. Reviews by Smith (1983) and Stelmack 
(1990) showed mixed and inconclusive findings relating tonic 
EDA arousal to E, but some evidence of stronger SC re- 
sponses of introverts than extraverts in response to low-to 
moderate-intensity stimuli and stronger responses of ex- 
traverts in response to high-intensity stimulation. Tonic 
(base-level) measures of HR (Myrtek, 1984) and BP 
(Koehler, Scherbaum, Richter, & Boettcher, 1993) are unre- 
lated to E. Young children rated as shy and inhibited had 
higher and less variable HRs, and a high HR at 21 months is 
the same behavior pattern at 48 months (Kagan, Reznick, & 


Snidman, 1988). Shyness and inhibition, however, are traits 
that are a mixture of introversion and neuroticism or anxiety; 
therefore, the correlation with HR could be due to the anxiety 
component rather than to E. 

Eysenck’s model for the trait of extraversion produced a 
great deal of research in the area of psychophysiology. But 
psychophysiology has its problems as a branch of neuro- 
science. Both tonic and phasic psychophysiological measures 
are highly reactive to environmental conditions. Tonic levels 
can vary as a function of reactions to the testing situation it- 
self, and phasic reactions depend on the specific qualities of 
stimulation such as intensity and novelty. It is not surprising 
that the relationships of physiological measures with person- 
ality traits often interact with these stimulus characteristics in 
complex ways. Eysenck’s theory based on optimal levels of 
stimulation has received some support. Those based on dif- 
ferences in basal arousal levels are beginning to receive some 
support from PET studies, although the earlier results with 
EEG measures remain problematic. 


Monoamines 


The monoamine neurotransmitter systems in the brain have 
been the focus of most biosocial theories of personality. The 
reasons are the evidence of their involvement in human emo- 
tional and cognitive disorders and basic emotional and moti- 
vational systems in other species. Much of the work with 
humans has been correlational, comparing basal levels of the 
neurotransmitters, as estimated from levels of their metabo- 
lites in cerebrospinal fluid (CSF), blood, or urine, to person- 
ality traits as measured by questionnaires. Of these sources 
CSF is probably the best because the CSF is in direct contact 
with the brain. But the indirect relationship of these indica- 
tors with brain levels of activity (which can differ in different 
brain loci) and the fact that some of the metabolites in plasma 
and urine are produced in the peripheral nervous system 
make the putative measures of brain amine activity problem- 
atic. New imaging methods may eventually overcome these 
problems by directly viewing the monoamine activities in 
the brain itself. Added to these problems of validity of mea- 
surement is the use of small numbers of subjects in most 
studies, as well as the use of subjects with certain types of 
disorders rather than normal subjects. The ethical constraints 
of giving drugs that affect activity in the brain systems is an- 
other barrier, although some of the more recent studies have 
used such drugs in normals. 

The freedom of investigators to experiment directly with 
the brain in other species has given us a fairly coherent picture 
of the emotional and motivational functions of the 
monoamine systems in the brain, and bottom-up theorists 
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have used these findings to extend animal models to human 
motivations and personality (Gray, 1982, 1987; Mason, 1984; 
Panksepp, 1982; Soubrié, 1986; Stein, 1978). Top-down the- 
orists have drawn on these findings from the comparative re- 
search but have attempted to reconcile them with the relevant 
research on humans, including clinical and personality studies 
(Cloninger, Svrakic, & Prszybeck, 1993; Depue & Collins, 
1999; Netter, Hennig, & Roed, 1996; Rammsayer, 1998; 
Zuckerman, 1991, 1995). The problem with building a bridge 
from two banks is to make it meet in the middle. With these 
caveats let us first examine the case for extraversion. 

The primary monoamines in the brain are norepinephrine, 
dopamine, and serotonin. The first two are labeled cate- 
cholamines because of the similarities in their structures. 
Serotonin is an indoleamine. These are not independent neu- 
rotransmitter systems because activity in one may affect ac- 
tivity in another. Serotonin, for example, may have 
antagonistic effects on the catecholamines. These kinds of 
interaction must be kept in mind because most studies relate 
one neurotransmitter to one personality trait. Some models 
suggest that this kind of isomorphism of trait and transmitter 
is the rule. This is a new kind of phrenology based on bio- 
chemistry rather than bumps on the head. 

To understand the human research one needs to know the 
pathways of biosynthesis and catabolism (breakdown) of the 
monoamines because some experiments block the precursors 
of the transmitter to see its effect on behavior and most use 
metabolite products of the catabolism to gauge activity in 
the systems. Figure 4.2 is a simplified diagram showing the 
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Figure 4.2 Biosynthesis and breakdown of the monoamines dopamine, 
norepinephrine, and serotonin. 

Note. COMT = catechol-O-methyltransferase; MAO = monoamine oxi- 
dase; HVA = homovanillic acid; DBH = dopamine 8-hydroxylase; MHPG = 
3-methoxy-4-hydroxyphenylglycol; 5-HIAA = 5-hydroxyindoleacetic acid. 
From Psychobiology of personality, p. 177, by M. Zuckerman, 1991, 
Cambridge: Cambridge University Press. Copyright 1991 by Cambridge 
University Press. Reprinted by permission. 
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stages of production of the monoamines and some of the en- 
zymes (DBH, COMT, MAO) involved in the conversions 
from one stage to another. The metabolite for dopamine is ho- 
movanillic acid (HVA), for norepinephrine it is 3-methoxy-4- 
hydroxyphenylglycol (MHPG), and for serotonin it is 
5-hydroxyindoleacetic acid (5-HIAA). 

Theorists are in fair agreement on the role of dopaminergic 
systems in motivation based on studies of other species: ap- 
proach and sensitivity to stimuli associated with reward 
(Crow, 1977; Gray, 1982, 1987; Stein, 1978); foraging and ex- 
ploration and positive emotions like hope, desire, and joy in 
humans (Panksepp, 1982; Zuckerman, 1991); and novelty or 
sensation seeking in animals and humans (Bardo, Donohew, & 
Harrington, 1996; Cloninger et al., 1993; Le Moal, 1995; 
Zuckerman, 1984, 1991). I have proposed that the activity of 
the mesolimbic dopamine system is related to a broad ap- 
proach trait that includes extraversion, sensation seeking, and 
impulsivity (Zuckerman, 1991). Considering that dopaminer- 
gic reactivity is also related to aggression and sexuality in 
many species, it is also possible that the third dimension 
of personality, low socialization, or psychoticism, may also 
be involved. Gray’s (1987) model linked dopamine and re- 
ward sensitivity with impulsivity, a dimension related to high 
E, P, and N, although his more recent remarks (Gray, 1999) 
suggest that he is linking dopamine more closely with the P 
dimension because of this transmitter’s involvement in 
schizophrenia. 

Depue and Collins (1999) defined a broad view of extra- 
version with two main factors: interpersonal engagement, or 
affiliation and warmth, and agency, which includes social 
dominance, exhibitionism, and achievement motivation. Pos- 
itive affect and positive incentive motivation are more 
strongly associated with the agentic extraversion factor. Im- 
pulsivity and sensation seeking are regarded as constituting an 
emergent factor representing a combination of extraversion 
and constraint (a dimension related to Eysenck’s P and Costa 
and McCrae’s conscientiousness). The “lines of causal neuro- 
biological influence” are suggested to lie along the orthogonal 
dimensions of extraversion and constraint rather than along 
the dimension of impulsive sensation seeking. Although 
Depue and Collins say that this structural system does not 
mean that positive incentive motivation and its dopaminergic 
basis are related only to extraversion, the expectation is that 
they will be more strongly related to agentic extraversion than 
to impulsive sensation seeking or constraint. 

Only a few correlational studies of monoamine CSF 
metabolites and personality traits were done prior to 1991 
(Zuckerman, 1991), and they generally showed few signifi- 
cant relationships between the dopamine metabolite HVA 


and either extraversion or sensation seeking. This is still the 
case with studies that simply correlate CSF levels of HVA 
with questionnaire measures of extraversion, even when 
there is sufficient power to detect weak relationships (Limson 
et al., 1991). In fact, the Limson et al. study failed to find 
any correlations between CSF metabolites of serotonin 
(5-HIAA), norepinephrine (MHPG), norepinephrine itself, 
and Dopac and any of the personality measures assessed by 
the Minnesota Multiphasic Personality Inventory (MMPI), 
Eysenck Personality Questionnaire (EPQ), or Cloninger’s 
Temperament Character Inventory (TCI). As with psy- 
chophysiological measures, levels of neurotransmitter activ- 
ity in a resting basal state are not sensitive to variations in 
personality, at least as the latter is measured in self-report 
questionnaires. However, studies that attempt to potentiate or 
attenuate activity in neurotransmitters with agonists or antag- 
onists have yielded some significant findings in regard to per- 
sonality, even though they typically use very small sample 
sizes. 

Depue, Luciana, Arbisi, Collins, and Leon (1994) chal- 
lenged the dopamine system with bromocriptine, a potent 
agonist at D, receptor sites, and measured the effects using 
inhibition of prolactin secretion and activation of eye-blink 
rate, two measures of dopamine activation. The correlations 
between Positive Emotionality (PE) and baselline measures 
of the dopamine activity indicators were small and insignifi- 
cant, but they found significant correlations between the pu- 
tative measures of dopamine response to the agonist and the 
PE (an extraversion type measure) factor from Tellegen’s 
MPQ. Rammsayer (1998, 1999) challenged Depue et al.’s 
interpretation of their findings as indicative of higher 
dopamine reactivity in high-PE persons (extraverts) than in 
lows, suggesting that the prolactin response would indicate 
just the reverse (i.e., higher reactivity in the low-PE persons). 
The disagreements on the meaning of the data are too com- 
plicated to elucidate here. 

Rammsayer’s interpretation of the findings is supported by 
PET measures of higher cerebral blood flow to the dopamine- 
rich basal ganglia areas in introverts than in extraverts (Fischer, 
Wik, & Fredrikson, 1997); but another PET study found no re- 
lationship between E and dopamine binding in the basal gan- 
glia (N. S. Gray, Pickering, & Gray, 1994), and still another 
found a positive relationship with E (Haier et al., 1987). The 
first two of these studies used normal controls as subjects 
whereas the Haier et al. study used patients with Generalized 
Anxiety Disorder, a possible confounding factor. 

Rammsayer, Netter, and Vogel (1993), using an inhibiter 
of tyrosine hydroxlase, thereby blockading dopamine synthe- 
sis, found no difference between introverts and extraverts in 


either baseline dopamine or reactivity to the blockading 
agent. Despite the lack of difference in dopaminergic activity 
or reactivity, they found that reaction time performance was 
markedly impaired in introverts but not in extraverts by the 
dopamine blockading agent. In another study, using a chem- 
ical that selectively blocks D, receptors and inhibits 
dopamine neurons in the limbic and cortical regions of the 
brain, Rammsayer (1998) again found a detrimental effect on 
reaction (liftoff) time in introverts but not in extraverts. The 
agent that was used caused a marked decrease in alertness 
and cortical arousal, but this effect was equivalent in intro- 
verts and extraverts. Both this finding and the performance 
findings would seem to contradict Eysenck’s arousal expla- 
nation for the differences between introverts and extraverts. 
That theory would predict a more detrimental effect in ex- 
traverts because they supposedly start with a lower level of 
cortical arousal. But the results also raise the question, What 
is the source of the performance differences between intro- 
verts and extraverts if they do not differ in dopamine activity 
or reactivity? 

The answer might lie in the interactions of dopaminergic 
and other neurotransmitters or hormones or, at another level, 
in the genetics of the dopaminergic receptors. Considerable 
interest has developed in a gene associated with the dopamine 
receptor 4 (DRD4). Allelic variations in this gene have been 
associated with novelty or sensation seeking, but not with ex- 
traversion (Ebstein, Nemarov, Klotz, Gritsenko, & Belmaker, 
1997; Ebstein et al., 1996). 

Simple correlative studies have found no relationship be- 
tween serotonin or norepinephrine and E or other personality 
variables measured by questionnaires given to adult subjects. 
A study using CSF from newborns in predicting tempera- 
mental traits found that infants born with low levels of 
the serotonin metabolite 5-HIAA showed low sociability at 
9 months of age (Constantino & Murphy, 1996). Retest relia- 
bility for 5-HIAA in neurologically normal infants was very 
high (r = .94). 

A study of adults with depressive disorder treated with 
either a noradrenergic or a serotonergic reuptake inhibiter, 
which increase activity in those systems, showed that there 
were significant increases in measures of E and gregarious- 
ness (sociability) in those treated with these drugs (Bagby, 
Levitan, Kennedy, Levitt, & Joffe, 1999). The change in E was 
correlated with the change in depression severity, but the 
change in sociability was not. Although the result with socia- 
bility probably represents a change of state rather than the 
preillness trait, serotonin and norepinephrine might play some 
role in the trait as well. Studies of serotonin transporter genes 
have not shown any relationship to E, although they have to 
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other personality traits (Hamer, Greenberg, Sabol, & Murphy, 
1999; Jorm, Henderson, Jacomb, Croft, & Easteal, 1997). 


Monoamine Oxidase 


Monoamine oxidase (MAO) is an enzyme involved in the 
catabolic deamination of monoamines. Evidence using selec- 
tive monoamine inhibitors suggests that MAO-Type B, as- 
sayed from blood platelets in humans, is preferentially 
involved in the catabolic breakdown of dopamine more than 
the other two brain monoamines, norepinephrine and 
dopamine (Murphy, Aulakh, Garrick, & Sunderland, 1987). 
Although no direct correlation of platelet and brain MAO has 
been found, indirect assessments and the effects of MAO in- 
hibitors on depression, as well as a large body of behavioral 
data, suggest that there must be a connection, if only one lim- 
ited to certain brain areas. Platelet MAO is normally distrib- 
uted in the human population, is highly reliable although it 
increases in brain and platelets with age, and is lower in men 
than in woman at all ages, and variations are nearly all ge- 
netic in origin. Unlike other biochemical variables it does not 
vary much with changes in state arousal. Thus, MAO has all 
of the characteristics of a biological trait. 

Low levels of MAO-B taken from umbilical cord blood 
samples in newborn infants were related to arousal, activity, 
and good motor development (Sostek, Sostek, Murphy, 
Martin, & Born, 1981). High levels of the enzyme were re- 
lated to sleep time and general passivity. The relationship 
with motor development is particularly suggestive of devel- 
opment of the dopamine-influenced basal ganglionic areas of 
the brain involved in motor coordination. In a study of mon- 
keys living in a colony in a natural environment, low-platelet 
MAO was related to high sociability, activity, dominance, and 
sexual and aggressive activity, a broad array of E-type traits 
described by Depue and Collins (1999) as agentic extraver- 
sion. However, in human correlative studies the results relat- 
ing MAO-B to questionnaire-measured extraversion have 
been inconsistent (Zuckerman, 1991). The enzyme has more 
consistently correlated (inversely) with the trait of sensation 
seeking. But using reported behavioral indices of sociability 
in college students, low MAO was related to sociability and 
high MAO to social insolation (Coursey, Buchsbaum, & 
Murphy, 1979). 


Hormones 


The hormone testosterone (T) is produced by both men and 
women but is 8 to 10 times as high in men as in women. 
Plasma T is highly heritable (66%) in young adult males and 
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moderately heritable (41%) in females (Harris, Vernon, & 
Boomsa, 1998). In rats T has reward effects in the nucleus ac- 
cumbens, the major site of dopaminergic reward. Administra- 
tion of a dopamine receptor blocker eliminates the rewarding 
effects of T in rats, suggesting that its rewarding effects are 
mediated by an interaction with dopamine in the mesolimbic 
system (Packard, Schroeder, & Gerianne, 1998). 

The hormone T affects personality traits and may account 
in part for many of the personality trait differences between 
men and women. Men and women do not differ on the pure so- 
ciability or affiliative type of extraversion, but they do on the 
agentic type, which includes dominance, assertiveness, sur- 
gency, and self-confidence. To the extent that sensation seek- 
ing is associated with extraversion, it is with the agentic type. 

Daitzman and Zuckerman (1980) found that T in young 
males was positively correlated with sociability and extraver- 
sion, as well as with dominance and activity and inversely with 
responsibility and socialization, indicating an association with 
the agentic type of extraversion. Windle (1994) also found that 
testosterone was associated with a scale measuring behavioral 
activation, characterized by boldness, sociability, pleasure 
seeking, and rebelliousness. Dabbs (2000) also found that T is 
associated with a type of extraversion characterized by high 
energy and activity levels and lower responsibility. 


Summary 


Eysenck’s theory relating cortical arousal to extraversion has 
been extensively tested using the EEG and, in more recent 
times, the brain scanning methods. The EEG studies yielded 
mixed results in which the sources of differences between stud- 
ies were not clearly apparent. Two cerebral blood flow studies 
did confirm that extraverts were cortically underaroused re- 
lated to introverts in female subjects but not in males. Studies 
measuring cortical arousability have also not clarified the pic- 
ture. Apparently, experimental conditions affecting attention 
or inhibition may confound the relationship with E. Some more 
consistent results have been obtained from EP studies of re- 
sponses at subcortical levels in which conscious attention is 
less of a factor. Although Eysenck’s theory is confined to corti- 
cal arousal and reactivity, differences between introverts and 
extraverts have been found at lower levels of the central ner- 
vous system, even in a spinal motoneuronal reflex. 

Theories of the biochemical basis of extraversion have fo- 
cused on the monoamine neurotransmitters, particularly 
dopamine. Simple correlational studies between the 
monoamine metabolites and trait measures of E have not 
yielded significant findings, although there is some evidence 
that drugs that increase noradrenergic or serotonergic activity 
in depressed patients also increase their extraversion and 


sociability. This may be an indirect effect of the reduction in 
depression rather than a direct effect on E. The enzyme 
MAO-B is involved in regulation of the monoamines, partic- 
ularly dopamine. Low levels of MAO have been related to 
arousal and activity in newborn human infants and to socia- 
ble behavior in adult humans and monkeys. These results 
suggest that a dysregulation of the dopamine system may be 
a factor in extraversion even in its earliest expression in the 
behavior of newborns. The hormone testosterone is related to 
E, but more so to E of the agentic type, which is the type 
characterized by dominance, assertiveness, surgent affect, 
high energy levels, activity, and irresponsibility, rather than 
simple sociability and interest in social relationships. This 
distinction between the two types of E has been hypothesized 
to be crucial for the relationship between dopamine and E as 
well (Depue & Collins, 1999). 


NEUROTICISM/ANXIETY/HARM AVOIDANCE 


Although the broad trait of neuroticism/anxiety includes 
other negative emotions, such as depression, guilt, and hostil- 
ity, and character traits such as low self-esteem, neuroticism 
and anxiety are virtually indistinguishable as traits. Neuroti- 
cism is highly correlated with measures of negative affect, 
but when the negative affect was broken down into anxiety, 
depression, and hostility components, anxiety had the highest 
correlation, and hostility the lowest, with the N factor while 
depression was intermediate (Zuckerman, Joireman, Kraft, & 
Kuhlman, 1999). Hostility had a higher relationship to a fac- 
tor defined by aggression. 

Eysenck (1967) assumed a continuity between N as a per- 
sonality trait and anxiety disorders. Indeed, N is elevated in 
all of the anxiety and depressive mood disorders, and longi- 
tudinal studies show that the trait was evident in most persons 
before they developed the symptoms of the clinical disorder 
(Zuckerman, 1999). In the first half of the twentieth century, 
when little was known about the role of the limbic system in 
emotions, the biological basis of neuroticism and anxiety trait 
was related to overarousal or arousability of the sympathetic 
branch of the autonomic nervous system. Such arousal is ap- 
parent in state anxiety elicited by anticipation of some kind of 
aversive stimulus or conditioned stimuli associated with 
aversive consequences. 

Autonomic overarousal is apparent in the primary symp- 
toms of many anxiety disorders. On the assumption of conti- 
nuity between the N trait and these disorders, it was expected 
that autonomic arousal, as assessed by peripheral measures 
such as heart rate (HR), breathing rate (BR), blood pressure 
(BP), and electrodermal activity (EDA), would be correlated 
with N. In Eysenck’s (1967) theory, N was ultimately based 


on reactivity of the limbic system, which regulates the ANS, 
but he did not distinguish particular pathways, structures, or 
neurotransmitters within that system that were involved in N. 
Some theories did not even make a distinction between corti- 
cal and autonomic arousal in emotions. Eysenck felt that 
there was some correlation between the two kinds of arousal 
because of collaterals between the limbic and ascending 
reticulocortical system. Gray (1982) and others, extrapolat- 
ing from experimental studies of animals, delineated specific 
limbic systems involved in anxiety and the neurotransmitters 
involved in these systems. Neuroimaging studies have at- 
tempted to extend these brain models to humans. 


Autonomic Arousal 


Large-scale studies of the relationship between cardiovascu- 
lar measures, either in resting levels of activity or reactivity 
to stressful experimental situations, and Measures of N failed 
to reveal any significant relationships (Fahrenberg, 1987; 
Myrtek, 1984). On the assumption that high cardiovascular 
activity put high-N subjects at risk for cardiovascular disease, 
Almada et al. (1991) investigated the relation between mea- 
sures of N and subsequent health history in nearly 2,000 men. 
N was not associated with systolic BP or serum cholesterol 
but was associated with cigarette smoking and alcohol con- 
sumption. When tobacco and alcohol consumption were held 
constant there was no relationship between N and cardiovas- 
cular disease. Similar studies have failed to find any relation- 
ships between electrodermal activity and N or trait anxiety 
(Fahrenberg, 1987; Hodges, 1976; Naveteur & Baque, 1987). 

Given the fact that many anxiety disorders do show ele- 
vated heart rate and electrodermal reactivity, how can we 
explain the lack of correlation with N? The answer may lie in 
the difference between generalized anxiety disorder (GAD) 
and panic disorder (PD), agoraphobia (Ag), and obsessive- 
compulsive disorder (OCD). Whereas the latter (PD, Ag, 
OCD) show elevated basal HRs and frequent spontaneous 
SCRs, GAD patients show little evidence of this kind of 
autonomic arousal (Zuckerman, 1991). Their anxiety is 
expressed cognitively (worry) and in symptoms of muscle 
tension such as fatigue. In contrast, PD, Ag, and OCD pa- 
tients complain of autonomic symptoms, such as accelerated 
heart rate, even when they are not experiencing an actual 
panic attack (Zuckerman, 1999). Most persons who are high 
on N probably represent subclinical GAD disorder rather 
than the other types of anxiety disorders. 


Brain Arousal 


Studies of general cortical arousal using the EEG have 
historically focused on E, but some of these studies found 
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interactions with N. These effects were inconsistent; some 
found higher and some reported lower arousal for high-N 
persons. Application of PET methods has not shown any as- 
sociation of general cortical or limbic arousal with N in situ- 
ations that were not emotionally provoking (Fischer et al., 
1997; Haier et al., 1987). Similar results are seen in anxiety 
patients; but when anxiety is provoked in patients by present- 
ing them with feared stimuli, increased activity is seen in 
areas like the orbitofrontal cortex, insular cortex, temporal 
cortex, and anterior cingulate (Breier et al., 1992; Rauch et 
al., 1995). These studies identify an anxiety pathway in hu- 
mans (orbitofrontal-frontal to cingulate to temporal lobe and 
amygdala) already established in animals, but they do not 
show a preexisting sensitivity of this pathway in normals 
scoring high in N. Another study of anxiety patients in non- 
stimulated conditions, which did use normal controls, found 
that whole brain blood flow did not distinguish anxiety 
patients from normals but did find a negative correlation be- 
tween a depression scale and caudate activation. The previ- 
ously mentioned study by Canli et al. (2001) found that in a 
small sample of normal women N correlated with increased 
brain activation to negative pictures (relative to activation by 
positive pictures) in left-middle frontal and temporal gyri and 
reduced activation in the right-middle frontal gyrus. Taken 
together, the clinical studies and this last study of normals 
suggests that whole brain activation does not vary with N- 
Anx, but given negative emotional provocation there may be 
a reactive disposition in frontal cortex of high-N persons that 
activates a pathway through the orbitofrontal cortex around 
the cingulum to the temporal lobe and amygdala. 

Davis (1986) argued that the central nucleus of the amyg- 
dala is a major center where the input of fear-provoking stim- 
uli is organized and where output to various intermediate 
nuclei organizes the entire range of behavioral, autonomic, 
and neurotransmitter reactions involved in panic or fear. A 
recent MRI study (van Elst, Woermann, Lemieux, & Trimble, 
1999) found an enlargement of left and right amygdala vol- 
umes in epileptic patients with dysthymia (a chronic kind of 
neurotic depression). Amygdala volume within the group did 
not correlate with trait or state anxiety but did correlate posi- 
tively with a depression inventory. Because anxiety and 
depression are usually highly correlated and both correlate 
highly with N, it is not clear why depression alone was re- 
lated to amygdala volume. 


Monoamines 


Much of the recent exploration of the role of the monoamines 
in N-Anx have been based on Cloninger’s (1987) biosocial 
model of personality and therefore used his scale of Harm 
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Avoidance (HA) instead of the N or anxiety trait scales used 
by other investigators. HA, however, is not a pure scale of the 
N factor but lies between the E and N dimensions, constitut- 
ing a measure of introverted neuroticism. It is defined in the 
same way that Gray defines trait anxiety: a sensitivity to cues 
associated with punishment and nonreward (frustration) and 
a tendency to avoid them. 

Gray’s (1982) model suggests that norepinephrine in the 
dorsal ascending noradrenergic system (DANA) originating 
in the locus coeruleus is the major neurotransmitter involved 
in anxiety, although high levels of serotonin may mediate the 
behavioral inhibition that is associated with high levels of 
anxiety. Redmond (1977), from a psychiatric viewpoint, sees 
the DANA as an alarm system at lower levels and a panic 
provoker at high levels of activity. In contrast to these two 
theorists, Cloninger, Svrakic, and Przybeck (1993) proposed 
that high levels of serotonin activity underlie the trait of HA 
whereas norepinephrine activity is related to another trait 
called Reward Dependence. 

In patients there has been little evidence of higher levels 
of the norepinephrine metabolite MHPG in anxiety patients 
compared to normals, although a more recent study by 
Spivak et al. (1999) showed higher levels of MHPG in 
plasma of patients with combat-related posttraumatic stress 
disorder than in controls. 

The alpha-2 receptor functions as a homeostatic regulator 
of the norepinephrine systems, tuning them down when ex- 
cessive neurotransmitter levels are detected in the synapse. 
Yohimbine is a antagonist to this receptor and therefore po- 
tentiates the activity of the norepinephrine system, just as a 
broken thermostat results in an overheated room. Yohimbine 
increases MHPG levels and provokes panic attacks in pa- 
tients with panic disorders, although it does not have these ef- 
fects in normal controls (Charney & Heninger, 1986). 
Cameron et al. (1996) replicated a previous result finding a 
decreased number of alpha-2 receptors in panic disorder. One 
might extrapolate that MHPG should correlate with N or anx- 
iety over the range in normals and other patient groups. How- 
ever, as noted earlier, high N in normals may resemble GAD 
more than panic disorder. Heinz, Weingarten, Hommer, 
Wolkowitz, and Linnoila (1999) reported a high correlation 
between CSF MHPG and an anxiety scale in a combined 
group of abstinent alcoholics and normals. A stress resistant 
group, defined by N and similar measures, had lower plasma 
MHPG after a mild stressor than did a nonresistant (high-N) 
group (de Leeuwe, Hentschel, Tavenier, & Edelbroek, 1992). 
Norepinephrine may be one of the factors underlying N, but 
it may be the dysregulation of norepinephrine by a lack of the 
receptors needed for this and a consequent tendency to be un- 
able to cope with stress, rather than the basal level of activity 
in the norepinephrine system, which is related to N. 


Cloninger’s biosocial theory of personality proposes that 
the trait of harm avoidance is related to behavioral inhibition 
mediated by serotonergic activity in the brain. Earlier studies 
showed no correlation between between CSF levels of the 
serotonin metabolite, 5-HIAA, and N. A more recent study 
has found a positive correlation between CSF 5-HIAA and N 
but in a sample of depressed patients (Roy, 1999). Constan- 
tino and Murphy’s (1996) study of the prediction of infant 
temperament from CSF levels of 5-HIAA showed no rela- 
tionships between this metabolite and emotionality, sootha- 
bility, or activity in infants. 

Studies of normals using serotonin challenges, drugs that 
stimulate serotonergic activity, and indirect measures of sero- 
tonin response in normals have yielded mixed results includ- 
ing both positive (Gerra et al., 2000; Hansenne & Ansseau, 
1999), nonsignificant (Ruegg et al., 1997), and a negative re- 
lationship (Mannuck et al., 1998) with N. The first three of 
these studies used the HA scale, whereas the last used the 
N scale, but with a much larger number of normal subjects 
than in the other studies. Serotonin seems to be implicated in 
harm avoidance, but the nature of that relationship is open 
to question. As with other neurotransmitters, the personality- 
relevant aspects of serotonin may have more to do with recep- 
tor number and sensitivity than with basal levels of transmitter 
activity. 


Hormones 


Daitzman and Zuckerman (1980) found that testosterone (T) 
in males correlated negatively with various MMPI indexes of 
anxiety, depression, and neuroticism; that is, subjects with 
neurotic tendencies were low on T. Dabbs, Hopper, and 
Jurkovic (1990) reported a significant negative correlation 
between T and N in one study, but this was not replicated in 
another larger study of males; and in an even larger study of 
over 5,000 veterans T was not correlated with any MMPI in- 
dexes of trait anxiety or N. In still another study Dabbs et al. 
report significant negative correlations between T and a mea- 
sure of pessimism in both males and females. T reflects both 
trait and state characteristics; that is, it is affected by immedi- 
ate stressful experiences, particularly those involving success 
or defeat in competitive activities (Dabbs, 2000). The rela- 
tionship with pessimism may reflect a history of defeat and 
consequent expectations for future failures. This depressive 
attitude may underlie negative relationships with N if any 
such relationships do exist. 

Cortisol is one of the end products of activation of the 
hypothalamic-pituitary adrenocortical (HYPAC) system, a 
stress-reactive hormonal system. Like T, cortisol reactivity 
has both trait and state characteristics. Elevated cortisol is 
associated with major depressive disorder as a trait but is 


Psychoticism/Impulsivity/Sensation Seeking/Conscientiousness/Constraint 97 


found in anxiety disorders only when activated by an imme- 
diate stressor. 


Molecular Genetics 


Lesch, Bengal, Hells, and Sabol (1996) found an association 
between a serotonin transporter gene (5-HTTLPR) and the trait 
of neuroticism, as assessed by three different scales including 
the NEO N scale and Cloninger’s TCI harm avoidance scale. 
Individuals with either one or two copies of the short form had 
higher N scores than individuals homozygous for the long vari- 
ant of the gene. The association was limited to the N factor 
of the NEO and the harm avoidant factor of the TCI; none of 
the other factors in these test was associated with the genetic 
variant. However, in a second study by this group (Hamer, 
Greenberg, Sabol, & Murphy, 1999) the association of the gene 
with harm avoidance was weaker, and associations were found 
with TCI traits of cooperativeness and self-directiveness. 

Several other studies have not been able to replicate the re- 
lationship between the gene variants and N or harm avoidance. 
This is acommon outcome in the hunt for specific genes asso- 
ciated with personality traits or types of psychopathology, 
even when studies have adequate power and use good method- 
ology. Population differences may account for some of these 
failures. Even in the studies that are significant the particular 
gene accounts only for a small portion of the genetic variance. 
In the Lesch et al. study the 5-HTT polymorphism accounted 
for 3% to 4% of the total variance for the trait and 7% to 9% of 
the genetic variance, and 10 to 15 more genes were estimated 
to be involved. If there is any replication of a gene-trait associ- 
ation, that finding should not be immediately dismissed by 
subsequent failures of replication, particularly if the finding 
has a theoretical basis. In this case Cloninger’s theory has sug- 
gested the involvement of serotonin in harm avoidance. 

The short form of the gene, which is associated with high 
neuroticism, reduces serotonin uptake and therefore in- 
creases serotonergic transmission. Reduced uptake has been 
associated with anxiety in animal and human models, but 
paradoxically the serotonin uptake inhibitors are therapeutic 
agents in depressive disorders and several forms of anxiety 
disorders. These drugs could achieve their results through the 
inhibitory effects of serotonin on other systems such as the 
noradrenergic ones. 


Summary 


A sudden intense surge in anxiety is characterized by 
arousal of the sympathetic branch of the autonomic nervous 
system as expressed in elevated heart and breathing rates, 
blood pressure, sweating, and other signs of activation 
of this system. This led to the expectation that N or trait 


anxiety would be related to measures of these indicators 
either in the basal state or in reaction to stress. Research has 
generally failed to support this correlational hypothesis. 
EEG and brain scan studies also fail to reveal a difference in 
arousal levels as a trait distinguishing high- and low-N indi- 
viduals. However, PET scan studies, done primarily on pa- 
tients with anxiety disorders in reaction to fearful stimuli, 
show heightened reactivity of frontal, insular, and temporal 
cortex and anterior cingulate to such stimuli. Evidence from 
studies of animals has implicated the amygdala as a center 
for organization of the fear response, but brain imaging 
studies in humans have not yet supplied evidence for this 
localization. 

Much of the research on other species identifies activation 
of the dorsal ascending noradrenergic system originating in 
the locus coeruleus as an alarm system activating the entire 
cortex in states of fear or anxiety. Reactivity of this system is 
a characteristic of panic disorders during panic attacks com- 
pared to the reactions of other types of anxiety disorders and 
normal controls. Correlational studies of norepinephrine 
metabolites and N-type trait measures in the basal state have 
not found a relationship, but at least one study has found a re- 
lationship between N and reactivity of a norepinephrine 
metabolite and response to stress. A hypothesized relation- 
ship with the monoamine serotonin has also shown no rela- 
tionship with N in the basal state and no consistent findings 
relating N to reactions to drugs that stimulate serotonergic ac- 
tivity. Initial findings of a relationship between a serotonin 
transporter gene and N-type scales have not been replicated. 
Hormones like testosterone and cortisol show similar nega- 
tive findings in the basal state and few findings relating N to 
reactivity to stress. 

The research attempting to find a biological basis for N has 
had a disappointing outcome, particularly in view of the posi- 
tive results in experimental research with animals and with 
humans that suffer from anxiety and mood disorders. Longi- 
tudal research has shown that N is a personality precursor of 
these disorders, so why does N not show relationships with 
some of the same biological indicators that characterize the 
disorders? There may be a kind of threshold effect so that the 
dysregulation of neurotransmitter systems characteristic of 
the disorders only emerges at some critical level of persistent 
stress that is not reproducible in controlled laboratory studies. 


PSYCHOTICISM/IMPULSIVITY/SENSATION 
SEEKING/CONSCIENTIOUSNESS/CONSTRAINT 


The third major personality factor goes under a variety of 
names depending on the various trait classification systems. 
Our factor analyses of personality scales have shown that 
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Eysenck’s psychoticism scale is one of the best markers for 
the dimension that consists of scales for impulsivity and 
sensation seeking at one pole and scales for socialization, 
responsibility, and restraint at the other pole (Zuckerman 
et al., 1988, 1991, 1993). In a three-factor solution this 
factor also includes aggression and capacity to inhibit 
aggression, but in a four- or five-factor solution aggression 
and hostility versus agreeableness form a separate factor 
(Zuckerman et al., 1993). This chapter is organized by the 
four-factor model. 


Cortical Arousal and Arousability 


At the time the original studies were done relating condition- 
ing to arousal and the construct “strength of the nervous sys- 
tem” to extraversion, E was measured by scales with two 
components: E and Impulsivity (Imp). In a theoretical shift, 
not receiving much attention, Eysenck and Eysenck (1985) re- 
assigned Imp to the P rather than the E dimension. Although 
nearly all the earlier arousal and conditioning studies focused 
on E, it was shown that the relationship of E to conditionabil- 
ity (introverts more conditionable than extraverts) depended 
on the Imp component of E rather than the sociability compo- 
nent (Barratt, 1971; Eysenck & Levey, 1972). A later study 
showed that classical eyelid conditoning was related most 
closely to a specific type of Imp, the tendency to act quickly on 
impulse without thinking or planning. This is the type of Imp, 
called narrow impulsivity IMPn), that constitutes a subscale 
of the older E scale. It is also the type of Imp that has been 
combined with sensation seeking in the latest ImpSS scale. 
Conditionability is thought to be a function of arousal; the 
more aroused a person is, the more conditionable he or she is 
thought to be. Could this mean that cortical arousal is related 
to the third dimension (P), including sensation seeking and 
IMPn, rather than the first (E) dimension of personality? A 
PET study found negative correlations between P and glucose 
use in cortex and in thalamic and cingulate areas of the limbic 
system (Haier et al., 1987). Low cortical and autonomic 
arousal is a characteristic of the psychopathic (antisocial) per- 
sonality, which may represent an extreme manifestation of the 
P dimension of personality (Zuckerman, 1989). 

Evidence for a relationship between cortical arousal 
(EEG) and P and IMPn was found by some investigators 
(Goldring & Richards, 1985; O’Gorman & Lloyd, 1987); 
high P and impulsive subjects were underaroused. Sensation 
seeking, however, was not related to tonic arousal. Instead, 
sensation seeking—particularly that of the disinhibitiory 
type—has been consistently related to a particular measure of 
cortical arousability called augmenting-reducing (A-R, 
Buchsbaum, 1971). 
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Figure 4.3 Mean visual evoked potential amplitudes (P1-N1) at five levels 
of light intensity for low and high scorers on the disinhibition subscale of the 
Sensation Seeking Scale. 

From “Sensation seeking and cortical augmenting-reducing,” by M. 
Zuckerman, T. T. Murtaugh, and J. Siegel, 1974, Psychophysiology, 11, 
p. 539. Copyright 1974 by the Society for Psychophysiological Research. 
Reprinted by permission. 


A-R asseses the relationship of cortical reactivity, mea- 
sured as a function of the relationship between the cortical EP 
and stimulus intensity for any given individual. A strong pos- 
itive relationship between the amplitude of the EP and the in- 
tensity of stimuli is called augmenting, and a negative or zero 
relationship is called reducing. A-R differences are most 
often observed at the highest intensities of stimulation, where 
the reducers show a marked EP reduction and the augmenters 
continue to show increased EP amplitude. There is an obvi- 
ous relevance of this measure to Pavlov’s (1927/1960) con- 
struct of “strength of the nervous system,” based on the 
nervous system’s capacity to respond to high intensities of 
stimulation without showing transmarginal inhibition. 

Figure 4.3 shows the first study of the relationship be- 
tween the Disinhibition (Dis) subscale of the SSS and ampli- 
tude of the visual EP. Those scoring high on Dis displayed an 
augmenting pattern, and those scoring low on this scale 
showed a strong reducing pattern, particularly at the highest 
intensity of stimulation. This study was followed by many 
others, some using visual and others using auditory stimuli. 
Replications were frequent, particularly for the auditory EP 
(Zuckerman, 1990, 1991). Replications continue to appear 
(Brocke, Beauducel, John, Debener, & Heilemann, 2000; 
Stenberg, Rosen, & Risberg, 1990). A-R has also been found 
to be related to Imp, particularly cognitive impulsiveness 
(Barratt, Pritchard, Faulk, & Brandt, 1987). 

The A-R model has been extended to other species and 
used as a biological marker for behavioral traits in animals 
resembling those in high and low human sensation seekers 
and impulsive and constrained persons. Cats who showed the 
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augmenting pattern were active, exploratory, and approached 
rather than withdrew from novel stimuli. Augmenting cats 
adapted easily to novel situations, were responsive to a sim- 
ple reward task, but were poor at learning to inhibit responses 
where they were only reinforced for low rates of response 
(Hall, Rappaport, Hopkins, Griffin, & Silverman, 1970; 
Lukas & Siegel, 1977; Saxton, Siegel, & Lukas, 1987). 

Siegel extended this paradigm to a study of two geneti- 
cally selected strains of rats, one actively avoidant or more 
aggressive and the other passive and frozen in reaction to 
shock (Siegel, Sisson, & Driscoll, 1993). The first strain 
consistently showed the augmenting EP pattern, and the sec- 
ond showed the reducing. Other behavioral characteristics of 
these strains were consistent with the human model of im- 
pulsive sensation seeking: The augmenting strain was ag- 
gressive, more willing to ingest alcohol, had high tolerance 
for barbituates, and self-administered higher intensities of 
electrical stimulation in reward areas of the limbic brain than 
the reducing strain. 

Biochemical reactions suggested the basis for behavioral 
differences in characteristics of stress-reactive neurotransmit- 
ter and hormonal responses. Under stress, the augmenting 
strain showed more dopaminergic activity in the prefrontal 
cortex of brain, whereas the reducers had a stronger reaction in 
the hypothalamic-pituitary-adrenal cortex (HYPAC) stress 
pathway including increased serotonergic activity and corti- 
cotropin releasing factor in the hypothalamus and adrenocor- 
ticotropic hormone in the pituitary gland. Dopamine is a 
neurotransmitter implicated in action tendencies and theo- 
rized to be the basis of novelty and sensation seeking. 
Dopamine release would explain the active avoidance patterns 
that were the basis for selecting the two strains. Conversely, 
serotonin activity is associated with behavioral inhibition. 


Monoamines 


The animal model described earlier suggests that sensation 
seeking and related traits in humans may be associated posi- 
tively with dopaminergic and negatively with serotonergic 
reactivity. Indirect evidence of this association comes from 
patients with Parkinson’s disease (PD), in which dopamine is 
depleted 75% in ventral tegmental neurons. A study of per- 
sonality of PD patients showed that the PD patients were sig- 
nificantly lower on novelty seeking than controls but did not 
differ from them on harm avoidance or reward dependence 
(Menza, Golbe, Cody, & Forman, 1993). The PD patients 
were more depressed than controls, but depression did not 
correlate with novelty seeking scores. 

Simple correlations between sensation seeking and 
dopamine and serotonin metabolites (HVA and 5-HIAA) 


assayed from CSF reveal no correlations between these 
metabolites and sensation seeking or the P scale or impulsiv- 
ity scales (Ballenger et al., 1983; Limson et al., 1991). How- 
ever, the correlational study by Ballenger et al. found a 
significant negative correlation between norepinephrine in 
the CSF and sensation seeking. A significant correlation was 
found between P and dopamine D2 binding in left and right 
basal ganglia in a PET study of a small group of normal sub- 
jects (Gray, Pickering, & Gray, 1994). 

An experimental study by Netter, Hennig, and Roed 
(1996) used drugs that stimulate (agonist) or inhibit (antago- 
nist) activity in the serotonergic and dopaminergic systems 
and measured their effects on hormonal, emotional-state, and 
behavioral reactions. Their findings suggested a low respon- 
sivity of the serotonergic system in high sensation seekers, 
but no association of dopaminergic response to an agonist and 
sensation seeking. However, craving for nicotine was in- 
creased by a dopamine agonist in high sensation seekers, sug- 
gesting that dopamine stimulation may induce more approach 
behavior in high than in low sensation seekers. Experiments 
in which nicotine or amphetamine is given to participants 
high or low in sensation seeking or novelty seeking showed 
that the high sensation/novelty seekers had more intense 
“highs” or subjective effects in response to these drugs than 
did low sensation seekers (Hutchison, Wood, & Swift, 1999; 
Perkins, Wilson, Gerlach, Broge, & Grobe, 2000). The effect 
for nicotine was most intense for nonsmokers, and the study 
on amphetamine did not use persons with a drug history. 
These special reactions of high sensation/novelty seekers to 
the novel drugs suggests some sensitivity to these dopamine 
agonists, perhaps in the receptors. 

Another study by the German group found that the disin- 
hibition type of sensation seeking and impulsivity, as well as 
aggression, were correlated with a response to a serotonin 
antagonist indicating low serotonergic responsivity in impul- 
sive sensation seekers (Hennig et al., 1998). 


Monoamine Oxidase 


Fairly consistent negative relationships have been found be- 
tween sensation seeking and MAO. Asurvey of results in 1994 
showed low but significant negative correlations between 
platelet MAO and sensation seeking trait in 9 of 13 groups, 
and in 11 of 13 groups the correlations were negative in sign. 
The gender and age differences in sensation seeking are con- 
sistent with the gender and age differences in MAO described 
previously. Low MAO levels are characteristic of disorders 
characterized by impulsive, antisocial behavior including an- 
tisocial and borderline personality disorders, alcoholism and 
heavy drug abuse, pathological gambling disorder, bipolar 
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disorder, and attention deficit and hyperactivity disorder in 
children. MAO is low even in children of alcoholics and bipo- 
lar disorders who have not yet manifested the disorders, sug- 
gesting that it is a genetic risk marker for these disorders. 

In a general normal population, low MAO was associated 
with use of tobacco, alcohol, and illegal drugs, convictions for 
crimes other than traffic offenses, and sociability in terms of 
hours spent with friends (Coursey et al., 1979). A study of low- 
MAO monkeys living in a natural environment showed that 
they were more aggressive, dominant, sexually active, and so- 
ciable than were high-MAO monkeys (Redmond, Murphy, & 
Baulu, 1979). Monkeys with high MAO levels were social iso- 
lates and passive. This study of another species suggests the 
evolutionary advantage of sensation seeking as mediated by 
MAO and possibly dopaminergic systems in the brain. Low 
MAO, however, is also associated with impulsivity in labora- 
tory tests (Klinteberg et al., 1991), as is sensation seeking 
(Breen & Zuckerman, 1999; Thornquist & Zuckerman, 1995), 
and impulsivity in risky situations could be a disadvantaged 
trait that may lead to premature death. However, the advantage 
in securing and dominating mates by intimidation of rivals 
may have outweighed the evolutionary disadvantages of reck- 
less behavior. 

In the public mind testosterone is identified with sexual 
drive and aggressiveness. However research shows that 
testosterone (T) is associated with a broader range of traits 
than these two. Androgens and T assayed from blood are cor- 
related with sensation seeking (Daitzman & Zuckerman, 
1980; Daitzman, Zuckerman, Sammelwitz, & Ganjam, 
1978). Dabbs (2000) and Bogaert and Fisher (1995), using T 
from saliva, found only nonsignificant tendencies in that di- 
rection. A comparison of hypogonadal (low-T) and normal-T 
men, all referred for complaints of erectile dysfunction, 
showed that the low T-men were lower on sensation seeking 
than were the normal-T men (O’ Carroll, 1984). 


Hormones 


Testosterone and sensation seeking in young males are both 
correlated with their sexual experience, in terms of the num- 
ber of sex partners they have had (Bogaert & Fisher, 1995; 
Dabbs, 2000; Daitzman & Zuckerman 1980). Testosterone 
levels affect sexuality in women as well as men. Androgen 
levels of married women were related to sexual responsivity, 
frequency of intercourse, and sexual gratification (Persky 
et al., 1982). As with MAO, we can see the evolutionary 
advantage of the behavioral trait based on its biochemical 
substrate. However, other correlates of T include sociability, 
dominance, and activity, as well as inverse relationships to 
socialization and self-control. 


The high-T male tends to be assertive, impulsive, and 
low in self-control, as well as high in sensation seeking. 
There is much less work on T in women, but what data there 
are suggest the same kind of personality correlates as in 
men. Apart from aggression, high-T men were more likely 
than others to misbehave in school as children, get into legal 
difficulties as adults, use drugs and alcohol, and go AWOL 
(absent without leave) while in the army (Dabbs, 2000). Fra- 
ternities with high average T levels were generally disor- 
derly and chaotic, and their members were described by an 
observer as “crude and rude.” The high-T fraternities had 
more parties, worse grades, and fewer community service 
activities. Dabbs (2000) suggested that the total effect 
among members is an outcome of an interaction between T 
levels of its members and reinforcement of each other for 
antisocial behavior. In this case, high T is clearly a predis- 
posing factor for low socialization, which these authors 
describe as “rambunctiousness.” 

Testosterone levels reflect both trait and state moods. Al- 
though reliability can be found in T levels taken at the same 
time of day in the same setting, T levels can also be affected 
by experiences in competition (Dabbs, 2000). Competitors 
show increases in T when victorious and decreases when de- 
feated. Even sports spectators show increases in T when their 
team wins and decrease when their team loses. 

High levels of cortisol are associated with prolonged 
stress and depression. Ballenger et al. (1983) found that low 
levels of CSF cortisol were associated with a P dimension 
factor that included the P scale, the disinhibition subscale of 
the Sensation Seeking Scale (SSS), the MMPI hypomania 
scale, and lifetime number of sexual partners. Low levels of 
cortisol have been found in prisoners who have a history of 
psychopathic and violent behavior (Virkkunen, 1985). Low 
cortisol was also associated with novelty seeking in veterans 
with posttraumatic stress disorder (Wang, Mason, Charney, 
& Yehuda, 1997). Low cortisol may indicate a low reactivity 
to stress, which can be an advantage in some situations but 
carries the dangers inherent in lack of control and impulsiv- 
ity. Traits that may have been adaptive in the warrior societies 
of the past may now confer a disadvantage in more socialized 
civilizations. 


Genetics 


Twin studies have found relatively high heritabilities (58%) 
for sensation seeking whether based on twins raised together 
(Fulker, Eysenck, & Zuckerman, 1980) or on twins separated 
shortly after birth and raised in different families (Hur & 
Bouchard, 1997). Heritability for Cloninger’s NS scale is 
somewhat lower (40%) but typical of that found for other 


Psychoticism/Impulsivity/Sensation Seeking/Conscientiousness/Constraint 101 


personality traits (Heath, Cloninger, & Martin, 1994), but 
that for impulsivity is lower (15-40%) albeit significant 
(Eysenck, 1983). 

Ebstein et al. (1996) were the first to report an association 
between the trait of novelty seeking and the gene for the D4 
dopamine receptor (D4DR). The longer, usually the 7 repeat 
form of the 48 base pair sequence, was associated with high 
scores on Cloninger’s NS scale in an Israeli population. An 
immediate replication was reported by Benjamin et al. (1996) 
in an American population using scales from the NEO that 
approximate the NS factor such as Excitement Seeking and 
Deliberation (vs. Impulsiveness). Within a year Ebstein and 
Belmaker (1997) summarized the rapidly growing literature 
reporting two more replications and three failures to repli- 
cate. Since then two more failures to replicate have been 
reported, one in a Swedish population (Jönsson et al., 1998) 
and the other in a New Zealand sample (Sullivan et al., 1998). 
One partial replication was reported in Finland (Ekelund, 
Lichtermann, Jaervelin, & Peltonen, 1999). The variations in 
populations among the studies may have something to do 
with the inconsistent results. The distribution of alleles dif- 
fers among populations. For instance, in a Japanese popula- 
tion the 7 repeat allele was not found but a comparison of the 
longer (5 and 6 repeats) with the shorter (2 to 4 repeats) still 
showed the former to be more characteristic of high novelty 
seekers (Ono et al., 1997). 

As with MAO, the association between sensation or nov- 
elty seeking and this genetic marker is given some credence 
by its association with behavioral traits or disorders charac- 
terized by impulsivity and sensation seeking. The longer 
form of the D4DR has been found in high proportions of 
opiate abusers (Kotler et al., 1997), persons with pathological 
gambling disorder (Castro, Ibanez, Torres, Säiz-Ruiz, & 
Fernändez-Piqueras, 1997), those with attention-deficit- 
hyperactivity disorder (Swanson et al., 1998), and infants 
showing less distress in reaction to novel stimuli (Auerbach 
et al., 1998). 

A comparative study was done on the effects of knocking 
out the DAR gene in mice on tests of approach-avoidance in 
reaction to novel objects or situations (Dulawa, Grandy, Low, 
Paulus, & Geyer, 1999). The D4 knockout mice showed re- 
ductions in behavioral response to novelty or a decrease in 
novelty related exploration in comparison to D4 intact mice. 

Despite some failures of direct replication the association 
between novelty seeking and the D4DR receptor gene is 
given credence by these extensions to psychopathology and 
behavior in humans and mice. The D4DR association ac- 
counts for only about 10% of the genetic variation in the 
human trait, so other genes are certainly involved. The search 
is on for such genes. A crucial question is the functional sig- 


nificance of the difference between the alleles associated with 
high or low sensation seeking. The D4DR gene is expressed 
mainly in the limbic brain regions associated with emotional 
and motivational characteristics of sensation seeking. 
Dopaminergic activity is certainly involved, as has been pos- 
tulated. But the significance of the D4DR gene in this activity 
is far from certain. An interesting finding is that the density of 
D4 receptors is elevated in brains of schizophrenics and that 
this receptor is the primary target for the antipsychotic drug 
clozapine (Seeman, 1995). 


Summary 


The underarousal hypothesis related to E has been more suc- 
cessfully applied to this third dimension of personality. Both 
EEG and brain imaging studies have found some preliminary 
evidence of cortical underarousal related to the P dimension 
and impulsivity. Sensation seeking and impulsivity have 
been related to the characteristic cortical response to a range 
of intensities of stimulation. Disinhibited and impulsive per- 
sons show an augmentation of cortical response at high 
intensities of stimulation relative to low intensities, whereas 
inhibited and constrained individuals show a reducing 
pattern, particularly at high intensities. This augmenting- 
reducing paradigm of cortical reactivity has been extended to 
cats and rats, where it is associated with similar kinds of 
behavioral reactions and with other kinds of biological reac- 
tivity postulated to be the basis of the behavioral traits in 
humans. 

The clinical model for this dimension of personality lies in 
the psychopathic or antisocial personality disorder. One of 
the characteristics of this disorder is a lack of emotional reac- 
tivity to stimuli associated with punishment and therefore a 
deficit in learning to avoid reacting to such stimuli. This leads 
to seeking of high-intensity rewarding stimuli regardless of 
the risk involved. It is not surprising that psychopaths are all 
high impulsive sensation seekers and share some of the same 
biological traits with nonpsychopathic sensation seekers such 
as low levels of the monoamine oxidase enzyme and high 
levels of testosterone. 

One psychopharmacological theory of the P dimension is 
that it is based on high dopaminergic reactivity and low sero- 
tonergic and noradrenergic reactivity to highly stimulating 
situations. The low serotonergic reactivity is particularly re- 
lated to the lack of restraint or behavioral inhibition and the 
low noradrenergic reactivity to the lack of arousal character- 
istic of high P, impulsive, and sensation seeking individuals. 
There is some evidence from studies of humans of a weaker 
response to serotonin stimulants in high sensation seekers 
than in low sensation seekers. There is no demonstrated 
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relationship between the P dimension and dopaminergic reac- 
tivity although animal and clinical research would support 
such a relationship. 

High levels of testosterone and low levels of cortisol have 
been associated with disinhibition and psychopathic traits. 
But high levels of testosterone have also been associated with 
sociability and low levels with neuroticism, as discussed in 
previous sections. There is no necessary one-to-one relation- 
ship between biological and personality traits. Neurotrans- 
mitters like dopamine and hormones like testosterone may be 
related to two or more of the basic dimensions of personality 
or to a higher order dimensions like approach or inhibition. 

Personality in the third dimension shows a high degree of 
heritability compared to other major dimensions. A specific 
gene, the dopamine receptor D4, has been associated with 
the trait of novelty seeking, although replication has been 
spotty. The association is supported by animal and clinical 
studies. Disorders characterized by impulsivity like opiate 
abuse, pathological gambling disorder, and attention-deficit- 
hyperactivity disorder share the same form of the gene as 
found in high novelty seekers. Mice with the gene removed 
show decreases in exploration and responses to novel situa- 
tions. The third dimension of personality has been a rich lode 
of biological findings from the psychophysiological down to 
the most basic genetic level. 


AGGRESSION/HOSTILITY/ANGER/ 
AGREEABLENESS 


Problems of definition confuse the fourth dimension of per- 
sonality. Aggression refers to behavior, hostility to attitude, 
and anger to emotion. One can be aggressive without being 
hostile or angry, as in certain kinds of competition; or one can 
be chronically hostile and angry without expressing the neg- 
ative attitude and feelings in overt aggression. One may be 
disagreeable without being aggressive or being aware of hos- 
tile attitudes or anger. Hostility without aggression is more 
closely associated with the N factor whereas aggression, with 
or without hostility, is more closely associated with this 
fourth factor. 

Another source of differences is in the way aggression is 
expressed. Aggression in other species is classified by the 
source or context of the aggression: predatory, intermale, fear- 
induced, maternal, sex-related, instrumental, territorial, or 
merely irritable (Volavka, 1995). Human aggression is more 
often characterized by the form of expression. For instance, 
the widely used Buss-Durkee (1957) Hostility Scale (BDHS) 
classifies aggression as assault, indirect hostility, verbal 
hostility, irritability, negativism, resentment, and suspicion. 


A new form of the scale has reduced the number of subscales 
to four, using factor analyses: physical aggression, verbal ag- 
gression, anger, and hostility (Buss & Perry, 1992). Although 
the subscales are moderately intercorrelated, quite different 
results have been found for the different subscales of the test 
in biological studies. Another important distinction in the lit- 
erature is whether aggression is impulsive. The impulsive 
type of aggression seems more biologically rooted than in- 
strumental types of aggression, but this confounds two differ- 
ent dimensions of personality. 

Although aggression and hostility are correlated in tests 
and life, they are separated in two of the major trait classifi- 
cation systems. Eysenck’s system includes negative feelings 
like anger (moodiness) in neuroticism, but aggression and 
hostility are at the core of the psychoticism dimension. Costa 
and McCrae (1992a) have angry-hostility as a facet of neu- 
roticism but regard aggression as the obverse of agreeable- 
ness. My colleagues and I found that hostility and anger load 
more highly on N and aggression on P in a three-factor 
model, but all three correlate with a common factor in a five- 
factor analysis (Zuckerman et al., 1991). 

Aggression has been defined by several methods, includ- 
ing self-report ratings or questionnaires, observer or ratings 
by others, and life-history variables like membership in 
groups characterized by violent acts or crimes. Aggression is 
not a socially desirable trait and this may limit the usefulness 
of self-report methods in some settings. Laboratory observa- 
tions may be too specific to the experimental conditions. 
Persons who committed a violent crime, like murder, may 
differ depending on how characteristic their violent behavior 
was before they committed the crime. All methods have 
methodological problems, but in spite of this there are certain 
consistencies in results across methods in the literature. 


Cortical Arousal and Arousability 


Early studies of the EEG in abnormal populations, like vio- 
lent criminals, used crude qualitative judgments of the EEG 
records as “abnormal” or “normal” (Volavka, 1995). EEG ab- 
normalities included diffused or focal slowing, spiking or 
sharp waves in certain areas, and generalized paroxysmal 
features resembling minor epileptic seizures. The incidence 
of abnormal records found in samples of prisoners convicted 
of homicides and habitually violent prisoners was quite high 
(50-65%) compared to nonviolent prisoners or normal con- 
trols (about 5-10%; Volavka, 1995). However, some other 
studies found no differences between violent and nonviolent 
offenders. 

Studies using quantitative methods showed EEG slowing 
in offenders, including slowing of alpha activity and an 


excess of slow wave (theta) activity. Volavka (1995) pointed 
out that these results could be due to a variety of factors in- 
cluding developmental retardation, brain injuries, decreased 
arousal level, cortical disinhibition, or genetic factors. Actu- 
ally, twin research suggests that most of the activity in spec- 
trum parameters of the EEG is genetically determined 
(Lykken, 1982). 

One limitation of most of these earlier studies was that 
only prisoners referred for neuropsychiatric evaluation were 
used. A study by Wong, Lumsden, Fenton, and Fenwick 
(1994) selected subjects from a population of prisoners who 
had all been rated for violent behavior, and 70% had received 
EEG assessment. The prisoners were divided into three 
groups based on their history of violence. Going from the 
lowest to the highest violent groups, the percentages of ab- 
normal EEG’s were, respectively, 26%, 24%, and 43%. The 
most frequent EEG characteristics differentiating the most 
violent from the less violent groups was focalized EEG ab- 
normalities, particularly in the temporal lobes. Twenty per- 
cent of the most violent patients showed abnormal temporal 
lobe readings compared to 2% to 3% in the other two groups. 
Computerized tomography (CT) scans confirmed the high in- 
cidence of temporal lobe abnormalities in the most violent 
group. 

The cortical EP has also been used to study cortical arous- 
ability. A study comparing detoxified alcoholic patients with 
and without histories of aggression found lower amplitudes 
of the P300 in the aggressive group (Branchey, Buydens- 
Branchey, & Lieber, 1988). Aggressive alcoholics often have 
other characteristics, such as impulsivity and alcoholism, 
which might have produced the weaker P300 signal. Another 
study found that impulsive aggressive subjects screened from 
a college student population also showed lower P300 ampli- 
tudes at frontal electrode sites (Gerstle, Mathias, & Stanford, 
1998). Still another study showed that a drug that reduced 
frequency of aggressive acts among prisoners with a history 
of impulsive aggression also increased the amplitude of the 
P300 in this group (Barratt, Stanford, Felthous, & Kent, 
1997). This effect of the drug was not found in a group of 
prisoners who had committed premeditated murders. A re- 
duced P300, particularly in the frontal lobes, may be sympto- 
matic of a weakened inhibition from the frontal lobes and 
may account for the impulsive aspect of the aggression. 

Visual imaging methods have been used in the study of vi- 
olent behavior. Two structural methods are computed tomog- 
raphy (CT) and magnetic resonance imaging (MRI). MRI 
yields better images for precise assessment of brain structure. 
PET is used to assess brain activity in specific areas of brain 
including regions not accessible by ordinary EEG methods. 
Mills and Raine (1994) reviewed 15 studies of structural 
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brain imaging (MRI, CT) and 5 studies using PET and re- 
gional CBF. Subject groups were violent prisoners, convicted 
murderers, pedophiles, incest offenders, property offenders, 
and, in some studies, normal controls. Property offenders 
were regarded as controls for violent offenders. Sexual of- 
fenders were not necessarily violent. Nine of the 15 studies 
using CT or MRI showed some type of structural abnormal- 
ity, about evenly divided between frontal and temporal or 
frontotemporal deficits. Frontal abnormalities characterized 
the violent offenders and frontotemporal the sexual offend- 
ers, according to the authors of the review. However, most 
studies used small samples. The two studies of violent offend- 
ers using large samples (Ns of 128 and 148) found no particu- 
lar localization of abnormalities (Elliot, 1982; Merikangus, 
1981). The only study using MRI with any kind of N (another 
had only 2 cases) found evidence of temporal lobe lesions in 
5 of 14 violent patients (Tonkonogy, 1990). The large study by 
Wong et al. (1994), not included in the review, found that 55% 
of the most violent group had abnormal CT findings, and 75% 
of these were temporal lobe findings. Contrary to the hypoth- 
esis of Mills and Raine, temporal lobe lesions alone seem to 
be characteristic of violent patients. More MRI studies are 
needed to clarify the issue of localization. 

The temporal lobe overlays the amygdala and has connec- 
tions with it. Animal lesion and stimulation studies have 
found sites in the amygdala that inhibit and others that trigger 
aggression. Total amygdalectomies in monkeys produce a 
drop in the dominance hierarchy and an inability to defend 
against aggression from other monkeys. The comparative 
data suggest loci for aggression in the amygdala. 

Mills and Raine reviewed five PET studies, but of these 
only one had a near-adequate number of subjects (3 had less 
than 10) and another compared child molesters with controls. 
The one study remaining compared 22 murderers with 22 
normals and found selective prefrontal dysfunction in the 
group of murderers (Raine et al., 1994). Temporal lobe dam- 
age and functional hypofrontality are not unique to violent 
offenders but are also found in patients with schizophrenia. 


Cardiovascular Arousal and Arousability 


Numerous studies show that persons who score high on 
hostility scales show greater anger and cardiovascular 
arousal, especially blood pressure, in response to stress or 
perceived attack than do low hostile persons. As an example, 
a recent study found that among participants who were delib- 
erately harassed in an experiment, the high hostile group who 
was harassed showed enhanced and prolonged blood pres- 
sure, heart rate, forearm blood flow and vascular resistance, 
and increased norepinephrine, testosterone, and cortisol 
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responses than did low hostile subjects who were harassed 
(Suarez, Kuhn, Schanberg, Williams, & Zimmermann, 
1998). This kind of cardiovascular reactivity may occur in 
frequent situations like stressful marital interactions (Smith 
& Gallo, 1999), and general day-to-day living (Räikkönen, 
Matthews, Flory, & Owens, 1999), and thus put a strain on 
the cardiovascular system that can result in cardiovascular 
disease, including hypertension (Lawler et al., 1998; Miller, 
Dolgoy, Friese, & Sita, 1998) and isochemic heart disease 
(IHD; Gallagher, Yarnell, Sweetnam, Elwood, & Stansfied, 
1999). Persons with a family history of hypertension exhibit 
the same pattern of hostility and anger arousal with elevated 
blood pressure as do those who have developed the disorder 
suggesting that there may be a genetically influenced source 
to the cardiovascular overreactivity associated with anger 
arousability. However, how the anger is dealt with is a factor 
in vulnerability to heart disease. In a prospective study of 
nearly 3,000 men in their 50s and 60s Gallager et al. (1999) 
found that suppressed anger was most predictive of the inci- 
dence of IHD even when other risk factors were statistically 
controlled. 


Monoamines 


Asberg’s (1994) review of the role of the monoamine neuro- 
transmitters in human aggressiveness and violence attributes 
a primary importance to the role of serotonin. In animals 
serotonin is associated with inhibition of aggressive behavior 
and lowered serotonin with disinhibition of such behavior 
(Soubrié, 1986). In humans low levels of the serotonin 
metabolite, 5-HIAA, have been consistently found in those 
who attempt or complete suicide using violent means and in 
violent criminal offenders, particularly those characterized 
by impulsive violence or murder (Asberg, 1994). Personality 
disorders like antisocial and borderline disorder have a high 
incidence of aggressive behaviors and suicide attempts. 
Homicide and suicide are not antithetical; homicide offend- 
ers have increased suicide rates. Within a group with person- 
ality disorders a negative correlation was found between CSF 
5-HIAA and lifetime aggressive behavior (Brown et al., 
1982). 

Hormonal responses to serotonergic agonists and antago- 
nists have also been used to assess the reactivity of the sys- 
tem in relation to aggression. They generally support the 
hypothesis of an inverse relationship between serotonin func- 
tion and aggression/hostility (Cleare & Bond, 1997; Coccaro, 
Kavoussi, Sheline, Berman, & Csernansky, 1997; Moss, Yao, 
& Panzak, 1990). 

Tryptophan is a precursor of serotonin in the brain (see 
Figure 4.2). Tryptophan depletion provides an experimental 


approach to the serotonin-aggression hypothesis, and unlike 
correlational studies it can provide evidence of a causal link 
with aggression. Studies have found that tryptophan deple- 
tion increases aggressive responses in laboratory behavioral 
tests (Cleare & Bond, 1995; Dougherty, Bjork, Marsh, & 
Moeller, 1999) as well as subjective feelings of anger, ag- 
gression, and hostile mood (Cleare & Bond, 1995; Finn, 
Young, Pihl, & Ervin, 1998), but the effect is limited to per- 
sons who are high in trait measures of hostility. The inference 
is that hostile persons, who are already low in serotonergic 
activity, tend to react aggressively with even more lowering 
of serotonin stores. There is the further suggestion that sero- 
tonin agonists or selective serotonin reuptake inhibitors 
(SSRIs) may reduce aggression in aggression-prone persons. 
A study by Knutson et al. (1998) showed that an SSRI re- 
duced focal indices of hostility through a general decrease in 
negative affect without altering positive affect. In addition, 
the SSRI increased agreeableness on a behavioral index and 
cooperativeness in a puzzle-solving task. 

SSRIs are used to treat depression, but can they change 
other emotions like anger-hostility? A study of SSRI therapy 
for depressed outpatients showed a significant decrease in 
anger-hostility as well as neuroticism (Bagby et al., 1999). 
The decrease in neuroticism, however, was correlated with 
the decrease in clinical depression severity, whereas the de- 
crease in anger-hostility was independent of the reduction of 
depression. 

NE mediates a primary arousal system in the brain begin- 
ning in the locus-coeruleus and extending through limbic 
structures to innervate all areas of cerebral cortex. As such it 
has been implicated in the arousal of anxiety as previously 
discussed. But anger is also associated with an arousal effect 
as shown by the cardiovascular reactivity in highly hostile 
and angry persons as previously discussed. A study of ag- 
gression in free-ranging monkeys found a negative correla- 
tion between aggressivity and CSF 5-HIAA, congruent with 
the serotonin-aggression hypothesis, but it also found an 
equally strong positive correlation between aggressivity and 
CSF MHPG, the NE metabolite. The Ballenger et al. (1983) 
study of humans (normals) found a very high positive corre- 
lation (.64) between plasma MHPG and the Assault scale 
from the BDHS. Use of a noradrenergic challenge revealed a 
correlation of noradrenergic reactivity and irritability and 
assault scales (Coccaro et al., 1991). 

On the other hand, low levels of the catecholamines epi- 
nephrine and NE, obtained from urine, are inversely related 
to aggressiveness (Magnusson, 1987). Psychopathic youths 
have low reactivity in these peripheral catecholamine sys- 
tems in stressful situations (Lidberg, Levander, Schalling, & 
Lidberg, 1978). The difference may lie between the central 


noradrenergic and the peripheral autonomic stress system. 
Another possibility is that there is a difference between the 
psychopathic type of aggression, which is often not accom- 
panied by high arousal, and the impulsive-angry type of 
aggression in which emotional disinhibition is typical. Netter, 
Hennig, and Rohrmann (1999) believed that they can 
distinguish the two types of aggressiveness on the basis of 
selective types of challenges to the monoamine systems. The 
serotonergic challenge was primarily correlated with 
Eysenck’s P scale, assessing the psychopathic type of aggres- 
sion, whereas another type of challenge more closely related 
to dopamine reactivity was related to the nonpsychopathic 
type of aggression. 

Increasing levels of brain dopamine in rats increases im- 
pulsive aggressive responding, but it takes a great deal of 
dopamine depletion to reduce aggressive behavior (Volavka, 
1995). Little research has been done on dopamine specifi- 
cally although the aggression producing effects of ampheta- 
mine may be a function of stimulation of dopaminergic as 
well as noradrenergic systems. A study of the neuroendocrine 
responses to glucose challenge in a group of substance 
abusers showed that those participants characterized by anti- 
social hostility had responses suggestive of increased 
dopaminergic activity (Fishbein, Dax, Lozovsky, & Jaffe, 
1992). 


Hormones 


The hypothesis of an influence of T on aggressive behavior 
has a prescientific origin in that the pacifying effects of cas- 
tration in animals were known and used for that purpose. 
Sexual competition among males is one form of aggression 
influenced by T, but other forms are also affected. Castration 
reduces aggression in males in most species, and T replace- 
ment reverses this effect. 

Studies of the relationships between T and hostility or ag- 
gression in humans have produced mixed results, but a meta- 
analysis of such studies found a moderate effect size of .40 
over all studies (Archer, Birring, & Wu, 1988). An earlier re- 
view by Archer (1991) suggested that results were more pos- 
itive in studies where behavioral assessments (usually in 
prisoners) were used and less powerful in studies of trait 
(self-report) hostility or aggression (usually in college stu- 
dent samples). The newer meta-analysis failed to support this 
hypothesis. Better results were obtained in studies using sali- 
vary T as opposed to T derived from blood. A study using 
salivary T in 306 students found T positively correlated with 
aggression and negatively with prosocial scales in both men 
and women (Harris, Rushton, Hampson, & Jackson, 1996), 
but in other studies using either blood (Archer et al., 1998) or 
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salivary T (Campbell, Muncer, & Odber, 1997) in large sam- 
ples of male students no relationship was found. In still an- 
other study of blood T in students, both T and estradiol were 
postively correlated with self-reported aggression in men, but 
the correlations were negative in women (Gladue, 1991). 

More consistent results have been obtained with behav- 
ioral (non-self-report) assessments. A study of nearly 700 
male prison inmates found that salivary T was related to a 
history of violent crimes, particularly rape, homicide, and 
child molestation, as well as violations of prison rules, partic- 
ularly those involving assault (Dabbs, Carr, Frady, & Riad, 
1995). A study of female inmates showed a relationship of T 
with aggressive dominance in prison but not with the history 
of criminal violence. A group of alcoholics with a history of 
violence had elevated levels of serum T relative to other al- 
coholics (Bergman & Brismar, 1994). 

Even among nonclinical samples there is correlational ev- 
idence of a relationship between T and aggression. T corre- 
lated with more aggressive fighting in men during judo 
contests (Salvador, Suay, Martinez, Simon, & Brain, 1999) 
and in amount of shock given to an opponent in a contrived 
laboratory situation (Berman, Gladue, & Taylor, 1993). 

Whether self-report or behavioral, correlational studies 
cannot establish cause and effect. There is ample evidence in 
both animal and human studies that aggressive experience 
in competition may raise T levels in victors or lower them in 
those who are defeated. Experimental studies in which the ef- 
fects of raised T levels on aggression are observed might be 
helpful. Clinical studies of steroid users have shown in- 
creased aggressiveness in some of them (Pope & Katz, 1994). 
Archer (1991) reviewed studies in which T or T-stimulating 
hormones were given and effects on aggression assessed by 
self-report. Although there is some evidence that T can affect 
hostility, there are also some negative findings from other 
studies. In all likelihood there is a continuous interaction be- 
tween endogenous levels of T and life experiences (affecting 
current levels) during life. T makes one more likely to 
aggress, and aggression or its anticipation raises T levels. 

Longitudinal studies may also elucidate the complex causal 
pattern. In one study a group of boys was followed from 6 to 13 
years of age (Tremblay et al., 1998). T at age 12 and body mass 
predicted social dominance in adolescence but only body mass 
independently predicted physical aggression. The authors sug- 
gest that the relation between aggression and T in adolescents 
may be mediated by the effect of T in the change in physique 
in the context of dominance. A similar study followed males 
from pre- or early adolescence (12-13 years) and found little 
relationship between early or concurrent measure of T and ag- 
gression; the few that were found did not persist over time 
(Halpern, Udry, Campbell, & Suchindran, 1993). 
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Short time periods of prediction may confound en- 
vironmental-developmental interactions that could mask the 
influence of endogenous levels of T. Windle and Windle 
(1995), in a retrospective longitudinal study, examined the 
adult levels of plasma T in four groups: (a) those who were 
aggressive only in childhood; (b) those who became aggres- 
sive as adults; (c) those who were aggressive in both child- 
hood and adulthood (continuity); and (d) those who were low 
in aggression in childhood and adulthood. Adult onset and 
continuity (in aggressiveness) groups had higher T levels as 
adults than the other two groups. Other than aggressiveness, 
the high-T adult groups had higher rates of antisocial person- 
ality and a history of various signs of antisocial behavior. 
Was the high level of T in these groups a product of their his- 
tory or a sign of an earlier level of T that affected the devel- 
opment of these behaviors? The authors admit that it is 
impossible to answer this question. 

High levels of cortisol are associated with stress and inhi- 
bition and low levels with impulsivity and sensation seeking, 
as noted previously. In baboons dominant and aggressive 
males usually have low levels of cortisol and subordinate and 
nonaggressive primates have higher levels of cortisol. As 
with testosterone, cortisol varies considerably with recent 
and long-term patterns of experience such as winning or los- 
ing in fights. Low levels of cortisol have been found in psy- 
chopathic, violent offenders (Virkkunen, 1985), but high 
levels of cortisol are positively associated with hostility as 
measured by hostility questionnaires (Keltikangas, Räikkö- 
nen, & Adlercreutz, 1997; Pope & Smith, 1991). Chronic 
feelings of hostility are often associated with anxiety and de- 
pression, but the type of impulsive aggression seen in antiso- 
cial personality represents a brief state of anger in a generally 
unemotional personality. 


Genetics 


Behavior genetic studies of general hostility scales or ag- 
gression in children have shown significant heritabilities. 
However, it is possible that some aspects of hostility or ag- 
gression may be more heritable than others. A twin study of 
adult males using the BDHS revealed heritabilities ranging 
from 28% for verbal hostility to 47% for assault (Coccaro, 
Bergeman, Kavoussi, & Seroczynski, 1997). Verbal hostility 
is the most common form and yet it had the least heritability 
and the strongest environmental influence. An analysis of the 
genetic influence on the correlations among the scales that 
the assault scale had different underlying influences than the 
other scales which shared a common genetic influence. With 
the exception of the assault scale the genetic influence un- 
derlying the scales is of a nonadditive type suggesting 


Mendelian dominant or recessive or epistatic mechanisms. If 
it is the former, there is the likelihood of finding a gene of 
major effect in the general trait of aggression, apart from 
physical assault type. 

The MAO type-A gene has become a likely candidate for 
this trait. Aggression in male mice is heightened by deletion 
of the MAO-A gene (Cases et al., 1995), and a mutation in 
the gene in a large Dutch family has been linked to mild re- 
tardation and impulsive aggressive behavior (Bruner, Nelsen, 
Breakfield, Ropers, & van Oost, 1993). The mutation is rare, 
but the gene has a wide range of alleles varying in repeat 
length. Subjects with one form, in contrast to those with 
another form, had lower scores on an index of aggression/ 
impulsivity and the Barratt impulsiveness scale (Manuck, 
Flory, Ferrell, Mann, & Muldoon, 2000). The life history of 
aggression only approached significance and the BDHS did 
not show significant differences between allele groups. Ap- 
parently, the impulsivity was more salient than the aggres- 
siveness in the combination. Consistent with the association 
between low serotonin and aggression in the finding that the 
allele group with the higher impulsive aggression score also 
showed less response to a serotonergic challenge test. 

Just as the findings on the MAO-A gene suggest one 
source of the link between serotonin and aggression, another 
gene has been found that suggests a genetic mechanism for 
the association of norepinephrine with aggression. The 
adrenergic-2A receptor gene (ADRA2A) plays a role in mod- 
ulating norepinephrine release in the locus coeruleus. Alleles 
of this gene were associated with scales for hostility and im- 
pulsivity in a younger student sample and impulsivity alone 
in an older sample (Comings et al., 2000). 


Summary 


Extreme violence has been associated with EEG evidence of 
cortical abnormality usually in the form of an excess of slow 
wave activity (underarousal) or focalized EEG abnormalities 
in the temporal lobes. Brain scans have confirmed the tem- 
poral lobe abnormalities and also found an equal incidence 
of frontal lobe abnormalities. A reduced P300 cortical EP re- 
sponse has also been found in prisoners with a history of ex- 
tremely violent behavior. The reduced activity and reactivity 
in the frontal lobes may reflect a deficit in inhibitory capac- 
ity, which is part of the executive function of these lobes. 
The abnormal activity of the temporal lobe may be sympto- 
matic of abnormal amygdala function because this lobe is in 
close proximity to the underlying amygdala. An MRI study 
has revealed temporal lobe lesions in about one third of vio- 
lent patients. Hostility or anger proneness is related to a high 
level of cardiovascular, noradrenergic, and testosterone and 


cortisol response to stress or perceived attack. Suppressed 
hostility can lead to cardiovascular disease. 

Among the monoamines, serotonin deficit is most highly 
associated with impulsive aggression. However, low sero- 
tonin is associated with depression and suicide as well as ag- 
gression and homicide, another example of the multiple trait 
associations of biological markers. Lack of emotional and be- 
havioral control is the likely consequence of serotonin deficit. 
Depletion of tryptophan, the precursor of serotonin in the 
production chain, increases aggressive responses and angry 
and hostile feelings in laboratory experiments. Augmentation 
of serotonin, through reuptake inhibitors, can reduce aggres- 
sion in aggression-prone persons. 

Unlike depression, in which both serotonin and norepi- 
nephrine depletions are seen, brain norepinephrine (from 
CSF) tends to be positively correlated with aggressive ten- 
dencies in monkeys and humans. However, low levels of pe- 
ripheral levels of the catecholamines norepinephrine and 
epinephrine are also related to aggressiveness. We need to 
distinguish between the type of aggression that occurs in 
states of high emotional arousal and the cold type of aggres- 
sion more characteristic of the psychopath. The latter type 
may be reflected in the low levels of peripheral cate- 
cholamine reactivity. 

Testosterone is associated with aggression based on be- 
havioral records, but results using self-report measures of 
hostility or aggression are less conclusive. Prisoners with ei- 
ther histories of extremely violent crimes or characterized by 
aggression in prison show high levels of testosterone. Testos- 
terone is increased by victory in competitive contests and 
sexual stimulation and decreased by defeat, raising the old 
“chicken or egg” problem of causation. The influence of 
testosterone during development may be mediated by its in- 
fluence on physique in male adolescents where it is associ- 
ated with a more muscular mesomorhpic body build. Low 
cortisol levels are found in aggressive types and are also in- 
fluenced by the outcomes of fights. 

Aggression trait is moderately heritable, but its heritability 
depends on the form it takes. Assaultive aggression is moder- 
ately heritable but verbal aggression is only weakly heritable. 
The gene for MAO of the A type has been linked to aggres- 
sion in a human family study. Deletion of the MAO-A gene in 
mice increases their aggressivity, suggesting that the gene is 
involved in the inhibition or regulation of aggression. 


CONCLUSIONS 


Wilson (1998) described consilience as a quality of science 
that links knowledge across disciplines to create a common 
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background of explanation. Personality psychology, extend- 
ing from social psychology at the higher level to biopsychol- 
ogy at the more fundamental level, provides a daunting 
challenge to consilience. The introduction to this chapter pre- 
sented a model of levels along the biological and social path- 
ways leading up to a merger in personality traits. 

Such a levels approach suggests a goal of reductionism, a 
pejorative term for critics of science and many scientists as 
well. The artist is contemptuous of the critic’s attempts to re- 
duce his or her art to a textual formula, and the social scien- 
tist may resent the presumptious intrusion of the biological 
scientist into his or her own complex type of explanation. 
Wilson, however, views reductionism as a natural mode of 
science: 


The cutting edge of science is reductionism, the breaking apart 
of nature into its natural constituents. . . . It is the search strategy 
employed to find points of entry into otherwise impenetrably 
complex systems. Complexity is what interests scientists in the 
end, not simplicity. Reductionism is the way to understand it. 
The love of complexity without reductionism makes art; the love 
of complexity with reductionism makes science. (pp. 58-59) 


Later, Wilson (1998) admits that reductionism is an over- 
simplification that may sometimes be impossible. At each 
level of organization the phenomena may require new laws 
and principles that cannot be predicted from those at more 
general levels. My view is that this is always true for levels 
that involve an interaction between biological traits or genes 
and experience in the social environment. A learned associa- 
tion cannot be reduced to a specific set of neural events, at 
least not in the complex brain of a higher organism. It is not 
inconceivable, however, that the difference in general neural 
events that make an association more likely in one individual 
than another is not only explicable but also essential for a 
complete understanding of the event. Consilience is more 
possible at the borders of two levels, and this is where the 
breakthroughs are most likely to take place. As Wilson puts 
it, “The challenge and the cracking of thin ice are what gives 
science its metaphysical excitement” (p. 60). 

This chapter was organized around a top-down approach, 
starting with four broad classes of personality traits that are em- 
pirically identifiable across several systems of trait description: 
extraversion/sociability, neuroticism/anxiety, impulsiveness/ 
conscientousness, and aggression/agreeableness. One way to 
bypass the complex social determinants of these traits in 
human societies is to look for appropriate animal models and 
biological links between behavior in these species and 
our own. This approach has identified certain biological mark- 
ers for analogous behavioral traits such as the monoamine 
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neurotransmitters and enzymes like MAO that regulate them; 
hormones like testosterone and cortisol; psychophysiological 
characteristics such as augmenting/ reducing of the cortical 
evoked potential; brain structure and physiology as assessed 
by brain imaging methods in humans and lesion and stimula- 
tion studies in other species; and molecular genetic studies 
that link genes, biological mechanisms, and behavioral and 
personality traits. 

Simple-minded reductionism would expect one personal- 
ity or behavioral trait to be associated with one brain struc- 
ture, one neurotransmitter, one hormone, one physiological 
pattern of reactivity, and one gene in both humans and other 
animals. The chapter is organized by personality traits, but if 
one reads across the traits it is clear that this neat kind of 
phrenological isomorphism is not the rule. Evolution may 
have shaped the nervous system around behavioral mecha- 
nisms necessary for adaptation, but evolution did not select 
for personality traits. The tendency to explore, forage, and ap- 
proach novel but nonthreatening objects or creatures is part of 
that adaptation and is important in survival, as is competitive 
and defensive aggression, cooperation, and even altruism. 

If we reverse direction and work up from the biological 
mechanisms to the personality trait and behavioral levels 
the fourfold classification at the top becomes blurred. 
Monoamine reactivities, MAO, testosterone, cortisol, and re- 
activity of cortical EPs to stimulus variation are related to so- 
ciability and sensation seeking, impulsivity and aggression, 
asocialization, neuroticism, anxiety, and inhibition, but in no 
simple one-to-one manner. Low levels of serotonergic activ- 
ity are related to both depression and impulsive aggression 
producing both violent and impulsive homicides and sui- 
cides, sometimes in the same person. Is it the impulsivity, the 
aggression, or the neuroticism that is related to a serotonin 
deficiency? High levels of testosterone are related to socia- 
bility and social dominance, disinhibitory sensation seeking, 
aggressivity, asocialization, and low levels to neuroticism 
and agreeableness. Low levels of MAO are related to sensa- 
tion seeking, impulsivity, asocial tendencies, and sociability. 

Personality traits may be orthogonal, but biological traits 
do not respect these boundaries. It is almost as if the func- 
tional biology of the organism is organized around two basic 
traits: approach (including sociability, impulsivity, sensation 
seeking, and aggression) and inhibition/avoidance (or 
neuroticism/anxiety at the personality trait level). The com- 
parative psychologist Schneirla (1959) put this idea into a 
postulate: “For all organisms in early ontogenetic stages, low 
intensities of stimulation tend to evoke approach reactions, 
high intensities withdrawal reactions” (p. 3). In evolved or 
more mature organisms Schneirla used the terms “seeking” 
and “avoidance” in place of “approach” and withdrawal.” 
The latter terms convey the idea of reflexive or tropistic 


mechanisms, whereas the former imply learned behavior. 
Approach-withdrawal describes a basic dimension of tem- 
perament and inhibition/shyness another in infant scales of 
temperament. These individual differences in infants may 
represent two biologically based dimensions found in other 
species, and they may develop into more diffentiated charac- 
teristics in adult humans. 

Genetic dissection is one method of defining the bound- 
aries of biological influence in traits. If both biological and 
behavioral traits are included in biometric or molecular ge- 
netic studies, the genetic covariance between the genetic and 
the other two can be determined. Rarely are genetic, biologi- 
cal, and behavioral traits all included in one study. 

A biosocial approach cannot ignore the complex interac- 
tions between biological traits and environmental experi- 
ences. In both animals and humans the levels of the hormones 
testosterone and cortisol influence behavioral interactions 
with the environment but are in turn influenced by the out- 
comes of these interactions. There is no reason to think that 
similar interactions do not occur for the monoamine neuro- 
transmitters. All of these systems are regulated by internal 
mechanisms. For instance, if there is overactivity in a system, 
regulators like MAO may catabolize the excess neurotrans- 
mitter. There may be more trait stability in the regulator than 
in the transmitter itself. After repeated experiences, however, 
there may be changes in the activity of a biological system 
that are relatively enduring if not irreversible. Environment 
may even influence the effect of genes by affecting their 
release. Given the constant interaction between the biological 
and environmental pathways (Figure 4.1), reductionism of 
one to the other is impossible. It would be like describing the 
biological activity of the lungs in the absence of oxygen, the 
digestive organs in the absence of food, or, using a more rel- 
evant analogy, the brain in the absence of stimulation. 

Psychology emerged from the biological sciences more 
than a century ago, although its origins were forgotten by 
those who wanted a science that would emulate physics 
and those who wanted to cut all connections with the biolog- 
ical sciences. Fifty years ago, when I entered the field, the 
founder of behaviorism, Watson, had declared that the out- 
come of personality was entirely a matter of life experience 
(conditioning) and had nothing to do with genetics, and 
Skinner had declared the irrelevancy of the brain in behavior. 
Despite Freud’s own view that the mysteries of the psyche 
would one day be understood in terms of biology, his follow- 
ers advocated an environmental determinism that put the en- 
tire weight of explanation on society, the family, and early 
experience. These early prophets of our science are now his- 
torical footnotes, and the science is more cognitive and 
biosocial with new cross disciplines like cognitive neuro- 
science emerging. The changes are in large measure due to 


the rapid advances in the neurosciences that have opened 
new, unforeseen vistas in psychology. Further progress will 
also procede apace with the development of new methodolo- 
gies and the refinement of current ones. 

Behavior genetics has challenged the radical environmen- 
talist position by showing that nearly all personality traits and 
even some broad attitudinal traits have a significant degree of 
genetic determination. It is becoming a truism that genes in- 
teract with environment throughout life. But the precise na- 
ture of this complex interaction remains obscure. Genes do 
not make personality traits; they make proteins. The develop- 
ment of molecular behavioral genetics will help solve some 
of these problems. When we know some of the major genes 
involved with a personality trait and what these genes make 
and influence in the nervous system, we will be in a better po- 
sition to define the biological mechanisms that lie between 
gene and behavior. Knowing the gene-biological trait link is 
not sufficient until we can understand the way the biological 
mechanism interacts with the environment, or more specifi- 
cally the brain-behavior relationship. 

Until recent decades the study of the brain was limited to 
peripheral measures like the EEG. The brain-imaging meth- 
ods are only in their infancy but are already influencing the 
course of our science. The ones like PET or the more effective 
fMRI can tell us exactly what is happening in the brain after 
the presentation of a stimulus or condition, as well as where it 
is happening. The expensiveness of these methods has lim- 
ited their use to medical settings and to clinical populations. 
Studies of personality in normals are rare and incidental to the 
objectives of clinical studies. They usually involve small 
numbers of subjects with a consequent unreliability of find- 
ings. Sooner or later the application of these methods to the 
study of personality dimensions in nonclinical populations 
will help to understand exactly what a personality predisposi- 
tion is in the brain. Longitudinal studies starting with genetic 
and neurochemical markers and tracking the fate of individu- 
als with these markers through life will enable us to predict 
both normal variant outcomes and psychopathology. 
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Freud’s psychoanalysis is like Picasso’s cubism. Controver- 
sial from the outset, Picasso’s work enchanted some and 
alienated others, but every twentieth-century painter has re- 
sponded to it in some way. So it is with Freud’s psychoana- 
lytic theory: Some psychologists love it, others hate it, but 
almost every psychologist has reacted to it—deliberately or 
inadvertently, consciously or unconsciously—in his or her 
own work. 

Psychoanalysis and cubism are alike in at least one other 
respect as well. Both paradigms changed in fundamental ways 
our view of the world by pointing out limitations in our habit- 
ual manner of thinking and perceiving. Cubism compelled us 
to view a given object or situation from multiple perspectives 
simultaneously—no single viewpoint can capture the com- 
plexity of the scene. Psychoanalysis taught us much the same 
thing, but instead of looking outward toward the external 
world, psychoanalysis turned our attention inward. In the 
process, it altered forever the way we see ourselves. 

Evaluating the validity and utility of a theory of personality 
is never easy, but it is particularly challenging for a theory as 
complex and far-reaching as psychoanalysis. Psychoanalytic 
theory touches upon virtually every aspect of human mental 
life, from motivation and emotion to memory and information 
processing. Embedded within this larger model is a theory of 
personality, but it is not always obvious where the personality 
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portion of psychoanalysis begins and other aspects of the 
model leave off. Because of this, one cannot assess the psy- 
choanalytic theory of personality without examining psycho- 
analytic theory in toto, with all its complexity, intricacy, and 
controversy. 

This chapter reviews psychodynamic models of personal- 
ity and their place in contemporary psychology. The chapter 
begins with a brief discussion of the core assumptions of psy- 
choanalytic theory, followed by an overview of the evolution 
of the theory from Freud’s classical model to today’s integra- 
tive psychodynamic frameworks. I then discuss the common 
elements in different psychodynamic models and the ways 
that these models have grappled with key questions regarding 
personality development and dynamics. Finally, I discuss the 
place of psychoanalysis within contemporary psychology 
and the relationship of psychoanalytic theory to other areas of 
the discipline. 


THE CORE ASSUMPTIONS OF PSYCHOANALYSIS 


Given the complexity of psychoanalytic theory and the myr- 
iad incarnations that the model has assumed over the years, 
the core assumptions of the psychodynamic framework are 
surprisingly simple. Moreover, the three core assumptions of 
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psychoanalysis are unique to the psychodynamic framework: 
No other theories of personality accept these three premises 
in their purest form. 


Primacy of the Unconscious 


Psychodynamic theorists contend that the majority of psy- 
chological processes take place outside conscious awareness. 
In psychoanalytic terms, the activities of the mind (or psyche) 
are presumed to be largely unconscious, and unconscious 
processes are thought to be particularly revealing of personal- 
ity dynamics (Brenner, 1973; Fancher, 1973). Although 
aspects of the primacy of the unconscious assumption remain 
controversial (see Kihlstrom, 1987; McAdams, 1997), re- 
search on implicit learning, memory, motivation, and cog- 
nition has converged to confirm this basic premise of 
psychoanalysis (albeit in a slightly modified form). Many 
mental activities are only imperfectly accessible to con- 
scious awareness—including those associated with emotional 
responding, as well as more mundane, affectively neutral 
activities such as the processing of linguistic material (see 
Bornstein & Pittman, 1992; Greenwald & Banaji, 1995; 
Schacter, 1987; Stadler & Frensch, 1998). Whether uncon- 
scious processes are uniquely revealing of personality dy- 
namics is a different matter entirely, and psychologists remain 
divided on this issue. 

It is ironic that the existence of mental processing outside 
awareness—so controversial for so long—has become a cor- 
nerstone of contemporary experimental psychology. In fact, 
in summarizing the results of cognitive and social research on 
automaticity, Bargh and Chartrand (1999) recently concluded 
that evidence for mental processing outside of awareness is 
so pervasive and compelling that the burden of proof has 
actually reversed: Rather than demonstrate unconscious in- 
fluences, researchers must now go to considerable lengths to 
demonstrate that a given psychological process is at least in 
part under conscious control. This conclusion represents a 
rather striking (and counterintuitive) reversal of prevailing 
attitudes regarding the conscious-unconscious relationship 
throughout much of the twentieth century. 


Psychic Causality 


The second core assumption of psychodynamic theory is 
that nothing in mental life happens by chance—that there 
is no such thing as a random thought, feeling, motive, or be- 
havior (Brenner, 1973). This has come to be known as the 
principle of psychic causality, and it too has become less con- 
troversial over the years. Although few psychologists accept 


the principle of psychic causality precisely as psychoanalysts 
conceive it, most theorists and researchers agree that cogni- 
tions, motives, emotional responses, and expressed behaviors 
do not arise randomly, but always stem from some combina- 
tion of identifiable biological and/or, psychological processes 
(Rychlak, 1988). 

Although few psychologists would argue for the existence 
of random psychological events, researchers do disagree 
regarding the underlying processes that account for such 
events, and it is here that the psychodynamic view diverges 
from those of other perspectives. Whereas psychoanalysts 
contend that unconscious motives and affective states are key 
determinants of ostensibly random psychological events, psy- 
chologists with other theoretical orientations attribute such 
events to latent learning, cognitive bias, motivational conflict, 
chemical imbalances, or variations in neural activity (e.g., see 
Buss, 1991; Danzinger, 1997). The notion that a seemingly 
random event (e.g., a slip of the tongue) reveals something im- 
portant about an individual’s personality is in its purest form 
unique to psychoanalysis. 


Critical Importance of Early Experiences 


Psychoanalytic theory is not alone in positing that early de- 
velopmental experiences play a role in shaping personality, 
but the theory is unique in the degree to which it emphasizes 
childhood experiences as determinants of personality devel- 
opment and dynamics. In its strongest form, psychoanalytic 
theory hypothesizes that early experiences—even those oc- 
curring during the first weeks or months of life—set in motion 
personality processes that are to a great extent immutable (see 
Emde, 1983, 1992). In other words, the events of early child- 
hood are thought to create a trajectory that almost invariably 
culminates in a predictable set of adult character traits (Eagle, 
1984; Stern, 1985). This is especially of events that are out- 
side the normal range of experience (i.e., very positive or very 
negative). 

The psychodynamic hypothesis that the first weeks or 
months of life represent a critical period in personality de- 
velopment contrasts with those of alternative theories (e.g., 
cognitive), which contend that key events in personality 
development occur somewhat later, after the child has ac- 
quired a broad repertoire of verbal and locomotive skills. 
Freud’s notion of a critical early period in personality devel- 
opment—coupled with his corollary hypothesis that many of 
the most important early experiences involve sexual frustra- 
tion or gratification—was (and is) highly controversial. It 
helped create a decades-long divergence of psychoanalysis 
from mainstream developmental psychology, which has only 
recently begun to narrow (Emde, 1992). 


THE EVOLUTION OF PSYCHOANALYSIS: GAZING 
ACROSS THREE CENTURIES 


Many psychodynamic ideas—including the core assump- 
tions just discussed—predated Freud’s work and were anti- 
cipated by eighteenth and nineteenth century philosophers 
(Ellenberger, 1970; Hilgard, 1987). Nonetheless, psychoana- 
lytic theory as an independent school of thought was con- 
ceived just over 100 years ago, with the publication of Breuer 
and Freud’s (1895/1955) Studies on Hysteria. Since that time, 
the history of psychoanalysis can be divided into four over- 
lapping phases: classical psychoanalytic theory, neo-analytic 
models, object relations theory and self psychology, and con- 
temporary integrative models. Each phase introduced a novel 
approach to human development and personality. 


Classical Psychoanalytic Theory 


Given Freud’s background in neurology, it is not surprising 
that the first incarnation of psychoanalytic theory was 
avowedly biological. In his early writings, Freud (1895/1966, 
1900/1958a) set out to explain psychological phenomena 
in terms that could be linked to extant models of neural 
functioning (an ironic goal to say the least, given that psy- 
choanalysis developed in part to explain “neurological” 
symptoms that had no identifiable neurological basis, such as 
hysterical blindness and hysterical paralysis). 

Because the core principles of classical psychoanalytic 
theory developed over more than 40 years, there were numer- 
ous revisions along the way. Thus, it is most accurate to think 
of classical psychoanalytic theory as a set of interrelated mod- 
els, which were often (but not always) consistent with and 
supportive of each other: the drive model, the topographic 
model, the psychosexual stage model, and the structural 
model. 


The Drive Model 


One consequence of Freud’s determination to frame his the- 
ory in quasi-biological terms is that the earliest version of 
psychoanalytic drive theory was for all intents and purposes 
a theory of energy transformation and tension reduction 
(Breuer & Freud, 1895; Freud, 1896/1955c). Inborn (presum- 
ably inherited) instincts were central to the drive model, and 
most prominent among these was the sex drive, or libido. 
Freud’s interest in (some might say obsession with) sexual 
impulses as key determinants of personality development and 
dynamics was controversial during his lifetime, and remains 
so today (e.g., see Torrey, 1992). At any rate, during the ear- 
liest phase of psychoanalytic theory, personality was seen as 
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a by-product of the particular way in which sexual impulses 
were expressed in an individual. 

Freud never fully renounced the drive concept, even after 
he shifted the emphasis of psychoanalytic theory from inborn 
instincts to dynamic mental structures with no obvious bio- 
logical basis (Greenberg & Mitchell, 1983). The concept of 
cathexis—investment of libidinal (or psychic) energy in an 
object or act—remained central to psychoanalytic theory even 
as the drive model waned in influence. As his career drew to a 
close during the 1930s, Freud (1933/1964a, 1940/1964b) con- 
tinued to use the concept of cathexis to account for a wide 
range of psychological processes, from infant-caregiver 
bonding and infantile sexuality to group behavior and para- 
praxes (i.e., “Freudian slips”). 

As the concept of cathexis became reified in classical psy- 
choanalytic theory, so did the companion concepts of fixation 
De, lingering investment of psychic energy in objects and 
activities from an earlier developmental period), and regres- 
sion (i.e., reinvestment of psychic energy in an earlier stage of 
development, usually under stress). As should become appar- 
ent, the concept of cathexis gradually faded from view, but 
the concepts of fixation and regression continue to be widely 
discussed and used to explain a wide range of issues related to 
personality development and dynamics. 


The Topographic Model 


At the same time as Freud was refining the drive theory, he 
was elaborating his now-famous topographic model of the 
mind, which contended that the mind could usefully be di- 
vided into three regions: the conscious, preconscious, and un- 
conscious (Freud, 1900/1958a, 1911/1958b). Whereas the 
conscious part of the mind was thought to hold only informa- 
tion that demanded attention and action at the moment, the 
preconscious contained material that was capable of becom- 
ing conscious but was not because attention (in the form of 
psychic energy) was not invested in it at that time. The un- 
conscious contained anxiety-producing material (e.g., sexual 
impulses, aggressive wishes) that were deliberately repressed 
De, held outside of awareness as a form of self-protection). 
Because of the affect-laden nature of unconscious material, 
the unconscious was (and is) thought to play a more central 
role in personality than are the other two elements of Freud’s 
topographic model. In fact, numerous theories of personality 
ascribe to the notion that emotion-laden material outside 
of awareness plays a role in determining an individual’s per- 
sonality traits and coping style (see Hogan, Johnson, & 
Briggs, 1997; Loevinger, 1987). 

The terms conscious, preconscious, and unconscious con- 
tinue to be used today in mainstream psychology, and research 
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has provided a surprising degree of support for this tripartite 
approach in the areas of memory and information processing 
(Bucci, 1997; Stein, 1997; Westen, 1998). Consciousness is 
indeed linked with attentional capacity, and studies show that 
a great deal of mental processing (including perceptual pro- 
cessing) occurs preconsciously (Bornstein, 1999b; Erdelyi, 
1985). As noted earlier, the existence of a dynamic uncon- 
scious remains controversial, with some researchers arguing 
that evidence favoring this construct is compelling (Westen, 
1998), and others contending that “unconscious” processing 
can be accounted for without positing the existence of a 
Freudian repository of repressed wishes and troubling urges 
and impulses (Kihlstrom, 1987, 1999). 

Perhaps the most troubling aspect of the topographic 
model—for Freud and for contemporary experimentalists as 
well—concerns the dynamics of information flow (i.e., the 
mechanisms through which information passes among 
different parts of the mind). Freud (1900/1958a, 1915/1957, 
1933/1964a) used a variety of analogies to describe informa- 
tion movement among the conscious, preconscious, and un- 
conscious, the most well-known of these being his gatekeeper 
(who helped prevent unconscious information from reaching 
conscious awareness), and anteroom (where preconscious in- 
formation was held temporarily before being stored in the un- 
conscious). Contemporary researchers (e.g., Baddeley, 1990) 
have coined terms more scientific than those Freud used (e.g., 
central executive, visuospatial scratch pad), but in fact they 
have not been much more successful than Freud was at spec- 
ifying the psychological and neurological mechanisms that 
mediate intrapsychic information flow. 


The Psychosexual Stage Model 


Freud clung to the drive model (and its associated topo- 
graphic framework) for several decades, in part because of his 
neurological background, but also because the drive model 
helped him bridge the gap between biological instincts and 
his hypothesized stages of development. By 1905, Freud had 
outlined the key elements of his psychosexual stage model, 
which argued that early in life humans progress through an 
invariant sequence of developmental stages, each with its 
own unique challenge and its own mode of drive (i.e., sexual) 
gratification (Freud, 1905/1953, 1918/1955a). Freud’s psy- 
chosexual stages—oral, anal, Oedipal, latency, and genital— 
are well known even to nonpsychoanalytic psychologists. So 
are the oral, anal, and Oedipal (or phallic) character types as- 
sociated with fixation at these stages (Fisher & Greenberg, 
1996). From a personality perspective, the psychosexual 
stage model marks a turning point in the history of psycho- 
analysis because it was only with the articulation of this 


TABLE 5.1 The Psychosexual Stage Model 


Associated 


Stage Age Range Character Traits 


Oral 0-18 months 


Developmental Task 


Moving from infantile 
dependency toward 
autonomy and self- 
sufficiency 

Learning to exercise 
control over one’s 
body, one’s impulses, 
and other people 

Mastering competitive 
urges and acquiring 
gender role related 
behaviors 

Investing energy in — 
conflict-free 
(nonsexual) tasks 
and activities 

Mature sexuality — 
(blending of 
sexuality and 
intimacy) 


Dependency 


Anal 18-36 months Obsessiveness 


Oedipal 5-6 years Competitiveness 


Latency 6 years— 


puberty 


Genital Puberty 
onward 


Note. Dashes indicate that no associated character traits exist (fixation in the 
latency and genital periods does not play a role in classical psychoanalytic 
theory). 


model that personality moved from the periphery to the cen- 
ter of psychoanalytic theory. 

Table 5.1 illustrates the basic organization of Freud’s 
(1905/1953) psychosexual stage model. Frustration or over- 
gratification during the infantile, oral stage was hypothesized 
to result in oral fixation, and an inability to resolve the devel- 
opmental issues that characterize this period (e.g., conflicts 
regarding dependency and autonomy). The psychosexual 
stage model further postulated that the orally fixated (or oral 
dependent) person would (a) remain dependent on others for 
nurturance, protection, and support; and (b) continue to ex- 
hibit behaviors in adulthood that reflect the oral stage Oe, 
preoccupation with activities of the mouth, reliance on food 
and eating as a means of coping with anxiety). Research sup- 
ports the former hypothesis, but has generally failed to con- 
firm the latter (Bornstein, 1996). 

A parallel set of dynamics (i.e., frustration or overgratifi- 
cation during toilet training) were assumed to produce anal 
fixation and the development of an anal character type. Be- 
cause toilet training was viewed by Freud as a struggle for 
control over one’s body and impulses, the anally fixated indi- 
vidual was thought to be preoccupied with issues of control, 
and his or her behavior would thus be characterized by a con- 
stellation of three traits, sometimes termed the anal triad: 
obstinacy, orderliness, and parsimony (Masling & Schwartz, 
1979). Fixation during the Oedipal stage was presumed to 
result in a personality style marked by aggressiveness, 


competitiveness, and a concern with status and influence 
(Fisher & Greenberg, 1996; Juni, 1992). 

Empirical studies have yielded mixed results with respect 
to the anal and Oedipal stages. Studies support the existence 
of an anal triad, but they do not support the critical role of 
toilet training in the ontogenesis of these traits (Kline, 1981). 
Similarly, research offers only mixed support for the concept 
of an Oedipal personality type and offers little evidence for the 
Oedipal dynamic as Freud conceived it (Fisher & Greenberg, 
1996; Masling & Schwartz, 1979). 


The Structural Model 


Ultimately, Freud recognized certain explanatory limitations 
in the topographic model (e.g., the model’s inability to ac- 
count for certain forms of psychopathology), and as a result 
he developed an alternative, complementary framework to 
explain normal and abnormal personality development. Al- 
though the structural model evolved over a number of years, 
the theoretical shift from topography to structure is most 
clearly demarcated by Freud’s (1923/1961) publication of 
The Ego and the Id, wherein he described in detail the central 
hypothesis underlying the structural model: the notion that 
intrapsychic dynamics could be understood with reference 
to three interacting mental structures called the id, ego, and 
superego. The id was defined as the seat of drives and in- 
stincts (a throwback to the original drive model), whereas the 
ego represented the logical, reality-oriented part of the mind, 
and the superego was akin to a conscience, or set of moral 
guidelines and prohibitions (Brenner, 1973). Figure 5.1 illus- 
trates the sequence of development of the id, ego, and super- 
ego in Freud’s structural model. 

According to the structural model, personality is derived 
from the interplay of these three psychic structures, which 
differ in terms of power and influence (Freud, 1933/1964a, 
1940/1964b). When the id predominates, an impulsive, 
stimulation-seeking personality style results. When the 
superego is strongest, moral prohibitions inhibit impulses, 
and a restrained, overcontrolled personality ensues. When 
the ego (which serves in part to mediate id impulses and 
superego prohibitions) is dominant, a more balanced set of 
personality traits develop. Table 5.2 summarizes the psycho- 
dynamic conceptualization of personality in Freud’s struc- 
tural model, as well as within the drive, topographic, and 
psychosexual stage models. 

From 1923 until his death in 1939, Freud spent much of 
his time elaborating the key principles and corollaries of the 
structural model, and he extended the model to various areas 
of individual and social life (e.g., humor, mental errors, cul- 
tural dynamics, religious belief). He also made numerous 
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Id: Present at birth. 


Ego: Age 2+; develops as a result 
of imperfect parenting and 
the child’s need to develop 
independent coping strategies. 


Superego: Age 5+; develops when the child 
becomes capable of internalizing 
abstract rules and principles 
as communicated by parents and 
others. 


Figure 5.1 Development of the id, ego, and superego in classical psycho- 
analytic theory. 


efforts to link the structural model to his earlier work in order 
to form a more cohesive psychodynamic framework. For 
example, Freud (and other psychoanalysts) hypothesized 
that oral fixation was characterized in part by a prominent, 
powerful id, whereas Oedipal fixation was characterized by 
strong investment in superego activities. At the time of his 
death, Freud was actively revising aspects of the structural 
model (Fancher, 1973; Gay, 1988), and it is impossible to 
know how the model would have developed had Freud con- 
tinued his work. This much is certain, however: During the 
decades wherein Freud explicated details of the structural 
model of the mind, he altered it in myriad ways, and in doing 
so he laid the foundation for several concepts that—many 
years later—became key elements of modern psychoanalytic 
theory. 


TABLE 5.2 Conceptions of Personality Within Classical 
Psychoanalytic Theory 


Model Conception of Personality 
Drive Personality traits as drive (instinct) derivatives. 
Topographic Unconscious (repressed) material is a primary 


determinant of personality. 


Psychosexual Fixation at a particular psychosexual stage 
leads to an associated character type. 
Structural Id-ego-superego dynamics determine personality 


traits and coping strategies. 
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Neo-Analytic Models 


Following Freud’s 1909 Clark University lectures, psycho- 
analysis attracted large numbers of adherents from within the 
medical and lay communities. At first, these adherents fol- 
lowed Freud’s ideas with little questioning and minimal re- 
sistance. By the early 1920s, however, competing schools 
of psychoanalytic thought were beginning to emerge both in 
Europe and in America. At first, the growth of these alterna- 
tive psychodynamic frameworks was inhibited by Freud’s 
strong personality and by the immense international popular- 
ity of psychoanalytic theory (Hilgard, 1987; Torrey, 1992). It 
was only upon Freud’s death in 1939 that competing psycho- 
analytic perspectives blossomed into full-fledged theories in 
their own right. 

By the mid-1940s, the discipline had splintered into an 
array of divergent theoretical perspectives. This splintering 
process, which has continued (albeit in a somewhat abated 
form) to the present day, is summarized graphically in 
Figure 5.2. As Figure 5.2 shows, each post-Freudian psycho- 
dynamic model was rooted in classical psychoanalytic theory, 
but each drew upon ideas and findings from other areas of 
psychology as well. 


19th-century philosophy, 
„+ neurology, psychiatry, 


-” and academic psychology 


Classical 
Psychoanalytic 
Theory 


_ Cognitive, social, 
-7-7 _ and developmental 
wi 
=.” psychology 


y 
Neo-Analytic Self Object Relations 
Models Psychology Theory 
Behavioral, 


_ cognitive, and 
— 7 humanistic 
Ba treatment models 


Contemporary 


Integrative 
Theories 


Figure 5.2 Evolution of psychodynamic models of personality; arrows in- 
dicate the influence of earlier theories/perspectives on later ones. 


TABLE 5.3 Neo-Analytic Models of Personality 


Theorist Key Assumption Key Terms/Concepts 

Adler Family dynamics (especially Striving for 
birth order) are primary superiority, 
determinants of personality. inferiority complex 

Erikson Social interactions between Psychosocial stages, 
individual and significant developmental 
others are key in personality crises 
development. 

Fromm Personality is best understood Authoritarianism 
with reference to prevailing 
social and political (as well 
as intrapsychic) forces. 

Horney Infantile dependency- Basic anxiety 
powerlessness is key to 
personality. 

Jung Personality is shaped by Archetypes, 
spiritual forces as well as collective 
by biological and social unconscious 
variables. 

Sullivan Personality can only be Personifications, 
conceptualized within the developmental 


context of an individual’s epochs 
core relationships. 


Several neo-analytic theories became particularly influ- 
ential in the decades following Freud’s death. Among the 
most important of these were Jung’s (1933, 1961) analyti- 
cal psychology, Erikson’s (1963, 1968) psychosocial theory, 
Sullivan’s (1947, 1953) interpersonal theory, and the quasi- 
dynamic models of Adler (1921, 1923), Fromm (1941, 1947), 
and Horney (1937, 1945). These theories shared a Freudian 
emphasis on intrapsychic dynamics, childhood experiences, 
and unconscious processes as determinants of personality 
and psychopathology. However, each neo-analytic theorist 
rejected the classical psychoanalytic emphasis on sexuality 
as a key component of personality, and each theory sought to 
supplant sexuality with its own unique elements. Key fea- 
tures of the most prominent neo-analytic models are summa- 
rized in Table 5.3. 

Each neo-analytic model in Table 5.3 attained a loyal 
following during its heyday, but for the most part these neo- 
analytic models are no longer influential in mainstream psy- 
chology. To be sure, aspects of these neo-analytic theories 
continue to be discussed (and on occasion isomorphically 
rediscovered by other personality theorists). However, with 
the exceptions of Erikson and Sullivan, the neo-analytic 
theories summarized in Table 5.3 have comparatively few 
adherents today, and they do not receive much attention within 
the clinical and research communities. 

Erikson’s (1963, 1968) psychosocial approach continues 
to have a strong impact on personality and developmental 
research (Franz & White, 1985). Sullivan’s (1953, 1956) 
interpersonal theory not only helped lay the groundwork for 


object relations theory and self psychology (described later 
in this chapter), but continues to influence developmental 
research on adolescence (Galatzer-Levy & Cohler, 1993), 
as well as psychodynamic writing on treatment of severe 
pathology (Kernberg, 1984; Millon, 1996). 


Object Relations Theory and Self Psychology 


Although the influence of most neo-analytic models has 
waned, two other psychodynamic frameworks that evolved 
out of Freud’s work—object relations theory and self 
psychology—remain very much a part of mainstream psy- 
choanalytic theory and practice. Both frameworks developed 
out of early work in ego psychology, an offshoot of the clas- 
sical model; this model updated Freud’s thinking on the role 
of the ego in personality development. Where Freud had con- 
ceptualized the ego primarily in terms of its reality-testing 
and defensive functions, ego psychologists posited that the 
ego plays another equally important role in intrapsychic 
life—setting goals, seeking challenges, striving for mastery, 
and actualizing potential (Hartmann, 1964). Within this line 
of thinking, the ego was seen as an autonomous, conflict-free 
structure, rather than an entity that simply responded to 
the demands of id, superego, and the external world. Ego psy- 
chologists’ reconceptualization of the ego set the stage for 
object relations theory and self psychology. 


Object Relations Theory 


Although there are several distinct variants of object relations 
theory (see Greenberg & Mitchell, 1983), they share a core be- 
lief that personality can be analyzed most usefully by examin- 
ing mental representations of significant figures (especially 
the parents) that are formed early in life in response to interac- 
tions taking place within the family (Gill, 1995; Winnicott, 
1971). These mental representations (sometimes called intro- 
jects) are hypothesized to serve as templates for later interper- 
sonal relationships, allowing the individual to anticipate the 
responses of other people and draw reasonably accurate infer- 
ences regarding others’ thoughts, feelings, goals, and motiva- 
tions (Sandler & Rosenblatt, 1962). Mental representations of 
the parents—parental introjects—also allow the individual to 
carry on an inner dialogue with absent figures. This inner dia- 
logue helps modulate anxiety and enables the person to make 
decisions consistent with values and beliefs acquired early in 
life (Fairbairn, 1952; Jacobson, 1964). 

One of the most prominent object relations models of per- 
sonality today is Blatt’s (1974, 1991) anaclitic-introjective 
framework. Blending psychoanalytic theory with research in 
cognitive development, Blatt postulated that the structure of 
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an individual’s parental introjects play a key role in personal- 
ity development and dynamics. When introjects are weak (or 
even absent), an anaclitic personality configuration results, 
characterized by dependency, insecurity, and feelings of 
helplessness and emptiness. When introjects are harsh and 
demanding, an introjective personality configuration is pro- 
duced, characterized by feelings of guilt, failure, worthless- 
ness, and self-loathing. A plethora of studies have shown that 
Blatt’s anaclitic-introjective distinction helps predict risk for 
psychopathology and physical illness, the form that psy- 
chopathology and illness will take, the kinds of stressful 
events that are likely to be most upsetting to the individual, 
and the types of interventions that will effect therapeutic 
change most readily (Blatt & Homann, 1992; Blatt & Zuroff, 
1992). 


Self Psychology 


Self psychologists share object relations theorists’ emphasis 
on mental representations as the building blocks of personal- 
ity. However, self psychologists contend that the key introjects 
are those associated with the self, including selfobjects (i.e., 
representations of self and others that are to varying degrees 
merged, undifferentiated, and imperfectly articulated). Self 
psychology developed in part in response to analysts’ interest 
in treating severe personality disorders and other treatment- 
resistant forms of psychopathology (Goldberg, 1980; Kohut, 
1971). The development of self psychology was also aided by 
a recognition that the knowledge base of analytic theory and 
practice could be enriched if greater attention were paid to the 
ontogenesis of the self in the context of early child-caregiver 
relationships (see Mahler, Pine, & Bergman, 1975). 

The most widely known self psychology framework was 
first described by Kohut (1971, 1977). Kohut postulated that 
empathic and supportive early interactions resulted in the 
construction of a secure, cohesive autonomous self, with 
sufficient resources to deal with the stresses and challenges of 
intimacy. In contrast, disturbances in infant-caregiver interac- 
tions were hypothesized to result in damage to the self along 
with impairments in evocative constancy (i.e., the ability to 
generate stable mental images of self and absent others) and 
an inability to tolerate true intimacy with others. A variety of 
narcissistic disorders result from damage to the self—and al- 
though these narcissistic disorders range in severity from 
moderate to severe, all reflect the individual’s inability to 
maintain a cohesive sense of self, except when recapitulating 
specific (often destructive) interaction patterns. Empirical 
data testing Kohut’s model are less plentiful than those as- 
sessing various object relations frameworks, but studies offer 
indirect support for Kohut’s contention that early difficulties 
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within the infant-caregiver unit result in subsequent character 
pathology and may predict the form that character pathol- 
ogy will take (Galatzer-Levy & Cohler, 1993; Masling & 
Bornstein, 1993). 


Contemporary Integrative Models 


Object relations theory and self psychology have revived aca- 
demic psychologists’ interest in psychodynamic ideas during 
the past several decades, in part because they represent nat- 
ural bridges between psychoanalytic theory and research in 
other areas of psychology (e.g., cognitive, social, develop- 
mental; see Barron, Eagle, & Wolitzky, 1992; Masling & 
Bornstein, 1994; Shapiro & Emde, 1995). While object rela- 
tions theory and self psychology continue to flourish, a paral- 
lel stream of theoretical work has developed that focuses on 
integrating psychodynamic models of personality with ideas 
and findings from competing clinical frameworks. 

As Figure 5.1 shows, contemporary integrative psychody- 
namic models draw from both object relations theory and self 
psychology (and to some extent, from classical psychoana- 
lytic theory as well). Unlike most earlier psychodynamic the- 
ories, however, these integrative frameworks utilize concepts 
and findings from other schools of clinical practice (e.g., cog- 
nitive, behavioral, humanistic) to refine and expand their 
ideas. Some integrative models have gone a step further, 
drawing upon ideas from neuropsychology and psychophar- 
macology in addition to other, more traditional areas. 

There are almost as many integrative psychodynamic mod- 
els as there are alternative schools of psychotherapeutic 
thought. Among the most influential models are those that link 
psychodynamic thinking with concepts from cognitive ther- 
apy (Horowitz, 1988; Luborsky & Crits-Christoph, 1990), be- 
havioral therapy (Wachtel, 1977), and humanistic-existential 
psychology (Schneider & May, 1995). Other integrative mod- 
els combine aspects of psychoanalysis with strategies and 
principles from family and marital therapy (Slipp, 1984). 
Needless to say, not all analytically oriented psychologists 
agree that these integrative efforts are productive or desirable. 
Moreover, the question of whether these integrative frame- 
works are truly psychoanalytic or have incorporated so many 
nonanalytic principles as to be something else entirely is 
a matter of considerable debate within the psychoanalytic 
community. 


PSYCHOANALYTIC PERSONALITY THEORIES: 
BRINGING ORDER TO CHAOS 


Given the burgeoning array of disparate theoretical per- 
spectives, a key challenge confronting psychodynamic theo- 
rists involves finding common ground among contrasting 


viewpoints. Although there are dozens of psychodynamically 
oriented models of personality in existence today, all these 
models have had to grapple with similar theoretical and 
conceptual problems. In the following sections, I discuss how 
contemporary psychodynamic models have dealt with three 
key questions common to all personality theories. 


Personality Processes and Dynamics 


Three fertile areas of common ground among psychodynamic 
models of personality involve motivation, mental structure 
and process, and personality stability and change. 


Motivation 


With the possible exception of the radical behavioral ap- 
proach, every personality theory has addressed in detail the 
nature of human motivation—that set of unseen internal 
forces that impel the organism to action (see Emmons, 1997; 
Loevinger, 1987; McAdams, 1997). Although classical psy- 
choanalytic theory initially conceptualized motivation in 
purely biological terms, the history of psychoanalysis has 
been characterized by an increasing emphasis on psycholog- 
ical motives that are only loosely based in identifiable physi- 
ological needs (Dollard & Miller, 1950; Eagle, 1984). 

During the 1940s and 1950s, evidence from laboratory 
studies of contact-deprived monkeys (Harlow & Harlow, 
1962) and observational studies of orphaned infants from 
World War II (Spitz, 1945, 1946) converged to confirm that 
human and infrahumans alike have a fundamental need for 
contact comfort and sustained closeness with a consistent 
caregiver. Around this time, developmental researchers were 
independently formulating theories of infant-caregiver at- 
tachment that posited a separate need to relate to the primary 
caregiver of infancy and specified the adverse consequences 
of disrupted early attachment relationships (Ainsworth, 1969, 
1989; Bowlby, 1969, 1973). 

Object relations theorists and self psychologists integrated 
these developmental concepts and empirical findings into 
their emerging theoretical models, so that by the late 1960s 
most psychodynamic psychologists assumed the existence of 
one or more psychological drives related to contact comfort 
(e.g., Kernberg, 1975; Kohut, 1971; Winnicott, 1971). Theo- 
rists emphasized the critical importance of interactions that 
take place within the early infant-caregiver relationship, not 
only because these interactions determined the quality of con- 
tact comfort available to the infant, but also because positive 
interactions with a nurturing caregiver were necessary for the 
construction of a cohesive sense of self (Kohut, 1971; Mahler 
et al., 1975); stable, benevolent introjects (Blatt, 1974, 1991); 


and useful mental models of self-other interactions (Main, 
Kaplan, & Cassidy, 1985). 


Mental Structure and Process 


Along with psychoanalysts’ recognition that mental images 
of self and others were key building blocks of person- 
ality came a change in the way the structures and processes 
of personality were conceptualized. Terms like introject, 
schema, and object representation gradually took their place 
alongside those of Freud’s structural model as cornerstones 
of psychoanalytic theory and therapy (Bornstein, 1996; 
Greenberg & Mitchell, 1983). Analysts recognized that in ad- 
dition to mental images of self and others, a key derivative of 
early relationships was the formation of internal working 
models of self-other interactions (sometimes identified as 
scripts). This alternative conceptualization of the nature of 
mental structure not only enabled psychodynamic theorists 
to derive new treatment approaches (especially for working 
with character-disordered patients), but also helped con- 
nect psychodynamic models with research in attachment the- 
ory and social cognition (Galatzer-Levy & Cohler, 1993; 
Masling & Bornstein, 1994). 

This language shift not only was due to theoretical changes, 
but also reflected a need to develop a psychoanalytic terminol- 
ogy that was less abstract and closer to the day-to-day experi- 
ence of psychoanalytic patients. In fact, close analysis of 
psychoanalytic discourse during the early days of object rela- 
tions theory indicated that this terminological evolution was 
already underway, regardless of the fact that some newfound 
language was only gradually becoming formalized within the 
extant psychoanalytic literature. 

In this context, Mayman (1976) noted that at any given 
time, a psychoanalytic theorist or practitioner may use sev- 
eral different levels of discourse to communicate theoretical 
concepts. At the top of this framework is psychoanalytic 
metapsychology—the complex network of theoretical con- 
cepts and propositions that form the infrastructure of psycho- 
analysis. Metapsychological terms are often abstract, rarely 
operationalizable, and typically used in dialogue with other 
theorists and practitioners. The concepts of libido and selfob- 
ject are examples of language most closely associated with 
psychoanalytic metapsychology. 

The middle-level language of psychoanalysis incorporates 
the constructs used by theorists and practitioners in their own 
day-to-day work. It is the language in which psychoanalysts 
conceptualize problems and communicate informally—the 
kind of language likely to turn up in the heart of a case study 
or in a set of clinical notes. The terms oral dependent and 
sublimation are examples of the middle-level language of 
psychoanalysis. 


Psychoanalytic Personality Theories: Bringing Order to Chaos 125 


The bottom level of psychoanalytic language centers on 
the experience-near discourse that characterizes therapist- 
patient exchanges within an analytic session. Less formal than 
Mayman’s (1976) middle-level language, this experience- 
near discourse is intended to frame psychoanalytic concepts 
in a way that resonates with a patient’s personal experience 
without requiring that he or she have any understanding of 
psychoanalytic metapsychology. When an analyst discusses a 
patient’s “aggressive impulses” or “sibling rivalry,” that ana- 
lyst has translated an abstract concept into experience-near 
terms. 

Thus, like most personality theorists, psychoanalysts 
today conceptualize mental structures and processes on 
several levels simultaneously. Unfortunately, it has taken 
psychoanalytic psychologists a long time to develop an expe- 
rience-near language for day-to-day work—longer perhaps 
than it has taken psychologists in other areas. On the positive 
side, however, in recent years psychoanalytic theorists have 
addressed this issue more openly and systematically than 
have theorists from other theoretical backgrounds (e.g., see 
Horowitz, 1991; Kahn & Rachman, 2000). 


Personality Stability and Change 


The parallel conceptualization of psychoanalytic concepts in 
relational terms introduced a fundamentally new paradigm 
for thinking about continuity and change in personality de- 
velopment and dynamics. In addition to being understood in 
terms of a dynamic balance among id, ego, and superego, 
stability in personality was now seen as stemming from con- 
tinuity in the core features of key object representations (in- 
cluding the self-representation; see Blatt, 1991; Bornstein, 
1996). In this context, personality change was presumed to 
occur in part because internalized representations of self and 
other people changed as a result of ongoing inter- and intra- 
personal experiences (Schafer, 1999). 

This alternative framework influenced psychoanalytic the- 
ories of normal personality development and led to a plethora 
of studies examining the intrapsychic processes involved in 
therapeutic resistance, transference, and cure (Blatt & Ford, 
1994; Luborsky & Crits-Christoph, 1990). It also called theo- 
rists’ attention to the critical importance of present-day expe- 
riences in moderating long-term psychodynamic processes. 
One important consequence of newfound concepts of person- 
ality stability and change was a continuing shift from past to 
present in the study of psychodynamics (Spence, 1982). 


Insight, Awareness, and Coping 


As noted earlier, a key tenet of all psychodynamic models is 
that unconscious processes are primary determinants of 
thought, emotion, motivation, and behavior. To the degree that 
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people have only limited introspective access to these under- 
lying causes, they have only limited control over these 
processes as well. In part as a consequence of their emphasis 
on unconscious processes, psychodynamic theorists are unan- 
imous in positing that a certain degree of self-deception is 
characteristic of both normal and abnormal functioning: Not 
knowing why we are driven to behave in a certain way, but 
needing to explain our behavior to ourselves, we generate 
explanations that may or may not have anything to do with the 
real causes of behavior (e.g., see Bornstein, 1999b). More- 
over, when feelings, thoughts, and motivations produce anxi- 
ety (including guilt), we invoke coping strategies called ego 
defenses to minimize these negative reactions and to hide 
them from ourselves (Cramer, 2000). 

The once-radical notion of defensive self-deception is 
now widely accepted among psychoanalytic and nonpsycho- 
analytic psychologists alike. Research in social cognition 
(attribution theory in particular) confirms that systematic, pre- 
dictable distortions in our perceptions of self and others are a 
normal part of everyday life (Kihlstrom, 1987; Robins & John, 
1997). Although the language of attribution theory differs 
substantially from that of psychoanalysis, scrutiny reveals a 
remarkable degree of convergence between these two frame- 
works. Moreover, researchers have begun to bridge the gap be- 
tween these ostensibly divergent theoretical perspectives, 
uncovering a surprising degree of overlap in the process. 

One area in which psychodynamic models of defensive 
self-deception diverge from social psychological models of 
this phenomenon is in the explanations of why these distor- 
tions occur. Although both models agree that these distortions 
stem largely (but not entirely) from self-protective processes, 
only psychoanalytic theories explicitly link these distor- 
tions to an identifiable set of unconsciously determined strate- 
gies termed ego defenses. Social cognitive researchers have 
tended to favor explanatory models that emphasize limitations 
in the human information-processing apparatus and mental 
shortcuts that arise from the need to process multiple sources 
of information simultaneously as key factors in our cognitive 
biases and distortions of self and others (Robins & John, 
1997). Recent work in terror management theory represents a 
potential bridge between psychodynamic and social-cognitive 
work in this area, insofar as the terror management theory 
model specifies how distortions in inter- and intrapersonal 
perception simultaneously reflect defensive processes and 
information-processing limitations (Pyszczynski, Greenberg, 
& Solomon, 1999). 

Ironically, the concept of the ego defense—now central to 
psychodynamic models of personality—did not receive much 
attention during the theory’s formative years. In fact, Janet 
paid greater attention to the defense concept than Freud did 


(Perry & Laurence, 1984), and in certain respects Janet’s po- 
sition regarding this issue has turned out to be more accurate 
than Freud’s has (see Bowers & Meichenbaum, 1984). Evi- 
dence suggests that a conceptualization of defensive activity 
as narrowing of consciousness may be more valid and heuris- 
tic than is the classic psychoanalytic conceptualization of de- 
fense in terms of exclusion (or barring) of material from 
consciousness (Cramer, 2000; cf. Erdelyi, 1985). 

Although Freud discussed certain ego defenses (e.g., re- 
pression, projection, sublimation) in his theoretical and clini- 
cal writings, it was not until Anna Freud’s (1936) publication 
of The Ego and the Mechanisms of Defense that any effort was 
made to create a systematic, comprehensive listing of these 
defensive strategies. Most of the ego defenses discussed by A. 
Freud continue to be discussed today, although some have 
fallen out of favor, and new ones have been added as empiri- 
cal research on defenses began to appear following A. Freud’s 
(1936) seminal work. 

In the decades following A. Freud’s (1936) publication, 
several alternative methods for conceptualizing ego defenses 
were offered. The most influential of these are summarized in 
Table 5.4. As Table 5.4 shows, differences among the individ- 
ual defense, defense style, and defense cluster models have 
less to do with the way that specific defensive processes are 
conceptualized and more to do with how these processes 
are organized and relate to one another. Each approach to 
conceptualizing and organizing ego defenses has its own as- 
sociated measurement strategy (technique), its own research 
base, and its own adherents within the discipline. 

The combined influences of unconscious processes and ego 
defenses raise the unavoidable question of whether within the 


TABLE 5.4 Perspectives on Ego Defenses 


Perspective Key Contributors Key Terms 


Individual S. Freud, A. Freud 


defenses 


Specific defenses: 
Repression 
Projection 
Denial 
Sublimation 
Displacement 

Defense styles: 
Reversal 
Projection 
Principalization 

Turning against object 
Turning against self 

Defense levels-clusters: 
Adaptive-mature 
Maladaptive-immature 
Image distorting 
Self-sacrificing 


Defense style Thilevich & Gleser 


approach 


Defense Vaillant 


levels-clusters 


Note. Detailed discussions of these three perspectives are provided by 
Cramer (2000), Ihilevich & Gleser (1986, 1991), and Vaillant (1986). 
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psychodynamic framework humans are seen as inherently 
irrational creatures. Like most questions in psychoanalysis, 
this one has more than one answer. On the one hand, humans 
are indeed irrational—driven by forces they do not understand, 
their thoughts and feelings are distorted in ways they cannot 
control. On the other hand, humans are as rational as can be ex- 
pected given the constraints of their information-processing 
skills, their need to cope with and manage anxiety, and the 
adaptations necessary to survive in an unpredictable, threaten- 
ing world. Within the psychodynamic framework, all humans 
are irrational, but most are irrational in a rational way. 


Normal and Pathological Functioning 


As any psychologist knows, all humans may be irrational, 
but some are more irrational than others. Like most person- 
ality theorists, psychoanalysts see psychopathology as re- 
flected in a greater-than-expected degree of self-destructive, 
self-defeating Oe, irrational) behavior (Millon, 1996). In 
most psychodynamic frameworks, psychopathology is also 
linked with increased self-deception, decreased insight into 
the underlying causes of one’s behavior, and concomitant 
limitations in one’s ability to modify dysfunctional inter- 
action patterns and alter self-defeating responses (Eagle, 
1984). 

Psychodynamic models conceptualize psychopathology 
in terms of three general processes: (a) low ego strength, 
(b) maladaptive ego defenses, and (c) dysfunctional introjects. 
Low ego strength contributes to psychopathology because the 
ego cannot execute reality testing functions adequately; intra- 
and interpersonal distortions increase. Maladaptive defenses 
prevent the individual from managing stress and anxiety ade- 
quately leading to higher levels of self-deception, increased 
perceptual bias, and decreased insight. Dysfunctional intro- 
jects (including a distorted or deficient self-representation) 
similarly lead to inaccurate perceptions of self and others, but 
they also foster dysfunctional interaction patterns and propa- 
gate problematic interpersonal relationships. 

A key premise of the psychoanalytic model of psy- 
chopathology is that psychological disorders can be divided 
into three broad levels of severity (Kernberg, 1970, 1975). 
The classic conceptualization of this three-level framework 
invokes the well-known terms neurosis, character disorder, 
and psychosis. In most instances, neuroses are comparatively 
mild disorders which affect only a few areas of functioning 
(e.g., phobias). Character disorders are more pervasive, long- 
standing disorders associated with problematic social rela- 
tionships, distorted self-perception, and difficulties with 
impulse control (e.g., borderline personality disorder). Psy- 
choses are characterized by severely impaired reality testing 


TABLE SS Levels of Psychopathology in Psychodynamic Theory 


Level Ego Strength Ego Defenses Introjects 
Neurosis High Adaptive-mature Articulated- 
(displacement, differentiated 
sublimation) and benign 
Character Variable Maladaptive-immature ` Quasi-articulated, 
disorder (denial, projection) malevolent, or both 
Psychosis Low Maladaptive-immature ` Unarticulated- 
or nonexistent undifferentiated 


and malevolent 


and low levels of functioning in many areas of life (e.g., 
schizophrenia). 

Although this tripartite model is both theoretically heuris- 
tic and clinically useful, it is important not to overgeneralize 
regarding differences among different levels of functioning. 
There are great variations in both severity and chronicity 
within a given level (e.g., certain neuroses may be more debil- 
itating than an ostensibly more severe personality disorder). 
In addition, there is substantial comorbidity—both within and 
between levels—so that a disordered individual is likely to 
show multiple forms of psychopathology (Bornstein, 1998; 
Costello, 1995). 

As Table 5.5 shows, all three dimensions of intrapsychic 
dysfunction—low ego strength, maladaptive defenses, and 
dysfunctional introjects—can be mapped onto the tripartite 
psychopathology model. In this respect, the model represents 
an integrative framework that links different psychodynamic 
processes and connects the psychoanalytic model with con- 
temporary diagnostic research. Although the term neurosis is 
rarely used today in mainstream psychopathology research, 
perusal of contemporary diagnostic frameworks (including 
the DSM-IV; APA, 1994) confirms that the tripartite model 
has had a profound influence on the way practitioners con- 
ceptualize and organize psychological disorders (see also 
Masling & Bornstein, 1994, and Millon, 1996, for discus- 
sions of this issue). 


PSYCHOANALYSIS AND CONTEMPORARY 
PSYCHOLOGY: RETROSPECT AND PROSPECT 


Psychodynamic models of personality occupy a unique place 
in contemporary psychology. On the one hand, they continue 
to be roundly criticized—perceived by those within and out- 
side the discipline as untested and untestable and denigrated 
by skeptics as a quasi-phrenological pseudoscience that has 
hindered the progress of both scientific and clinical psychol- 
ogy. On the other hand, Freud’s theory continues to fascinate 
many, occupying a central place in undergraduate and gradu- 
ate psychology texts and influencing in myriad ways our 
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understanding of ourselves and our culture. In these final sec- 
tions, I discuss the place of psychoanalysis in contemporary 
psychology and speculate about its future. 


Testing Psychoanalytic Theories 


Within the psychoanalytic community, few issues are as 
controversial as the nature of evidence in psychoanalysis 
(see Grunbaum, 1984, for a detailed discussion of this issue). 
Because psychoanalysis focuses on the in-depth understand- 
ing of individuals, many of the theory’s adherents argue that 
research aimed at confirming general principles of human 
functioning is of little value (e.g., see Gedo, 1999). Others 
maintain that without a strong nomothetic research base, psy- 
chodynamic theory can never be refined and updated based 
on our evolving understanding of brain, mind, and behavior 
(Bornstein, 2001). 

The controversy regarding the nature of psychoanalytic 
evidence dates almost to the inception of the theory itself. 
Although Freud started his career as a researcher, his attitude 
toward traditional scientific methods became increasingly 
dismissive as time went on (Fisher & Greenberg, 1996; 
Masling & Schwartz, 1979). By the 1920s, psychoanalytic 
theory had become quite distant from its roots in the natural 
sciences. With this distancing came an increasing discomfort 
with traditional nomothetic research methods and a shift 
toward idiographic data, which most theorists and practi- 
tioners saw as being ideally suited to both testing and refin- 
ing psychoanalytic hypotheses via close analysis of clinical 
material. 

Psychoanalytic theories of personality continue to be 
strongly influenced by data obtained in the treatment setting. 
The case reports of psychoanalytic practitioners are still used 
to formulate general principles of psychopathology, after 
which these case-derived general principles are reapplied to 
new cases. Although for many years psychoanalytic psycholo- 
gists accepted the heuristic value of case studies with little out- 
ward resistance, this situation is changing, and contemporary 
theorists and researchers have begun to question the near- 
exclusive emphasis on case material in psychoanalytic theory- 
building (Bornstein, 2001; Bornstein & Masling, 1998). 

Although psychodynamic theorists have tended to place 
the greatest value on material derived from the psychoana- 
lytic treatment session, other forms of idiographic evidence 
(e.g., anthropological findings, literary records) have also 
been used to assess psychoanalytic ideas. Needless to say, 
psychodynamic theorists’ devotion to idiographic methods 
has led to widespread criticism from within and outside 
psychology. Proponents of the nomothetic approach maintain 
that idiographic data—especially those obtained behind 


closed doors—are neither objective nor replicable, and pro- 
vide little compelling evidence for the validity of psychoana- 
lytic concepts or the efficacy of psychoanalytic treatment 
(Crews, 1998; Macmillan, 1996). 


The Researcher-Practitioner Split 


A noteworthy difference between psychoanalysis and other 
models of personality becomes apparent when one contrasts 
the theoretical orientations of practitioners with those of 
academics. Although there are few practicing psychoanalysts 
outside large metropolitan centers, a sizable minority of 
clinical psychologists acknowledge the impact of psychody- 
namic principles on their day-to-day clinical work (Norcross, 
Karg, & Prochaska, 1997). In contrast, few personality re- 
searchers are openly psychodynamic despite the fact that 
many concepts in contemporary nonanalytic models of per- 
sonality are rooted to varying degrees in psychodynamic 
ideas (Bornstein, 2001). 

This researcher-practitioner divide is in part political. 
During the 1960s and 1970s, behavioral, cognitive, and hu- 
manistic personality theorists deliberately distanced them- 
selves from psychoanalytic theory. For behaviorists, this 
distancing was a product of their core assumptions and be- 
liefs, which clearly conflict with those of psychoanalysis. For 
cognitivists and humanists, however, the split with psycho- 
analysis was aimed at enhancing the status of their theories. 
During this era, it was important for these burgeoning models 
to distinguish themselves from long-standing psychoanalytic 
principles in order to assert the uniqueness of their perspec- 
tives. Even when parallel concepts arose in these models, the- 
orists emphasized the differences from psychoanalysis rather 
than focusing on their commonality. 

The situation has changed somewhat in recent years: 
Now that the cognitive and humanistic perspectives are 
well-established, there has been a slow and subtle reconcilia- 
tion with Freudian ideas. In the case of humanistic psy- 
chology, there has even some explicit acknowledgment of 
the discipline’s Freudian roots. Even contemporary trait 
approaches—which have historically been strongly bound to 
the biological and psychometric traditions—have begun to 
integrate psychodynamic principles into their models and 
methods (e.g., see Pincus & Wilson, 2001). 


Freud’s Cognitive Revolution 


The theory that upended mainstream neuroscience a century 
ago has had a significant impact on cognitive psychology 
within the past two decades. Although the synergistic inter- 
change between these two fields dates back at least to the 
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1960s, the impact of Freud’s cognitive revolution only be- 
came widely accepted with the publication of Erdelyi’s 
(1985) landmark analysis of the interface between cognitive 
psychology and psychoanalysis. Erdelyi’s work demon- 
strated that many psychoanalytic concepts dovetailed well 
with prevailing models of perception, memory, and informa- 
tion processing, and set the stage for an increasingly produc- 
tive interchange between psychodynamic researchers and 
cognitive psychologists (e.g., see Bucci, 1997; Horowitz, 
1988; Stein, 1997). 

The language of the topographic model—conscious, un- 
conscious, and preconscious—continues to be used to a sur- 
prising degree, even by researchers unaffiliated with (and 
often unsympathetic to) Freudian ideas. Moreover, recent re- 
search in perception without awareness, implicit learning, 
and implicit memory draws heavily from psychodynamic 
concepts (Bornstein & Masling, 1998; Bornstein & Pittman, 
1992). Despite psychoanalysts’ long-standing resistance to 
nomothetic research methods, psychoanalytic principles have 
undeniably been affected by laboratory research in these 
other related areas. 

Although it was largely unacknowledged at the time, the 
integration of psychoanalysis and cognitive psychology was 
central to the development of object relations theory and re- 
sulted in substantive reconceptualization of such traditional 
psychoanalytic concepts as transference, repression, and 
screen (or false) memories (Bornstein, 1993; Bowers, 1984; 
Eagle, 2000; Epstein, 1998). As cognitive psychology contin- 
ues to integrate findings from research on attitudes and emo- 
tion (resulting in the study of hot, or affect-laden cognitions), 
the psychodynamics of perception, memory, and information 
processing are increasingly apparent. 

A likely consequence of this ongoing integration will be 
the absorption of at least some psychodynamic principles 
into models of problem solving, concept formation, and 
heuristic use. Studies confirm that systematic distortions and 
biases in these mental processes are due in part to constraints 
within the human information-processing system (Gilovich, 
1991), but this does not preclude the possibility that motiva- 
tional factors (including unconscious motives and their asso- 
ciated implicit memories) may also influence psychological 
processes that were once considered largely independent of 
personality and psychopathology factors (McClelland, 
Koestner, & Weinberger, 1989). 


Developmental Issues 


A second domain of contemporary psychology that has been 
strongly influenced by psychodynamic models is the study of 
human development. There is a natural affiliation between 


developmental psychology and the psychodynamic emphasis 
on stages of growth, familial influences, and the formation of 
internal mental structures that structure and guide behavior 
(Eagle, 1996; Emde, 1992; Stern, 1985). Theorists in both 
areas have built upon and deepened this natural affiliation. 

In contrast to cognitive psychology, the exchange between 
psychoanalysis and developmental psychology has been 
openly acknowledged from the outset (see Ainsworth, 
1969, 1989). Moreover, the psychoanalysis—developmental- 
psychology interface is synergistic: Just as models of child 
and adolescent development have been affected by psycho- 
dynamic concepts, psychoanalytic models of personality for- 
mation and intrapsychic dynamics have been affected by 
developmental research on attachment, emotions, and cogni- 
tive development (Emde, 1992). At this point in the history of 
psychology, the proportion of developmental psychologists 
receptive to psychoanalytic ideas is probably higher than that 
found in any other subdiscipline of psychology (with the pos- 
sible exception of clinical psychology). 

Ironically, although Freud denied the existence of person- 
ality development postadolescence, there has been a surpris- 
ing amount of empirical research on the psychodynamics of 
aging. Beginning with Goldfarb’s (1963) work, theoreticians 
and researchers have explored myriad aspects of the psycho- 
dynamics of late-life development (e.g., see Ainsworth, 
1989; Galatzer-Levy & Cohler, 1993). With the advent of 
more sophisticated multistore models of memory, the links 
between psychodynamic processes and injury- and illness- 
based dementia have also been delineated. 


Psychoanalytic Health Psychology 


Over the years, psychoanalysis has had an ambivalent rela- 
tionship with health psychology (Duberstein & Masling, 
2000). In part, this situation reflects Freud’s own ambivalence 
regarding the mind-body relationship. After all, the great in- 
sight that led Freud to develop his topographic and structural 
models of the mind—in many ways, the raison d’étre of psy- 
choanalysis itself—was the idea that many physical symp- 
toms are the product of psychological conflicts rather than of 
organic disease processes (Bowers & Meichenbaum, 1984; 
Erdelyi, 1985). Freud’s early interest in conversion disorders 
and hysteria set the stage for a psychoanalytic psychology 
that emphasized mental—not physical—explanations for 
changes in health and illness states. 

Beginning in the 1920s, however, Deutsch (1922, 1924) 
and others argued that underlying psychodynamic processes 
could have direct effects on the body’s organ systems. The no- 
tion that unconscious dynamics could influence bodily func- 
tioning directly was extended and elaborated by Alexander 
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(1950, 1954), who developed a detailed theoretical framework 
linking specific psychodynamic processes with predictable 
physiological sequelae and illness states. When Sifneos 
(1972) articulated his empirically grounded, psychoanalyti- 
cally informed model of alexithymia (i.e., an inability to ver- 
balize emotions), the stage was set for the development of a 
truly psychoanalytic health psychology. The key hypotheses 
of Sifneos’s approach—that unverbalized emotions can have 
myriad destructive effects on the body’s organ systems— 
helped lay the groundwork for several ongoing health psy- 
chology research programs that are to varying degrees rooted 
in psychodynamic concepts. Research on health and hardiness 
(Kobasa, 1979), stress and coping (Pennebaker & O’ Heeron, 
1984), emotional disclosure and recovery from illness 
(Spiegel, Bloom, Kraemer, & Gottheil, 1989), and the “Type 
C” (cancer-prone) personality (Temoshok, 1987) are all based 
in part in psychodynamic models of health and illness. 


The Opportunities and Challenges of Neuroscience 


Some of the first contemporary efforts to integrate psychoan- 
alytic principles with findings from neuroscience involved 
sleep and dreams (Hobson, 1988; Winson, 1985). Although 
the language of Freudian dream theory is far removed from 
that of most neuropsychological models, work in this area has 
revealed a number of heretofore unrecognized convergences 
between the psychodynamics and neurology of dreaming. In 
fact, contemporary integrative models of dream formation 
now incorporate principles from both domains, setting the 
stage for extension of this integrative effort to other aspects 
of mental life. 

Neuroimaging techniques such as the computerized axial 
tomography (CAT) scan, the positron-emission tomography 
(PET) scan, and magnetic resonance imaging (MRI) have 
begun to play a leading role in this ongoing psychoanalysis- 
neuroscience integration. Just as neuroimaging techniques 
have allowed memory researchers to uncover the neural un- 
derpinnings of previously unseen encoding and retrieval 
processes, functional magnetic resonance imaging (fMRI) 
have enabled dream researchers to record on-line visual rep- 
resentations of cortical activity associated with different 
sleep stages and experiences. 

Two psychodynamically relevant issues now being studied 
via fMRI (functional MRI) and other neuroimaging techniques 
are unconscious processes (e.g., implicit perception and learn- 
ing) and psychological defenses (Schiff, 1999; Walla, Endl, 
Lindinger, & Lang, 1999). In general, evidence suggests that 
implicit processes are centered in mid- and hindbrain regions 
to a greater degree than are explicit processes—a finding 
that dovetails with Freud’s own hypotheses as well as with 


recent evolutionary interpretations of psychodynamic princi- 
ples (Slavin & Kriegman, 1992). Neuroimaging studies of de- 
fensive mental operations are still in their infancy, but 
preliminary findings suggests that the process of biasing and 
distorting previously-encoded information involves predictable 
patterns of cortical (and possibly subcortical) activation. 


CONCLUSION: THE PSYCHOLOGY 
OF PSYCHODYNAMICS AND THE 
PSYCHODYNAMICS OF PSYCHOLOGY 


Despite their limitations, psychodynamic models of person- 
ality have survived for more than a century, reinventing 
themselves periodically in response to new empirical find- 
ings, theoretical shifts in other areas of psychology, and 
changing social and economic forces. Stereotypes notwith- 
standing, psychodynamic models have evolved considerably 
during the twentieth century and will continue to evolve dur- 
ing the first decades of the twenty-first century as well. 

For better or worse, psychoanalytic theory may be the 
closest thing to an overarching field theory in all of psychol- 
ogy. It deals with a broad range of issues—normal and patho- 
logical functioning, motivation and emotion, childhood and 
adulthood, individual and culture—and although certain 
features of the model have not held up well to empirical test- 
ing, the model does have tremendous heuristic value and 
great potential for integrating ideas and findings in disparate 
areas of social and neurological science. 

More than a century ago, Freud (1895b) speculated that 
scientists would be resistant to psychoanalytic ideas because 
of the uncomfortable implications of these ideas for their 
own functioning. Whether or not he was correct in this re- 
gard, it is true that psychodynamic models of personality 
provide a useful framework for examining ourselves and our 
beliefs. Clinical psychologists have long used psychoana- 
lytic principles to evaluate and refine their psychotherapeu- 
tic efforts. Scientists have not been as open to this sort of 
self-scrutiny. There is, however, a burgeoning literature on 
the biases and hidden motivations of the scientist (Bornstein, 
1999a; Mahoney, 1985), and psychodynamic models of 
personality may well prove to contribute a great deal to this 
literature. 
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This chapter has several aims. One is that of considering the 
role of behaviorism and behavioral approaches in the fields of 
personality theory and measurement. A second and central 
aim is that of describing a particular and different behavioral 
approach to the fields of personality theory and personality 
measurement. A third concern is that of presenting some of 
the philosophy- and methodology-of-science characteristics 
of this behavioral approach relevant to the field of personal- 
ity theory. A fourth aim is to characterize the field of person- 
ality theory from the perspective of this philosophy and 
methodology of science. And a fifth aim is to project some 
developments for the future that derive from this theory per- 
spective. Addressing these aims constitutes a pretty full 
agenda that will require economical treatment. 


BEHAVIORAL APPROACHES AND PERSONALITY 


Behavioral approaches to personality might seem of central 
importance to personology because behaviorism deals with 
learning and it is pretty generally acknowledged that learning 
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affects personality. Moreover, behaviorist theories were 
once the models of what theory could be in psychology. But 
certain features militate against behaviorism’s significance 
for the field of personality. Those features spring from the tra- 
ditional behaviorist mission. 


Traditional Behaviorism and Personality 


One feature is behaviorism’s search for general laws. That is 
ingrained in the approach, as we can see from its strategy of 
discovering learning-behavior principles with rats, pigeons, 
dogs, and cats—for the major behaviorists in the first and sec- 
ond generation were animal psychologists who assumed that 
those learning-behavior principles would constitute a com- 
plete theory for dealing with any and all types of human 
behavior. John Watson, in behaviorism’s first generation, 
showed this, as B. F. Skinner did later. Clark Hull (1943) was 
quite succinct in stating unequivocally about his theory that 
“all behavior, individual and social, moral and immoral, nor- 
mal and psychopathic, is generated from the same primary 
laws” (p. v). Even Edward Tolman’s goal, which he later 
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admitted was unreachable, was to constitute through animal 
study a general theory of human behavior. The field of per- 
sonality, in contrast, is concerned with individual differences, 
with humans, and this represents a schism of interests. 

A second, even more important, feature of behaviorism 
arises in the fact that personality as conceived in personology 
lies within the individual, where it cannot be observed. That 
has always raised problems for an approach that placed scien- 
tific methodology at its center and modeled itself after logical 
positivism and operationism. Watson had decried as mentalis- 
tic the inference of concepts of internal, unobservable causal 
processes. For him personality could only be considered as 
the sum total of behavior, that is, as an observable effect, not 
as a cause. Skinner’s operationism followed suit. This, of 
course, produced another, even wider, schism with personol- 
ogy because personality is generally considered an internal 
process that determines external behavior. That is the raison 
d’étre for the study of personality. 

Tolman, who along with Hull and Skinner was one of the 
most prominent second-generation behaviorists, sought to 
resolve the schism in his general theory. As a behaviorist 
he was concerned with how conditioning experiences, the 
independent variable, acted on the organism’s responding, 
the dependent variable. But he posited that there was some- 
thing in between: the intervening variable, which also helped 
determine the organism’s behavior. Cognitions were interven- 
ing variables. Intelligence could be an intervening variable. 
This methodology legitimated a concept like personality. 

However, the methodology was anathema to Skinner. 
Later, Hull and Kenneth Spence (1944) took the in-between 
position that intervening variables should be considered just 
logical devices, not to be interpreted as standing for any real 
psychological events within the individual. These differences 
were played out in literature disputes for some time. That was 
not much of a platform for constructing psychology theory 
such as personology. The closest was Tolman’s consideration 
of personality as an intervening variable. But he never devel- 
oped this concept, never stipulated what personality is, never 
derived a program of study from the theory, and never em- 
ployed it to understand any kind of human behavior. Julian 
Rotter (1954) picked up Tolman’s general approach, however, 
and elaborated an axiomatic theory that also drew from Hull’s 
approach to theory construction. As was true for Hull, the ax- 
iomatic construction style of the theory takes precedence over 
the goal of producing a theory that is useful in confronting the 
empirical events to which the theory is addressed. 

To exemplify this characteristic of theory, Rotter’s so- 
cial learning has no program to analyze the psychometric 
instruments that stipulate aspects of personality, such as intel- 
ligence, depression, interests, values, moods, anxiety, stress, 
schizophrenia, or sociopathy. His social learning theory, 


moreover, does not provide a theory of what personality tests 
are and do. Nor does the theory call for the study of the learn- 
ing and functions of normal behaviors such as language, 
reading, problem-solving ability, or sensorimotor skills. The 
same is true with respect to addressing the phenomena of ab- 
normal behavior. For example, Rotter (1954) described the 
Minnesota Multiphasic Personality Inventory (MMPI) but in 
a very conventional way. There are no analyses of the differ- 
ent personality traits measured on the test in terms of their be- 
havioral composition or of the independent variables (e.g., 
learning history) that result in individual differences in these 
and other traits. Nor are there analyses of how individual dif- 
ferences in traits affect other people’s responses to the indi- 
viduals or of how individual differences in the trait in turn act 
on the individual’s behavior. For example, a person with a 
trait of paranoia is more suspicious than others are. What in 
behavioral terms does being suspicious consist of, how is that 
trait learned, and how does it have its effects on the person’s 
behavior and the behavior of others? The approach taken here 
is that a behavioral theory of personality must analyze the 
phenomena of the field of personality in this manner. Rotter’s 
social learning theory does not do these things, nor do the 
other social learning theories. 

Rather, his theory inspired academic studies to test his for- 
mal concepts such as expectancy, need potential, need value, 
freedom of movement, and the psychological situation. This 
applied even to the personality-trait concept he introduced, 
the locus of control—whether people believe that they them- 
selves, others, or chance determines the outcome of the situa- 
tions in which the individuals find themselves. Although it has 
been said that this trait is affected in childhood by parental re- 
ward for desired behaviors, studies to show that differential 
training of the child produces different locus-of-control char- 
acteristics remain to be undertaken. Tyler, Dhawan, and Sinha 
(1989) have shown that there is a class difference in locus of 
control (measured by self-report inventory). But this does not 
represent a program for studying learning effects even on that 
trait, let alone on the various aspects of personality. 

The social learning theories of Albert Bandura and Walter 
Mischel are not considered here. However, each still carries 
the theory-oriented approach of second-generation behavior- 
ism in contrast to the phenomena-oriented theory construction 
of the present approach. For example, there are many labora- 
tory studies of social learning theory that aim to show that 
children learn through imitation. But there are not programs to 
study individual differences in imitation, the cause of such 
differences, and how those differences affect individual 
differences in important behaviors (e.g., the ability to copy 
letters, learn new words, or accomplish other actual learning 
tasks of the child). Bandura’s approach actually began in a 
loose social learning framework. Then it moved toward a 


behavioral approach several years later, drawing on the ap- 
proach to be described here as well as the approach of Skinner, 
and later it moved toward including a more cognitive termi- 
nology. Mischel (1968) first took a Watsonian-Skinnerian 
approach to personality and assessment, as did other radical 
behaviorists. He later abandoned that position (Mischel, 
1973) but, like the other social learning theorists, offered no 
program for study stipulating what personality is, how it is 
learned, how it functions, and how personality study relates to 
psychological measurement. 

When all is said and done, then, standard behaviorism 
has not contributed a general and systematic program for the 
study of personality or personality measurement. It has fea- 
tures that interfere with doing so. Until they are overcome in 
a fundamental way (which Tolmanian social learning ap- 
proaches did not provide), those features represent an impass- 
able barrier. 


Behavior Therapy and Personality 


The major behaviorists such as Hull, Skinner, and Tolman 
were animal learning researchers. None of them analyzed the 
learning of functional human behaviors or traits of behavior. 
Skinner’s empirical approach to human behavior centered on 
the use of his technology, that is, his operant conditioning ap- 
paratus. His approach was to use this “experimental analysis 
of behavior’ methodology in studying a simple, repetitive 
response of a subject that was automatically reinforced (and 
recorded). That program was implemented by his students in 
studies reinforcing psychotic patients, individuals with mental 
retardation, and children with autism with edibles and such for 
pulling a knob. Lovaas (1977), in the best developed program 
among this group, did not begin to train his autistic children in 
language skills until after the psychological behaviorism (PB) 
program to be described had provided the foundation. Al- 
though Skinner is widely thought to have worked with chil- 
dren’s behavior, that is not the case. He constructed a crib for 
infants that was air conditioned and easy to clean, but the crib 
had no learning or behavioral implications or suggestions. He 
also worked with programmed learning, but that was a delim- 
ited technology and did not involve behavior analyses of the 
intellectual repertoires taught, and the topic played out after a 
few years. Skinner’s experimental analysis of behavior did not 
indicate how to research functional human behaviors or prob- 
lems of behavior or how they are learned. 


Behavior Therapy 


The original impetus for the development of behavior therapy 
(which in the present usage includes behavior modification, 
behavior analysis, cognitive behavior therapy, and behavior 
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assessment) does not derive from Hull, Skinner, Tolman, or 
Rotter, although they and Dollard and Miller (1950) helped 
stimulate a general interest in the possibility of applications. 
One of the original sources of behavior therapy came from 
Great Britain, where a number of studies were conducted of 
simple behavior problems treated by using conditioning prin- 
ciples, either classical conditioning or reinforcement. The 
learning framework was not taken from an American behav- 
iorist’s theory but from European developments of condition- 
ing principles. As an example, Raymond (see Eysenck, 1960) 
treated aman with a fetish for baby carriages by classical con- 
ditioning. The patient’s many photographs of baby carriages 
were presented singly as conditioned stimuli paired with an 
aversive unconditioned stimulus. Under this extended condi- 
tioning the man came to avoid the pictures and baby car- 
riages. The various British studies using conditioning were 
collected in a book edited by Hans Eysenck (1960). Another 
of the foundations of behavior therapy came from the work of 
Joseph Wolpe. He employed Hull’s theory nominally and 
loosely in several endeavors, including his systematic desen- 
sitization procedure for treating anxiety problems. It was his 
procedure and his assessment of it that were important. 

A third foundation of behavior therapy came from my PB 
approach that is described here. As will be indicated, it began 
with a very broad agenda, that of analyzing human behavior 
generally employing its learning approach, including behav- 
iors in the natural situation. Its goal included making analyses 
of and treating problems of specific human behavior problems 
of interest to the applied areas of psychology. Following sev- 
eral informal applications, my first published analysis of a be- 
havior in the naturalistic situation concerned a journal report 
of a hospitalized schizophrenic patient who said the opposite 
of what was called for. In contrast to the psychodynamic inter- 
pretation of the authors, the PB analysis was that the abnormal 
behavior was learned through inadvertent reinforcement given 
by the treating doctors. This analysis suggested the treat- 
ment—that is, not to reinforce the abnormal behavior, the op- 
posite speech, on the one hand, and to reinforce normal speech, 
on the other (Staats, 1957). This analysis presented what be- 
came the orientation and principles of the American behavior 
modification field: (a) deal with actual behavior problems, 
(b) analyze them in terms of reinforcement principles, (c) take 
account of the reinforcement that has created the problem be- 
havior, and (d) extinguish abnormal or undesirable behavior 
through nonreinforcement while creating normal behavior by 
reinforcement. 

Two years later, my long-time friend and colleague Jack 
Michael and his student Teodoro Ayllon (see Ayllon & 
Michael, 1959), used this analysis of psychotic behavior and 
these principles of behavior modification to treat behavioral 
symptoms in individual psychotic patients in a hospital. Their 
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study provided strong verification of the PB behavior modifi- 
cation approach, and its publication in a Skinnerian journal had 
an impact great enough to be called the “seeds of the behav- 
ioral revolution” by radical behaviorists (Malott, Whaley, & 
Malott, 1997, p. 175). Ayllon and Michael’s paper was written 
as though this approach derived from Skinnerian behaviorism 
and this error was repeated in many works that came later. For 
example, Fordyce (see 1990) followed Michael’s suggestion 
both in using the PB principles and in considering his pain 
theory to be Skinnerian. 

The study of child behavior modification began similarly. 
Following my development of the behavior modification prin- 
ciples with simple problems, I decided that a necessary step 
was to extend behavior analysis to more complex behavior 
that required long-term treatment. At UCLA (where I took my 
doctoral degree in general experimental and completed clini- 
cal psychology requirements) I had worked with dyslexic 
children. Believing that reading is crucially important to 
human adjustment in our society, I selected this as a focal 
topic of study—both remedial training as well as the original 
learning of reading. My first study—done with Judson Finley, 
Karl Minke, Richard Schutz, and Carolyn Staats—was ex- 
ploratory and was used in a research grant application I made 
to the U.S. Office of Education. The study was based on my 
view that the central problem in dyslexia is motivational. 
Children fail in learning because their attention and participa- 
tion are not maintained in the long, effortful, and nonreinforc- 
ing (for many children) learning task that involves thousands 
and thousands of learning trials. In my approach the child was 
reinforced for attending and participating, and the training 
materials I constructed ensured that the child would learn 
everything needed for good performance. Because reading 
training is so extended and involves so many learning trials, it 
is necessary to have a reinforcing system for the long haul, 
unlike the experimental analysis of behavior studies with 
children employing simple responses and M&Ms. I thus 
introduced the token reinforcer system consisting of poker 
chips backed up by items the children selected to work for 
(such as toys, sporting equipment, and clothing). When this 
token reinforcer system was adopted for work with adults, it 
was called the token economy (see Ayllon & Azrin, 1968) and, 
again, considered part of Skinner’s radical behaviorism. 

With the training materials and the token reinforcement, 
the adolescents who had been poor students became attentive, 
worked well, and learned well. Thus was the token methodol- 
ogy born, a methodology that was to be generally applied. 
In 1962 and 1964 studies we showed the same effect with 
preschool children first learning to read. Under reinforcement 
their attention and participation and their learning of reading 
was very good, much better than that displayed by the usual 


four-year-old. But without the extrinsic reinforcement, their 
learning behavior deteriorated, and learning stopped. In 
reporting this and the treatment of dyslexia (Staats, 1963; 
Staats & Butterfield, 1965; Staats, Finley, Minke, & Wolf, 
1964; Staats & Staats, 1962), I projected a program for using 
these child behavior modification methods in studying a wide 
variety of children’s (and adults’) problems. The later devel- 
opment of the field of behavior modification showed that this 
program functioned as a blueprint for the field that later devel- 
oped. (The Sylvan Learning Centers also use methods similar 
to those of PB’s reading treatments, with similar results.) 

Let me add that I took the same approach in raising my own 
children, selecting important areas to analyze for the applica- 
tion of learning-behavior principles to improve and advance 
their development as well as to study the complex learning in- 
volved. For example, in 1960 I began working with language 
development (productive and receptive) when my daughter 
was only several months old, with number concepts at the age 
of a year and a half, with reading at 2 years of age. I have 
audiotapes of this training with my daughter, which began 
in 1962 and extended for more than 5 years, and videotapes 
with my son and other children made in 1966. Other aspects 
of child development dealt with as learned behaviors include 
toilet training, counting, number operations, writing, walking, 
swimming, and throwing and catching a ball (see Staats, 
1996). With some systematic training the children did such 
things as walk and talk at 9 months old; read letters, words, 
sentences, and short stories at 2.5 years of age; and count 
unarranged objects at 2 years (a performance Piaget suggested 
was standard at the age of 6 years). The principles were also 
applied to the question of punishment, and I devised time-out 
as a mild but effective punishment, first used in the literature 
by one of my students, Montrose Wolf (Wolf, Risely, & Mees, 
1964). 

Traditional behaviorism was our background. However, 
the research developed in Great Britain and by Wolpe and by 
me and a few others constituted the foundation for the field of 
behavior therapy. And this field now contains a huge number 
of studies demonstrating that conditioning principles apply to 
a variety of human behavior problems, in children and adults, 
with simple and complex behavior. There can be no question 
in the face of our behavior therapy evidence that learning is a 
centrally important determinant of human behavior. 


The State of Personality Theory and Measurement 
in the Field of Behavior Therapy 


Behaviorism began as a revolution against traditional psy- 
chology. The traditional behaviorist aim in analyzing psy- 
chology’s studied phenomena was to show behaviorism’s 


superiority and that psychology’s approach should be aban- 
doned. In radical behaviorism no recognition is given still 
that work in traditional psychology has any value or that it 
can be useful in a unification with behaviorism. This charac- 
teristic is illustrated by the Association of Behavior Analy- 
sis’s movement in the 1980s to separate the field from the rest 
of psychology. It took a PB publication to turn this tide, but 
the isolationism continues to operate informally. Radical be- 
haviorism students are not trained in psychology, or even in 
the general field of behaviorism itself. While many things 
from the “outside” have been adopted by radical behavior- 
ism, some quite inconsistent with Skinner’s views, they are 
accepted only when presented as indigenous developments. 
Radical behaviorism students are taught that all of their fun- 
damental knowledge arose within the radical behaviorism 
program, that the program is fully self-sufficient. 

Psychological behaviorism, in conflict with radical behav- 
iorism, takes the different view: that traditional psychology 
has systematically worked in many areas of human behavior 
and produced valuable findings that should not be dismissed 
sight unseen on the basis of simplistic behaviorist method- 
ological positions from the past. Psychology’s knowledge 
may not be complete. It may contain elements that need to be 
eliminated. And it may need, but not include, the learning- 
behavior perspective and substance. But the PB view has been 
that behaviorism has the task of using traditional psychology 
knowledge, improving it, and behaviorizing it. In that process, 
behaviorism becomes psychologized itself, hence the name of 
the present approach. PB has aimed to discard the idiosyn- 
cratic, delimiting positions of the radical behaviorism tradi- 
tion and to introduce a new, unified tradition with the means to 
effect the new developments needed to create unification. 

An example can be given here of the delimiting effect of 
radical behaviorism with respect to psychological measure- 
ment. Skinner insisted that the study of human behavior was 
to rest on his experimental analysis of behavior (operant con- 
ditioning) methodology. Among other things he rejected self- 
report data (1969, pp. 77-78). Following this lead, a general 
position in favor of direct observation of specific behavior, 
not signs of behavior, was proposed by Mischel, as well as 
Kanfer, and Phillips, and this became a feature of the field of 
behavioral assessment. The view became that psychological 
tests should be abandoned in favor of Skinner’s experimental 
analysis of behavior methodology, an orientation that could 
not yield a program for unification of the work of the fields of 
personality and psychological measurement with behavior 
therapy, behavior analysis, and behavioral assessment. 

It may be added that PB, by contributing foundations to 
behavior therapy, had the anomalous effect of creating enthu- 
siasm for a radical behaviorism that PB in good part rejects. 
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For example, PB introduced the first general behavioral 
theory of abnormal behavior and a program for treatment 
applications (see Staats, 1963, chaps. 10 & 11), as well as a 
foundation for the field of behavioral assessment: 


Perhaps [this] rationale for learning [behavioral] psychotherapy 
will also have to include some method for the assessment of 
behavior. In order to discover the behavioral deficiencies, the re- 
quired changes in the reinforcing system [the individual’s emo- 
tional-motivational characteristics], the circumstances in which 
stimulus control is absent, and so on, evaluational techniques in 
these respects may have to be devised. Certainly, no two individ- 
uals will be alike in these various characteristics, and it may be 
necessary to determine such facts for the individual prior to be- 
ginning the learning program of treatment. 

Such assessment might take a form similar to some of the 
psychological tests already in use. . . . [H]owever, . . . a general 
learning rationale for behavior disorders and treatment will sug- 
gest techniques of assessment. (Staats, 1963, pp. 508-509) 


At that time there was no other broad abnormal psychology- 
behavioral treatment theory in the British behavior therapy 
school, in Wolpe’s approach, or in radical behaviorism. But 
PB’s projections, including creation of a field of behavioral 
assessment, were generally taken up by radical behaviorists. 
Thus, despite its origins within PB (as described in Silva, 
1993), the field of behavioral assessment was developed as a 
part of radical behaviorism. However, the radical behaviorism 
rejection of traditional psychological measurement doomed 
the field to failure. 

That was quite contrary to the PB plan. In the same work 
that introduced behavioral assessment, PB unified traditional 
psychological testing with behavior assessment. Behavior 
analyses of intelligence tests (Staats, 1963, pp. 407-411) and 
interest, values, and needs tests (Staats, 1963, pp. 293-306) 
were begun. The latter three types of tests were said to measure 
what stimuli are reinforcing for the individual. MacPhillamy 
and Lewinsohn (1971) later constructed an instrument to mea- 
sure reinforcers that actually put the PB analysis into practice. 
Again, despite using traditional rating techniques that Skinner 
(1969, pp. 77-78) rejected, they replaced their behavioral as- 
sessment instrument in a delimiting radical behaviorism 
framework. Thus, when presented in the radical behaviorism 
framework, this and the other behavioral assessment works 
referenced earlier were separated from the broader PB frame- 
work that included the traditional tests of intelligence, inter- 
ests, values, and needs and its program for general unification 
(Staats, 1963, pp. 304-308). 

The point here is that PB’s broad-scope unification orien- 
tation has made it a different kind of behaviorism in various 
fundamental ways, including that of making it a behaviorism 
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with a personality. The PB theory of personality is the only 
one that has been constructed on the foundation of a set of 
learning-behavior principles (Staats, 1996). Advancing in 
successive works, with different features than other personal- 
ity theories, only in its later version has the theory of person- 
ality begun to arouse interest in the general field of behavior 
therapy. It appears that some behavior therapists are begin- 
ning to realize that behaviorists “have traditionally regarded 
personality, as a concept, of little use in describing and pre- 
dicting behavior’ (Hamburg, 2000, p. 62) and that this is a 
liability. Making that realization general, along with under- 
standing how this weakens the field, is basic in effecting 
progress. 

As it stands, behavior therapy’s rejection of the concept of 
personality underlies the field’s inability to join forces with 
the field of psychological measurement. This is anomalous 
because behavior therapists use psychological tests even 
while rejecting them conceptually. It is anomalous also be- 
cause Kenneth Spence (1944), while not providing a concep- 
tual framework for bringing behaviorism and psychological 
testing together, did provide a behavioral rationale for the 
utility of tests. He said that tests produce R-R (response- 
response) laws—in which a test score (one response) is used 
to predict some later performance (the later response). It 
needs to be added that tests can yield knowledge of behavior 
in addition to prediction as we will see. 

This, then, is the state of affairs at present. Not one of the 
other behavioral approaches—radical behaviorism, Hullian 
theory, social learning theory, cognitive-behavioral theory— 
has produced or projected a program for the study of per- 
sonality and its measurement. That is a central reason why 
traditional psychology is alienated from behaviorism and 
behavioral approaches. And that separation has seriously dis- 
advantaged both behaviorism and traditional psychology. 


THE STATE OF THEORY IN THE FIELD 
OF PERSONALITY 


Thus far a critical look has been directed at the behaviorism 
positions with respect to the personality and psychological 
testing fields. This is not to say that those two fields are fulfill- 
ing their potential or are open to unification with any behav- 
ioral approach. Just as behaviorism has rejected personality 
and psychological measurement, so have the latter rejected 
behaviorism. Part of this occurs because traditional behavior- 
ism does not develop some mutuality of interest, view, or 
product. But the fields of psychological testing and personal- 
ity have had a tradition that considers genetic heredity as the 
real explanation of individual differences. Despite lip service 


to the contrary, these fields have never dealt with learning. So 
there is an ingrained mutual rejection. Furthermore, the lack 
of a learning approach has greatly weakened personality 
theory and measurement, substantively as well as method- 
ologically, as I will suggest. 

To continue, examination of the field of personology reveals 
it to be, at least within the present philosophy-methodology, 
a curiosity of science. For this is a field without guidelines, 
with no agreement on what its subject matter—personality—is 
and no concern about that lack of stipulation. It is accepted that 
there will be many definitions in the operating field. The only 
consensus, albeit implicit, is that personality is some process or 
structure within the individual that is a cause of the individual’s 
behavior. Concepts of personality range from the id, ego, and 
superego of Sigmund Freud, through the personal constructs of 
George Kelly and Carl Rogers’s life force that leads to the 
maintenance and enhancement of self, to Raymond B. Cattell’s 
source traits of sociability, intelligence, and ego strength, to 
mention a very few. 

Moreover, there is no attempt to calibrate one concept of 
personality with respect to another. In textbooks each person- 
ality theory is described separately without relating concepts 
and principles toward creating some meaningful relation- 
ships. There are no criteria for evaluating the worth of the 
products of the field, for comparing them, for advancing the 
field as a part of science. Each author of a theory of personal- 
ity is free to pursue her or his own goals, which can range from 
using factor analytic methods by which to establish relation- 
ships between test items and questionnaires to running pi- 
geons on different schedules of reinforcement. There will be 
little criticism or evaluation of empirical methods or strate- 
gies. All is pretty much accepted as is. There will be no critical 
consideration of the kind of data that are employed and evalu- 
ation of what the type of data mean about the nature of the 
theory. Other than psychometric criteria of reliability and va- 
lidity, there will be no standards of success concerning a test’s 
provision of understanding of the trait involved, what causes 
the trait, or how it can be changed. Also, the success of a per- 
sonality theory will not be assessed by the extent to which it 
provides a foundation for constructing tests of personality, 
therapies, or procedures for parents to employ. It is also not 
necessary that a personality theory be linked to other fields 
of study. 

Moreover, a theory in this field does not have the same 
types of characteristics or functions as do theories in the 
physical sciences. Those who consider themselves personal- 
ity theorists are so named either because they have created 
one of the many personality theories or because they have 
studied and know about one or more of the various existent 
theories. They are not theorists in the sense that they work on 


the various personality theories in order to improve the the- 
ory level of the field. They are not theorists in the sense that 
they study their field and pick out its weaknesses and errors 
in order to advance the field. They do not analyze the con- 
cepts and principles in different theories in order to bring 
order into the chaos of unrelated knowledge. They do not, for 
example, work on the large task of weaving the theories to- 
gether into one or more larger, more advanced, and more gen- 
eral and unified theories that can then be tested empirically 
and advanced. 

An indication of the mixed-up character of the field of per- 
sonality theory is the inclusion of Skinner’s experimental 
analysis of behavior as a personality theory in some textbooks 
on personality theory. This is anomalous because Skinner has 
rejected the concept of personality, has never treated the phe- 
nomena of personality, has had no program for doing so, and 
his program guides those who are radical behaviorists to ig- 
nore the fields of personality and its measurement. His find- 
ings concerning schedules of reinforcement are not used by 
personologists, nor are his students’ findings using the experi- 
mental analysis of behavior with human subjects nor his phi- 
losophy-methodology of science. His approach appears to be 
quite irrelevant for the field. What does it say about the field’s 
understanding of theory that the irrelevance of his theory does 
not matter? From the standpoint of the philosophy and 
methodology of PB, the field of personality is in a very primi- 
tive state as a science. 

To some extent the following sections put the cart before 
the horse because I discuss some theory needs of the field of 
personality before I describe the approach that projects those 
needs. That approach involves two aspects: a particular the- 
ory and a philosophy-methodology. The latter is the basis for 
the projections made in this section. This topic needs to be 
developed into a full-length treatment rather than the present 
abbreviation. 


The Need for Theorists Who Work the Field 


One of the things that reveals that the field of personality the- 
ory is not really part of a fully developed science is the lack of 
systematic treatment of the theories in the field. Many study 
the theories of the field and their empirical products. But that 
study treats the field as composed of different and indepen- 
dent bodies of knowledge to be learned. There is not even the 
level of integration of study that one would find in humanities, 
such as English literature and history, where there is much 
comparative evaluating of the characteristics of different 
authors’ works. 

If the field of personality theory is to become a real scien- 
tific study, we need theorists who work the field. Theories have 
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certain characteristics. They contain concepts and principles, 
and the theories deal with or derive from certain empirical data. 
And those concepts, principles, and data vary in types and in 
functions. With those differences, theories differ in method and 
content and therefore in what they can do and thus how they fit 
together or not. We need theorists who study such things and 
provide knowledge concerning the makeup of our field. What 
can we know about the field without such analysis? 

We need theorists who work the field in other ways also. 
For example, two scientific fields could be at the same level in 
terms of scientific methods and products. One field, however, 
could be broken up by having many different theorists, each of 
whom addresses limited phenomena and does so in idiosyn- 
cratic theory language, with no rules relating the many theo- 
ries. This has resulted in competing theories, much overlap 
among theories and the phenomena they address, and much 
redundancy in concepts and principles mixed in with real dif- 
ferences. This yields an unorganized, divided body of knowl- 
edge. Accepting this state provides no impetus for cooperative 
work or for attaining generality and consensus. 

The other hypothetical field has phenomena of equal com- 
plexity and difficulty, and it also began with the same unorga- 
nized growth of theory. But the field devoted part of its time 
and effort in working those theories, that is, in assessing what 
phenomena the various theories addressed, what their meth- 
ods of study were, what types of principles and concepts were 
involved, and where there was redundancy and overlap, as 
well as in comparing, relating, and unifying the different 
theory-separated islands of knowledge. The terms for the con- 
cepts and principles were standardized, and idiosyncrasy was 
removed. The result was a simpler, coherent body of knowl- 
edge that was also more general. That allowed people who 
worked in the field to speak the same language and to do re- 
search and theory developments in that language in a way that 
everyone could understand. In turn, researchers could build on 
one another’s work. That simplifying consensus also enabled 
applied people to use the knowledge better. 

It can be seen that although these two sciences are at the 
same level with respect to much of their product, they are 
quite different with respect to their theory advancement and 
operation. The differences in the advancement of knowledge 
in science areas along these lines have not been systematically 
considered in the philosophy of science. There has not been an 
understanding that the disunified sciences (e.g., psychology) 
operate differently than do the unified sciences (e.g., physics) 
that are employed as the models in the philosophy of science. 
Thus, there has been no guide for theorists to work the fields 
of personality theory and psychological testing to produce the 
more advanced type of knowledge. So this remains a crying 
need. 
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We Need Theory Constructed in Certain Ways 
and With Certain Qualities and Data 


We need theorists to work the field of personality. And they 
need to address certain tasks, as exemplified earlier and later. 
This is only a sample; other characteristics of theory also 
need to be considered in this large task. 


Commonalities Among Theories 


In the field of personality theory there is much commonality, 
overlap, and redundancy among theories. This goes unrecog- 
nized, however, because theorists are free to concoct their own 
idiosyncratic theory language. The same or related phenom- 
ena can be given different names—such as ego, self, self- 
concept, and self-efficacy—and left alone as different. Just in 
terms of parsimony (an important goal of science), each case 
of multiple concepts and unrecognized full or partial redun- 
dancy means that the science is unnecessarily complex and 
difficult, making it more difficult to learn and use. Unrecog- 
nized commonality also artificially divides up the science, 
separating efforts that are really relevant. Personality theo- 
rists, who are in a disunified science in which novelty is the 
only recognized value, make their works as different as possi- 
ble from those of others. The result is a divided field, lacking 
methods of unification. 

We need theorists who work to remove unnecessary the- 
ory elements from our body of knowledge, to work for sim- 
plicity and standardization in theory language. We need to 
develop concepts and principles that everyone recognizes in 
order to build consistency and consensus. It is essential also 
for profundity; when basic terms no longer need to be argued, 
work can progress to deeper levels. 


Data of Theories and Type of Knowledge Yielded 


A fundamental characteristic of the various theories in per- 
sonality is that despite overlap they address different sets of 
phenomena and their methods of data collection are differ- 
ent. For example, Freud’s theory was drawn to a large extent 
from personal experience and from the stated experiences of 
his patients. Carl Rogers’s data was also drawn from per- 
sonal experience and clinical practice. Gordon Allport em- 
ployed the lexical approach, which involved selecting all 
the words from a dictionary that descriptively labeled differ- 
ent types of human behavior. The list of descriptive words 
was whittled down by using certain criteria and then was or- 
ganized into categories, taken to describe traits of personal- 
ity. This methodology rests on large numbers of people, with 
lay knowledge, having discriminated and labeled different 


characteristic behaviors of humans. Raymond Cattell used 
three sources of data. One consisted of life records, as in 
school or work. Another source was self-report in an inter- 
view. And a third could come from objective tests on which 
the individual’s responses could be compared to the re- 
sponses of others. These data could be subjected to factor 
analytic methods to yield groupings of items to measure per- 
sonality traits. 

What is not considered systematically to inform us about 
the field is that the different types of data used in theories 
give those theories different characteristics and qualities. To 
illustrate, a theory built only on the evanescent and imprecise 
data of personal and psychotherapy experience—limited by 
the observer’s own concepts and flavored by them—is un- 
likely to involve precisely stated principles and concepts and 
findings. Moreover, any attempt by the client to explain her 
behavior on the basis of her life experience is limited by 
her own knowledge of behavior and learning and perhaps by 
the therapist’s interpretations. The naturalistic data of self- 
description, however, can address complex events (e.g., 
childhood experiences) not considered in the same way in an 
experimental setup. Test-item data, as another type, can stip- 
ulate behaviors while not including a therapist’s interpreta- 
tions. However, such items concern how individuals are, not 
how they got that way (as through learning). 

Let us take as an example an intelligence test. It can predict 
children’s performance in school. The test was constructed to 
do this. But test data do not tell us how “intelligence” comes 
about or what to do to increase the child’s intelligence. For 
in constructing the test there has been no study of the causes 
of intelligence or of how to manipulate those causes to change 
intelligence. The theory of intelligence, then, is limited by 
the data used. Generally, because of the data on which they 
rest, tests provide predictive variables but not explanatory, 
causal variables. Not understanding this leads to various 
errors. 

The data employed in some theories can be of a causal 
nature, but not in other theories. Although data on animal 
conditioning may lack other qualities, it does deal with cause- 
effect principles. Another important aspect of data used in- 
volves breadth. How many different types of data does a 
theory draw on or stimulate? From how many different fields 
of psychology does the theory draw its data? We should as- 
sess and compare theories on the types of data on which they 
are based. Through an analysis of types of data we will have 
deeper knowledge of our theories, how they differ, how they 
are complementary, the extent to which they can be devel- 
oped to be explanatory as well as predictive, and also how 
they can or cannot be combined in organizing and unifying 
our knowledge. 


Precision of Theories 


There are also formal differences in theories in terms of other 
science criteria, for example, in the extent of precision of state- 
ment. A known example of imprecision was that of Freud’s re- 
action formation. If the person did not do as predicted, then the 
reaction formation still allowed the theory always to be 
“right.” Another type of difference lies in the precision or 
vagueness of definition of concepts. Hull aimed to define his 
habit strength concept with great precision. Rogers’s concept 
of the life force does not have such a precise definition. Sci- 
ence is ordinarily known for its interest in considering and as- 
sessing its theory tools with respect to such characteristics. 
The field of personality needs to consider its theories in this 
respect. 


Unifying and Generality Properties of Theories 


Hans Eysenck showed an interest in applications of condition- 
ing principles to problems of human behavior. He also worked 
on the measurement of personality, in traits such as intelli- 
gence and extroversion-introversion. Moreover, he also had 
interest in variations in psychic ability as shown in exper- 
iments in psychokinisis. (During a six-month stay at the 
Maudsley Hospital in 1961, the author conveyed the spread of 
our American behavioral applications and also argued about 
psychic phenomena, taking the position that selecting subjects 
with high “psychic” ability abrogated the assumptions for the 
statistics employed.) Theorists vary in the number of different 
research areas to which they address themselves. And that 
constitutes an important dimension; other things equal, more 
general theories are more valuable than narrow theories. 
Another property of a theory is that of unifying power. The 
example of Eysenck can be used again. That is, although he 
was interested in behavior therapy, personality measurement, 
and experimental psychic ability, he did not construct a theory 
within which these phenomenal areas were unified within a 
tightly reasoned set of interrelated principles. Both the general- 
ity and the unifying power of theories are very important. 
Freud’s psychological theory was more general than 
Rogers’s. For example, it pertains to child development, ab- 
normal psychology, and clinical psychology and has been 
used widely in those and other fields. And Freud’s theory— 
much more than other theories that arise in psychotherapy— 
also was high in the goal of unification. John Watson began 
behaviorism as a general approach to psychology. The behav- 
ioral theories of personality (such as that of Rotter, and to some 
extent the other social learning theories) exhibit some general- 
ity and unification. The present theory, PB, has the most gen- 
erality and unification aims of all. None of the personality 
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theories, with the exception of the present one, moreover, has 
a systematic program for advancing further in generality and 
unification. 

In general, there are no demands in the field of personality 
to be systematic with respect to generality or unification, and 
there are no attempts to evaluate theories for success in 
attaining those goals. Again, that is different from the other 
more advanced, unified sciences. That is unfortunate, for the 
more a theory of personality has meaning for the different 
areas of psychology, employs products of those fields, and has 
implications for those fields, the more valuable that theory 
can be. 

This view of the field of personality and its personality the- 
ories is a byproduct of the construction of the theory that will 
be considered in the remaining sections. The perspective sug- 
gests that the field of personality will continue to stagnate until 
it begins to work its contents along the lines proposed. 


PERSONALITY: THE PSYCHOLOGICAL 
BEHAVIORISM THEORY 


More than 45 years ago, while still a graduate student at 
UCLA, I began a research program that for some years I did 
not name, then called social behaviorism, later paradigmatic 
behaviorism, and finally PB. I saw great importance in the 
behaviorism tradition as a science, in fundamental learning 
principles, and in experimentation. But I saw also that the 
preceding behaviorisms were incompletely developed, ani- 
mal oriented, and too restricted to laboratory research. They 
also contained fundamental errors and had no plan by which 
to connect to traditional psychology, to contribute to it, and to 
use its products. Very early in the research program I began to 
realize that animal conditioning principles are not sufficient 
to account for human behavior and personality. In my opinion 
a new behavioral theory was needed, it had to focus on 
human behavior systematically and broadly, it had to link 
with traditional psychology’s treatments of many phenomena 
of human behavior, and it had to include a new philosophy 
and methodology. 


Basic Developments 


The early years of this program consisted of studies to extend, 
generally and systematically, conditioning principles to sam- 
ples of human behavior. This was a new program in behavior- 
ism. Some of the studies were informal, some were formal 
publications, and many involved theoretical analyses of 
behaviors—experimental, clinical, and naturalistic—that had 
been described in the psychology literature. One of the goals 
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was to advance progressively on the dimension of simple- 
complex with respect to behavior. The low end of the dimen- 
sion involved establishment of basic principles, already 
begun with the animal conditioning principles. But those prin- 
ciples had to be verified with humans, first with simple behav- 
iors and laboratory control. Then more and more complex 
behaviors had to be confronted, with the samples of behavior 
treated becoming more representative of life behaviors. The 
beginning of this latter work showed convincingly the rele- 
vance of learning-behavior principles for understanding 
human behavior and progressively indicated that new human 
learning principles were needed to deal with complex human 
behavior. Several areas of PB research are described here as 
historical background and, especially, to indicate how the the- 
ory of personality arose in an extended research-conceptual 
development. 


Language-Cognitive Studies 


My dissertation studied how subjects’ verbal responses to 
problem-solving objects were related to the speed with which 
they solved the problem. It appeared that people learn many 
word labels to the objects and events of life. When a situation 
arises that involves those objects and events, the verbal re- 
sponses to them that individuals have learned will affect their 
behavior. The research supported that analysis. 

There are various kinds of labeling responses. A child’s 
naming the letters of the alphabet involves a labeling reper- 
toire. Studies have shown that children straightforwardly 
learn such a repertoire, as they do in reading numbers and 
words. The verbal-labeling repertoire is composed of various 
types of spoken words controlled by stimulus events. The 
child learns to say “car” to cars as stimulus events, to say 
“red” to the stimulus of red light, to say “running” to the visual 
stimulus of rapidly alternating legs that produce rapid move- 
ment, and to say “merrily” to people happily reveling. More- 
over, the child learns these verbal labeling responses—like 
the nouns, adjectives, verbs, and adverbs just exemplified—in 
large quantities, so the verbal-labeling repertoire becomes 
huge. This repertoire enables the person to describe the many 
things experienced in life, but it has other functions as well. 
As discussed later, this and the other language repertoires are 
important components of intelligence. 

As another aspect of language, the child also learns to make 
different motor responses to a large number of words. The 
young child learns to look when hearing the word “look,” to 
approach when hearing the word “come,” to sit when told the 
word “sit,” and to make a touching response when told to 
“touch” something. The child will learn to respond to many 
words with motor responses, constituting the verbal-motor 


repertoire. This repertoire enables the person to follow direc- 
tions. It is constituted not only of a large number of verbs, but 
also of adverbs, nouns, adjectives, and other grammatical ele- 
ments. For example, most people could respond appropriately 
to the request to “Go quickly, please, to the top-left drawer of 
my dresser and bring me the car keys” because they have 
learned motor responses to the relevant words involved. 
Important human skills involve special developments of the 
verbal-motor subrepertoire. As examples, ballet dancers, vio- 
linists, NFL quarterbacks, mechanics, and surgeons have spe- 
cial verbal-motor repertoires that are essential parts of their 
special skills. 

Another important part of language is the verbal- 
association repertoire. When the word salt is presented as a 
stimulus in a word-association task, a common response is 
pepper or water. However, an occasional person might re- 
spond by saying wound or of the earth or something else that 
is less usual. Years ago it was believed that differences in as- 
sociations had personality implications, and word-association 
tests were given with diagnostic intent. Analysis of word asso- 
ciations as one of the subrepertoires of the language-cognitive 
repertoire suggests more definitively and specifically that this 
constitutes a part of personality. Consider a study by Judson, 
Cofer, and Gelfand (1956). One group of subjects learned a 
list of words that included the sequence rope, swing, and pen- 
dulum. The other group learned the same list of words, but the 
three words were not learned in sequence. Both groups then 
had to solve a problem by constructing a pendulum from a 
light rope and swinging it. The first group solved the problem 
more quickly than did the second. Thus, in the present view 
the reasoning ability of the two groups depended on the word 
associations they had learned. 

Word associates are central to our grammatical speech, the 
logic of our speech and thought, our arithmetic and mathemat- 
ical knowledge, our special area and general knowledge, 
our reasoning ability, our humor, our conversational ability, 
and our intelligence. Moreover, there are great individual 
differences in the verbal-association repertoire such that it 
contributes to differences on psychological tests. Additional 
repertoires are described in the PB theory of language- 
cognition (see Staats, 1968, 1971, 1975, 1996). 


Emotional-Motivational Studies 


An early research interest of PB concerned the emotional 
property of words. Using my language conditioning method I 
showed subjects a visually presented neutral word (nonsense 
syllable) paired once each with different auditorily presented 
words, each of which elicited an emotional response, with one 
group positive emotion and with another group negative in a 


classical conditioning procedure. The results of a series of ex- 
periments have showed that a stimulus paired with positive or 
negative emotional words acquires positive or negative emo- 
tional properties. Social attitudes, as one example, are 
emotional responses to people that can be manipulated by lan- 
guage conditioning (Staats & Staats, 1958). To illustrate, in a 
political campaign the attempt is made to pair one’s candidate 
with positive emotional words and one’s opponent with nega- 
tive emotional words. That is why the candidate with greater 
financial backing can condition the audience more widely, 
giving great advantage. 

Skinner’s theory is that emotion (and classical condition- 
ing) and behavior (and operant conditioning) are quite sepa- 
rate, and it is the operant behavior that is what he considers 
important. In contrast, PB’s basic learning-behavior theory 
states that the two types of conditioning are intimately related 
and that both are important to behavior. For one thing, a 
stimulus is reinforcing because it elicits an emotional 
response. Thus, as a stimulus comes to elicit an emotional re- 
sponse through classical conditioning, it gains potential as a 
reinforcing stimulus. My students and I have shown that 
words eliciting a positive or negative emotional response will 
function as a positive or negative reinforcer. In addition, the 
PB learning-behavior theory has shown that a stimulus that 
elicits a positive or negative emotional response will also 
function as a positive or negative incentive and elicit approach 
or avoidance behavior. That is a reason why emotional words 
(language) guide people’s behavior so ubiquitously. An im- 
portant concept from this work is that humans learn a very 
large repertoire of emotion-eliciting words, the verbal- 
emotional repertoire. Individual differences in this repertoire 
widely affect individual differences in behavior (see Staats, 
1996). 

One other principle should be added for positive emo- 
tional stimuli: They are subject to motivational (deprivation- 
satiation) variations. For example, food is a stimulus that 
elicits a positive emotional response on a biological basis; 
however, the size of the response varies according to the 
extent of food deprivation. That also holds for the reinforce- 
ment and incentive effect of food stimuli on operant behavior. 
These three effects occur with stimuli that elicit an emotional 
response through biology (as with food) or through learning, 
as with a food word. 

The human being has an absolutely gargantuan capacity 
for learning. And the human being has a hugely complex 
learning experience. The result is that in addition to biologi- 
cally determined emotional stimuli, the human learns a gigan- 
tic repertoire that consists of stimuli that elicit an emotional 
response, whether positive or negative. There are many vari- 
eties of stimuli—art, music, cinema, sports, recreations, 
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religious, political, manners, dress, and jewelry stimuli—that 
are operative for humans. They elicit emotion on a learned 
basis. As a consequence, they can also serve as motivational 
stimuli and act as reinforcers and incentives. That leads to a 
conclusion that individual differences in the quantity and type 
of emotional stimuli will have great significance for personal- 
ity and human behavior. 


Sensorimotor Studies 


Following its human-centered learning approach, PB studied 
sensorimotor repertoires in children. To illustrate, consider the 
sensorimotor response of speech. Traditional developmental 
norms state that a child generally says her first words at the 
age of | year, but why there are great individual differences is 
not explained, other than conjecturing that this depends on 
biological maturation processes. In contrast, PB states that 
speech responses are learned according to reinforcement prin- 
ciples, but that reinforcement depends on prior classical con- 
ditioning of positive emotion to speech sounds (Staats, 1968, 
1996). I employed this theoretical analysis and learning pro- 
cedures in accelerating the language development of my own 
children, in naturalistic interactions spread over a period of 
months, but adding up to little time expenditure. Their speech 
development accelerated by three months, which is 25% of the 
usual 12-month period (Staats, 1968). I have since validated 
the learning procedures with parents of children with retarded 
speech development. Lovaas (1977) has used this PB frame- 
work. Psychological behaviorism also systematically studied 
sensorimotor skills such as standing, walking, throwing and 
catching a ball, using the toilet, writing letters, paying atten- 
tion, counting objects, and so on in systematic experimental- 
longitudinal research (see Staats, 1968, 1996). 

In this theory of child development, PB pursued its goal of 
unification with traditional psychology, in this case with the 
field of child development. The PB position is that the norms 
of traditional child developmentalists provide valuable 
knowledge. But this developmental conception errs in assum- 
ing biological determination and in ignoring learning. Prior to 
my work, the reigning view was that it was wasteful or harm- 
ful to attempt to train the child to develop behaviors early. For 
example, the 4-year-old child was said to be developmentally 
limited to an attention span of 5 min to 15 min and thus to be 
incapable of formal learning. We showed that such preschool- 
ers can attend well in the formal learning of reading skills for 
40-min periods if their work behaviors are reinforced (Staats 
et al., 1964). When not reinforced, however, they do not 
attend. My later research showed that children learn progres- 
sively to attend and work well for longer periods by having 
been reinforced for doing so. 


146 A Psychological Behaviorism Theory of Personality 


Rather than being a biologically determined cognitive abil- 
ity, attention span is actually a learned behavior. The same is 
true with the infant’s standing and walking, the development 
of both of which can be advanced by a little systematic train- 
ing. The child of 2 years also can be straightforwardly trained 
to count unarranged objects (Piaget said 6 years). Writing 
training can be introduced early and successfully, as can other 
parts of the sensorimotor repertoire. I also developed a proce- 
dure for potty training my children (see Staats, 1963) that was 
later elaborated by Azrin and Foxx (1974). Such findings have 
changed society’s view of child development. 

What emerges from this work is that the individual learns 
the sensorimotor repertoire. Without the learning provided in 
the previous cases, children do not develop the repertoires. 
Moreover, the human sensorimotor repertoire is, again, vast 
for individuals. And over the human community it is infi- 
nitely varied and variable. There are skills that are generally 
learned by all, such as walking and running. And there are 
skills that are learned by only few, such as playing a violin, 
doing surgery, or acting as an NFL quarterback. As such there 
are vast individual differences among people in what sensori- 
motor skills are learned as well as in what virtuosity. 


Additional Concepts and Principles 
Human Learning Principles 


As indicated earlier, a basic assumption of traditional behav- 
iorism is that the animal learning principles are the 
necessary and sufficient principles for explaining human be- 
havior. Psychological behaviorism’s program has led to the 
position that while the animal conditioning principles, inher- 
ited through evolution, are indeed necessary for explaining 
human behavior, they are far from sufficient. I gained an early 
indication of that with my research on the language condition- 
ing of attitudes, and later findings deepened and elaborated the 
principles. 

What the traditional behaviorists did not realize is that 
human learning also involves principles that are unique to 
humans—human learning principles. The essential, new fea- 
ture of these principles is that much of what humans learn 
takes place on the basis of what they have learned before. For 
example, much human learning can occur only if the individ- 
ual has first learned language. Take two children, one of whom 
has learned a good verbal-motor repertoire and one of whom 
has not. The first child will be able to follow directions and 
therefore will be able to learn many things the second child 
cannot because many learning tasks require the following of 
directions. The goodness of that verbal-motor repertoire dis- 
tinguishes children (as we can see on any intelligence test for 


children). In PB, language is considered a large repertoire 
with many important learning functions. Learning to count, to 
write, to read, to go potty, to form attitudes, to have logic and 
history and science knowledge and opinions and beliefs, to be 
religious, to eat healthily and exercise, and to have political 
positions are additional examples in which language is a foun- 
dation. A child of 18 months can easily learn to name numbers 
of objects and then to count if that child has previously learned 
a good language repertoire (see Staats, 1968). On the other 
hand, a child of 3 years who has not learned language will not 
be able to learn those number skills. The reason for the differ- 
ence is not some genetic difference in the goodness of learn- 
ing. Rather, the number learning of the child is built on the 
child’s previous language learning. It is not age (biology) that 
matters in the child’s learning prowess; it is what the child has 
already learned. 


Cumulative-Hierarchical Learning 


Human learning is different from basic conditioning because 
it typically involves learning that is based on repertoires that 
have been previously learned. This is called cumulative- 
hierarchical learning because of the building properties 
involved—the second learning is built on the first learning 
but, in turn, provides the foundation for a third learning. Mul- 
tiple levels of learning are typical when a fine performance is 
involved. Let us take the learning of the language repertoire. 
When the child has a language repertoire, the child can then 
learn to read. When the child has a reading repertoire, the 
child can learn more advanced number operations, after 
which the child can learn an algebra repertoire, which then is 
basic in learning additional mathematics repertoires, which 
in turn enable the learning of physics. Becoming a physicist 
ordinarily will involve in excess of 20 years of cumulative- 
hierarchical learning. 

Cumulative-hierarchical learning is involved in all the 
individual’s complex characteristics. A sociopath—with the 
complex of language-cognitive, emotional-motivational, and 
sensorimotor repertoires this entails—does not spring forth 
full-blown any more than being a physicist. Understanding 
the sociopathic personality, hence, requires understanding the 
cumulative-hierarchical learning of the multiple repertoires 
that have been involved. 


The Basic Behavioral Repertoire: A Cause as Well 
as an Effect 


And that brings us to another concept developed in PB, that 
is, the basic behavioral repertoire (BBR). The BBRs are those 
repertoires that provide the means by which later learning can 


occur, in the cumulative-hierarchical learning process. In 
providing foundations for further learning, the three major 
BBRs—the emotional-motivational, language-cognitive, and 
sensorimotor—also grow and elaborate through cumulative- 
hierarchical learning. 

The learning of the basic behavioral repertoires changes the 
individual. The BBRs thus act as independent variables that 
determine what the individual experiences, how the individual 
behaves, and what the individual learns. The cumulative- 
hierarchical learning of such repertoires is fundamental in 
child development; in fact, the PB theory is that the study of 
that learning should be the primary objective of this field, as it 
should be in the field of personality. 


The Concept of Personality 


It is significant in comparing the PB theory to other personal- 
ity theories to note differences in such things as the type of 
data involved and the specificity, precision, systematicity, 
and empirical definition of principles and concepts. It is such 
characteristics that determine the functions that a theory can 
have. Another characteristic of the PB approach concerns the 
schism between traditional psychology and traditional be- 
haviorism. Traditional psychology infers personality as a 
unique internal process or structure that determines the indi- 
vidual’s unique behavior. That makes study of personality 
(and related concepts) very central. Traditional behaviorism, 
in opposition, and according to its fundamental methodology, 
cannot accept an inferred concept as the cause of behavior. 
So, while almost every personologist considers learning to be 
important in personality, traditional behaviorism, which 
should be concerned with how learning affects personality, 
cannot even consider the topic. The schism leaves personal- 
ity theories incomplete and divides psychology. 


The PB Definition of Personality 


The PB program has led to the development of a theory of 
personality that can resolve that schism in a way that is valu- 
able to both sides. The PB definition of personality is that it is 
composed of the three basic behavioral repertoires that the 
individual has learned. That definition harmonizes with be- 
haviorism, for the PB program is to study the behaviors in 
those repertoires and how they are learned, as well as how 
they have their effects on the individual’s characteristic be- 
havior. At the same time, that definition is very compatible 
with the traditional view of personality as an internal process 
or structure that determines behavior. As such, the PB con- 
cept of personality can link with traditional work on person- 
ality, including personality tests, and can also contribute to 
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advancement of that work. How the three BBRs compose 
personality is described next. 


The Emotional-Motivational Aspects of Personality 


There are many concepts that refer to human emotions, emo- 
tional states, and emotional personality traits. As examples, it 
may be said that humans may feel the responses of joy or fear, 
may be in a depressed or euphoric state, and may be optimistic 
or pessimistic as traits. The three different emotional 
processes are not usually well defined. PB makes explicit 
definitions. First, the individual can experience specific, 
ephemeral emotional responses depending on the appearance- 
cessation of a stimulus. Second, multiple emotion-eliciting 
events can yield a series of related emotional responses that 
add together and continue over time; this constitutes an emo- 
tional state. Third, the individual can learn emotional re- 
sponses to sets of stimuli that are organized—like learning a 
positive emotional response to a wide number of religious 
stimuli. That constitutes an emotional-motivational trait (reli- 
gious values); that is, the individual will have positive emo- 
tional responses to the stimuli in the many religious situations 
encountered. And that emotional-motivational trait will affect 
the individual’s behavior in those many situations (from the 
reinforcer and incentive effects of the religious stimuli). For 
these reasons the trait has generality and continuity. There are 
psychological tests for traits such as interests, values, atti- 
tudes, and paranoid personality. There are also tests for states 
such as anxiety and depression and moods. And there are 
also tests for single emotional responses, such as phobias or 
attitudes. 

Personality theories usually consider emotion. This is 
done in idiosyncratic terminology and principles. So how one 
theory considers emotion is not related to another. Theories 
of emotion at the personality level are not connected to stud- 
ies of emotion at more basic levels. Many psychological tests 
measure emotions, but they are not related to one another. 
Psychological behaviorism provides a systematic framework 
theory of emotion that can deal with the various emotional 
phenomena, analyze many findings within the same set of 
concepts and principles, and thus serve as a unifying overar- 
ching theory. Psychological behaviorism experimentation 
has shown that interest tests deal with emotional responses to 
occupation-related stimuli, that attitude tests deal with emo- 
tional responses to groups of people, and that values tests 
deal with emotional responses to yet other stimuli, unifying 
them in the same theory. 

In the PB theory, beginning with the basic, the individual 
has emotional responses to stimuli because of biological struc- 
ture, such as a positive emotional response to food stimuli, 
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certain tactile stimuli, warm stimulation when cold, and vice 
versa, and a negative emotional response to aversive, harmful 
stimuli of various kinds. Conditioning occurs when any neu- 
tral stimulus is paired with one of those biological stimuli and 
comes to elicit the same type of emotional response. Condi- 
tioning occurs also when a neutral stimulus is paired with an 
emotion-eliciting stimulus (e.g., an emotional word) that has 
gained this property through learning. The human has a long 
life full of highly variable, complex experiences and learns an 
exceedingly complex emotional-motivational repertoire that 
is an important part of personality. People very widely have 
different emotional learning. Not everyone experiences posi- 
tive emotional responses paired with religious stimuli, foot- 
ball-related stimuli, or sex-related stimuli. And different 
conditioning experiences will produce different emotional- 
motivational repertoires. Because human experience is so 
variegated, with huge differences, everyone’s hugely complex 
emotional-motivational personality characteristics are unique 
and different. 

That means, of course, that people find different things 
reinforcing. What is a reward for one will be a punishment for 
someone else. Therefore, people placed in the same situation, 
with the same reinforcer setup, will learn different things. 
Consider a teacher who compliments two children for work- 
ing hard. For one child the compliment is a positive reinforcer, 
but for the other child it is aversive. With the same treatment 
one child will learn to work hard as a consequence, whereas 
the other will work less hard. That is also true with respect to 
incentives. If one pupil has a positive emotional response to 
academic awards and another pupil does not, then the initia- 
tion of an award for number of books read in one semester will 
elicit strong reading behavior in the one but not in the other. 
What is reinforcing for people and what has an incentive ef- 
fect for them strongly affects how they will behave. That is 
why the emotional BBR is an important personality cause of 
behavior. 


The Language-Cognitive Aspects of Personality 


Each human normally learns a huge and fantastically com- 
plex language repertoire that reflects the hugely complex 
experience each human has. There is commonality in that ex- 
perience across individuals, which is why we speak the same 
language and can communicate. But there are gigantic indi- 
vidual differences as well (although research on language 
does not deal with those). Those differences play a central 
role in the individual differences we consider in the fields of 
personality and personality measurement. 

To illustrate, let us take intelligence as an aspect of per- 
sonality. In PB theory intelligence is composed of basic 


behavioral repertoires, largely of a language-cognitive nature 
but including important sensorimotor elements also. People 
differ in intelligence not because of some biological quality, 
but because of the basic behavioral repertoires that they have 
learned. We can see what is specifically involved at the 
younger age levels, where the repertoires are relatively simple. 
Most items, for example, measure the child’s verbal-motor 
repertoire, as in following instructions. Some items specifi- 
cally test that repertoire, as do the items on the Stanford-Binet 
(Terman & Merrill, 1937, p. 77) that instruct the child to “Give 
me the kitty [from a group of small objects]” and to “Put the 
spoon in the cup.” Such items, which advance in complexity 
by age, also test the child’s verbal-labeling repertoire. The 
child can only follow instructions and be “intelligent” if he or 
she has learned the names of the things involved. 

The language-cognitive repertoires also constitute other as- 
pects of personality, for they are important on tests of language 
ability, cognitive ability, cognitive styles, readiness, learning 
aptitude, conceptual ability, verbal reasoning, scholastic apti- 
tude, and academic achievement tests. The tests, considered to 
measure different facets of personality, actually measure char- 
acteristics of the language-cognitive BBR. The self-concept 
also heavily involves the verbal-labeling repertoire, that is, the 
labels learned to the individual’s own physical and behavioral 
stimuli. People differ in the labels they learn and in the emo- 
tional responses elicited by those verbal labels. We can exem- 
plify this using an item on the MMPI (Dahlstrom & Welsh, 
1960, p. 57): “I have several times given up doing a thing be- 
cause I thought too little of my ability.” Individuals who have 
had different experience with themselves will have learned 
different labels to themselves (as complex stimuli) and will 
answer the item differently. The self-concept (composed of 
learned words) is an important aspect of personality because 
the individual reasons, plans, and decides depending on those 
words. So the learned self-concept plays the role of a cause of 
behavior. As another example, the “suspiciousness” of para- 
noid personality disorder heavily involves the learned verbal- 
labeling repertoire. This type of person labels the behaviors of 
others negatively in an atypical way. The problem is that the 
unrealistic labeling affects the person’s reasoning and behav- 
ior in ways that are not adjustive either for the individual or 
for others. 

These examples indicate that what are traditionally con- 
sidered to be parts of personality are conceived of in PB as 
parts of the learned language-cognitive BBR. 


The Sensorimotor Aspects of Personality 


Traditionally, the individual’s behavior is not considered 
as a part of personality. Behavior is unimportant for the 


personologist. Everyone has the ability to behave. It is per- 
sonality that is important, for personality determines behav- 
ior. Even when exceptional sensorimotor differences are 
clearly the focus of attention, as with superb athletes or 
virtuoso musicians, we explain the behavior with personality 
terms such as “natural athlete” or “talent” or “genius” each of 
which explains nothing. 

Psychological behaviorism, in contrast, considers sensori- 
motor repertoires to constitute learned personality traits in 
whole or part. And there are very large individual differences 
in such sensorimotor repertoires. Part of being a physically 
aggressive person, for example, involves sensorimotor behav- 
iors for being physically aggressive. Being a natural athlete, 
as another example, involves a complex set of sensorimotor 
skills (although different body types can be better suited for 
different actions). Being dependent, as another example, may 
also involve general deficits in behavior skills. Moreover, 
sensorimotor repertoires impact on the other two personality 
repertoires. For example, a person recognized for sensorimo- 
tor excellence in an important field will display language- 
cognitive and emotional-motivational characteristics of 
“confidence” that have been gained from that recognition. 

A good example of how sensorimotor repertoires are 
part of personality occurred in a study by Staats and Burns 
(1981). The Mazes and Geometric Design tests of the 
Wechsler Preschool and Primary Scale of Intelligence 
(WPPSI) (Wechsler, 1967) were analyzed into sensorimotor 
repertoire elements. That analysis showed that children learn 
that repertoire—of complex visual discrimination and other 
sensorimotor skills—when exposed to learning to write the let- 
ters of the alphabet. The expectation, thus, is that children 
trained to write letters will thereby acquire the repertoire by 
which to be “intelligent” on the Mazes and Geometric Design 
tests, as confirmed in our study. As other examples, on the 
Stanford-Binet (Terman & Merrill, 1937) the child has to build 
a block tower, complete a line drawing of a man, discriminate 
forms, tie a knot, trace a maze, fold and cut a paper a certain 
way, string beads a certain way, and so on. These all require 
that the child have the necessary sensorimotor basic behavioral 
repertoire. This repertoire is also measured on developmental 
tests. This commonality shows that tests considered measures 
of different aspects of personality actually measure the same 
BBR. Such an integrative analysis would be central in concep- 
tualizing the field and the field needs many such analyses. 

The sensorimotor repertoire also determines the individ- 
ual’s experiences in ways that produce various aspects of per- 
sonality. For example, the male who acquires the skills of a 
ballet dancer, painter, carpenter, center in the NBA, symphony 
violinist, auto mechanic, hair dresser, professional boxer, 
architect, or opera singer will in the learning and practice of 
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those skills have experiences that will have a marked affect on 
his other personality repertoires. Much emotional-motiva- 
tional and language-cognitive learning will take place, and 
each occupational grouping will as a result have certain com- 
mon characteristics. 

As final examples, being physically aggressive is generally 
seen as an aspect of personality, a part of some inner psycho- 
logical process. However, a person cannot be physically ag- 
gressive without the sensorimotor skills for being so. It is true 
that more is involved than just those skills. But those sensori- 
motor skills are an important part. Likewise, part of aperson’s 
being caring and nurturing resides in the sensorimotor skills 
for being so. A person cannot be a “natural” athlete without 
having learned the repertoire of sensorimotor skills that en- 
ables him or her to learn new sports easily, rapidly, gracefully, 
and very well. One cannot be a mechanical, athletic, artistic, 
or surgical genius, or amusical or dance virtuoso, without the 
requisite sensorimotor repertoire. Are sensorimotor differ- 
ences part of personality? And are those differences learned? 
The PB theory answer to both questions is yes. 

The PB analyses that show tests measure BBRs provide a 
whole new way of viewing psychological tests, with a large 
new agenda for research, as will be indicated. 


Definition of the Personality Trait 


The personality trait is thus a particular feature of one or more 
of the three basic behavioral repertoires. Traits involve com- 
plex repertoires. For example, liking a religious song involves 
an isolated emotion. But if the person also has a positive emo- 
tional response to many religious stimuli—to the stated be- 
liefs, history, rituals, holidays, personages, and tenets of 
religion, generally and particularly—this constitutes a person- 
ality trait, an important part of the emotional-motivational 
BBR (as well as of language-cognitive and sensorimotor 
repertoires). That emotional-motivational repertoire will have 
general effects on the individual’s behavior, life experiences, 
and further learning, both for normal and abnormal traits. 

In PB the personality trait, as a complex repertoire of re- 
sponses, is considered a universe from which the various sit- 
uations of life sample. To illustrate, the individual’s language 
repertoire includes many different behaviors. A question like 
“How much are two and two?” is a life situation that samples 
the language-cognitive repertoire in eliciting the one response 
“Four.” Many items on intelligence tests sample individuals’ 
language repertoires. That sample is representatives of how 
rich that particular universe is. The entire universe is the total 
BBR, that is, the personality repertoire. 

Personality traits are constituted of particular repertoires 
that produce types of experience, learning, and behavior. For 
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example, a person with a trait of religiosity will display coin- 
cident knowledge (language) of religious material, will expe- 
rience religious situations with positive emotion and be 
motivated by such situations, as well as exhibit the special- 
ized ritualistic behaviors of the religion. 


The Principles of the Personality Theory 


Figure 6.1 schematizes and makes more explicit the concepts 
and principles of the PB theory of personality. Personality is 
composed of the individual’s basic behavioral repertoires. As 
a consequence of previous learning, depicted as S,, the indi- 
vidual learns BBRs. At a later time the individual is confronted 
with an environmental situation, S,, which elicits (samples) 
elements from the individual’s BBRs. Those elements make 
up the individual’s behavior (B) in that situation. Personality 
does not equate with the individual’s behavior. For example, 
many individuals learn words that are never uttered. So the in- 
dividual’s language-cognitive BBR can never be ascertained 
from observing behavior; the individual’s potential for behav- 
ior is greater than that which is exhibited. 

Traditional behaviorism never established how biology 
works its effects in the explanation of behavior. In contrast, in 
PB’s personality theory the individual’s biological character 
plays an important role at different times. First, the learning 
of the basic behavioral repertoires takes place by virtue of the 
brain and peripheral nervous system, muscles, tendons, emo- 
tional response organs, and such. The organic state at the time 
of learning is thus an important independent variable. This 
includes permanent biological conditions such as brain dam- 
age as well as ephemeral biological conditions such as those 
of deprivation-satiation, illness, and drug and alcohol effects. 
These biological conditions that are influential at the time of 
learning the BBRs are designated as O}. 

In addition, however, at the time the individual experi- 
ences a later situation certain biological conditions, O,, are 
operating in ways that affect the state of the individual’s 
BBRs. For the BBRs to be operative they have to be retained 
(remembered). Any temporary conditions, such as drugs or a 
fever, that effect the brain mechanisms that house the BBRs 
will be important, as will more permanent conditions such as 


brain damage that has deleted BBRs in whole or part. In ad- 
dition, the biological mechanism plays a third role. Even 
though the individual has retained the BBRs, other biological 
conditions, O,, may affect the ability of S,, the later situation, 
to elicit them. For example, the individual’s sensory systems 
may be affected by drugs or other organic conditions that 
limit or distort the sensory responses, as occurs with a person 
who because of poor hearing cannot respond emotionally to a 
touching dialogue in a movie. 

In this theoretical conception environmental conditions 
play two roles in the determination of the individual’s behav- 
ior. Separating these environmental events enables a more 
explicit consideration of both environmental and biological 
effects on personality and behavior. In both of these ways the 
definition of personality becomes more explicit. Several ad- 
ditional specifications can be added. 


Plasticity and Continuity in Personality 


There has been an issue of whether individuals behave the 
same across time and situations or whether their behavior is 
situationally determined. Watson’s behaviorism raised the 
issue, which was argued to a stalemate in his era. Mischel’s 
1968 book revivified the contest by arguing for the situational 
determinism position and against the conception that the in- 
dividual has a personality that acts across situations. A num- 
ber of pro and con works were then published until, as 
generally happens in such issues, interest for the moment was 
exhausted. A deeper analysis can be made, however, that can 
resolve the issue. 

To begin, Figure 6.1 has various implications. Behavior is 
certainly situational, for the situation does indeed play an im- 
portant role in selecting the elements of behavior displayed in 
that situation. For example, people generally act boisterously 
at a football game or wrestling match and sedately in a place 
of worship, a library, or a museum. 

But there is generality to personality also. A particular BBR 
over time can be relevant to various situations, and the indi- 
vidual’s behavior can thereby show characteristic features 
across those situations. For example, a person with a large 
repertoire of skilled singing behaviors will have learned a 
repertoire whose elements are called out in many later envi- 
ronmental situations. Compared to others the individual will 
sing more generally and more skillfully than others lacking 
that repertoire. Clearly that will be a characteristic, general, 
and stable feature of the individual’s behavior, considered to 
reflect a personality trait. 

Personality typically produces stability over time and sit- 
uations. For example, a person who has learned positive val- 
ues (emotion) to positions on the conservative side of many 


political-social-economic events (issues) will tend to display 
conservative behavior in the books and magazines that are 
read, the television programs that are watched, the lectures 
that are attended, the church that is attended, the voting 
choices that are made, the person who is married, the opin- 
ions that are expressed, and so on. As this example shows, a 
general trait—emotional-motivational, language-cognitive, 
or sensorimotor—promulgates additional trait develop- 
ment by ensuring additional experience of the same type that 
originally produced the BBR. In the abnormal area, for ex- 
ample, once the individual has learned negative emotional re- 
sponses to people generally, the individual will display 
negative behaviors (such as suspicion) to people. They in 
turn will typically respond in negative ways that will further 
condition the individual to have negative emotional re- 
sponses to people. That can become a general, deep, and con- 
tinuing abnormal trait. 

Stability in personality is produced in these ways. Thus, 
the BBRs, once formed, tend to ensure continuity of experi- 
ence, learning, and behavior. But personality can also exhibit 
change. For the process of personality development never 
ends. Learning goes on for the whole life span. In unusual 
cases something may happen to change a fundamental direc- 
tion in life. To illustrate, a conservative, conventional man 
may experience the horrors and immorality of war and 
thereby read things and participate in activities and meet peo- 
ple he otherwise would not. And these continuing experi- 
ences may ultimately provide him with new BBRs—new 
personality traits—that change his behavior drastically. The 
cumulative-hierarchical learning involved smacks of a chaos 
theory effect. 


The Multilevel Nature of the Theory 
and the Implications 


Simplification is a goal of science, and oversimplification is 
common. The traditional approach to personality involves 
this; that is, personality is conceptually simpler than myriad 
behaviors. Specification of personality, thus, could make it 
unnecessary to study all those behaviors. Furthermore, if one 
takes personality to be the cause of behavior, one need only 
study personality and not all the other fields of psychology, 
like animal learning principles and cognitive things (such as 
language), child development, social interaction principles, 
educational psychology, and so on. 

But PB differs here. Explaining human behavior is not 
considered a two-level task, with one basic theory level, the 
study of personality, which explains the second level, behav- 
ior. Psychological behaviorism says that psychology is di- 
vided into fields that have a general hierarchical relationship 
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with one another. The field of animal learning is basic to a 
field like developmental psychology because much of devel- 
opment depends on learning. The field of developmental psy- 
chology, on the other hand, is basic to the field of personality 
because important aspects of personality develop in child- 
hood. In turn, knowledge of personality is relevant to psy- 
chological measurement, abnormal psychology, and clinical 
and educational psychology. 

This multilevel relationship has many exceptions, and there 
is a bidirectional exchange between areas (levels). But the pres- 
ent position is that a personality theory that does not take into 
account the various major fields (levels) of psychology can 
only be a part theory. Learning, for example, is important to 
personality, as most personologists would agree. That being 
the case, the field should demand that a personality theory in- 
dicate how it links to and draws from the study of learning. The 
same is true of the fields of child development, experimental 
psychology (in studying language-cognition, emotion-motiva- 
tion, and sensorimotor behavior), biology, and social interac- 
tion. Personality theory on the other side should be basic to 
personality measurement and to abnormal, clinical, educa- 
tional, and industrial psychology. Personality theories should 
be evaluated comparatively for the extent to which they have a 
program for drawing from and contributing to the various 
fields of psychological knowledge (see Staats, 1996, for PB’s 
most advanced statement of its multilevel approach.) 

The traditional oversimplified view of the study of per- 
sonality needs change that broadens and deepens its scope as 
well as its analytic powers. 


PERSONALITY THEORY FOR THE 
TWENTY-FIRST CENTURY 


The PB theory of personality says the phenomena of 
personality—what it is, how it is learned, and the effects it 
has—are complex and require a theory capable of dealing 
with that complexity. And that complex theory suggests many 
more things to do than the traditional approach envisages. For 
one thing, there is a large task of specifying what the person- 
ality repertoires are, how they are learned, and how they op- 
erate. Psychological behaviorism says it has begun the study, 
but the task is huge, and the program for the twenty-first cen- 
tury must be suitably huge. It should be added that PB, while 
showing the task to be more complex than traditionally con- 
sidered, provides a foundation that simplifies the task. For all 
the studies made within its framework will be related and 
meaningful to one another. They all add together and advance 
toward explaining personality. Doing that permits research 
becoming progressively more profound, unimpeded by the 
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necessity of arguing perennially about basics. The fact is the 
traditional framework allows for a seeming simplicity; per- 
sonality theories can be created that are simple, but they have 
very little scope. Worse, however, the traditional framework 
allows for the creation of an infinity of such approaches to 
personality, all of them unrelated. The result is a large and 
chaotic fund of unrelated knowledge, set forth in many differ- 
ent and competing theory languages, impossible to work with 
as a student, researcher, or practitioner. This constitutes irre- 
solvable complexity. And the framework only guides the field 
to multiply its complexity with new and unrelated works. 
Generally, there is no advancement of knowledge in terms of 
parsimony, profundity, organization, non-redundancy, relat- 
edness, and explanatory value. 

Some of the implications of the PB theory of personality 
for study in the twenty-first century will be sketched. 


Biology and Personality 


Biological characteristics do indeed play an important role in 
human behavior and in individual differences in behavior. 
But in the present view, without a good conception of per- 
sonality, biological research is presently not of the type 
needed. The traditional search is for the biological mecha- 
nisms that produce personality traits, which PB considers the 
wrong path. Rather, the PB position is that the individual’s 
biology provides the mechanism by which the learning of 
the BBRs can take place, be stored, and be selectively acti- 
vated by the stimuli of the later environmental situations the 
individual encounters. Biological studies of various kinds 
are needed to specify the biological events involved in these 
processes. 


Learning and Personality 


While biological conditions are the most basic level of study 
proposed, it is the field of learning that is the most important 
basic level. Anomalously, however, especially since most 
every personologist would agree that personality is in good 
measure learned, personologists generally have not studied 
how learning-behavior principles are involved in the acquisi- 
tion or function of personality. There seems to be an implicit 
view that learning is not that much different for people except 
in extreme cases. 

The PB position, on the contrary, is that the personality 
repertoires are learned, that there are wide individual differ- 
ences in the learning conditions involved, and that those 
differences produce infinitely varied personality characteris- 
tics. Psychological behaviorism says that the first major 
task of a personality theory is formulating a basic theory of 


learning-behavior and a theory of human learning. No other 
existing personality theory does this. 


Human Learning and Personality 


The basic animal-conditioning principles are not sufficient for 
dealing with the learning of personality. There have been stud- 
ies, long since abandoned, employing human subjects that 
dealt with more complex learning situations and produced 
principles such as mediated generalization, sensory precondi- 
tioning, and verbal associations. But there has not been a 
conceptual framework to guide the field to study what is nec- 
essary, that is, to study how humans learn complex, functional 
repertoires in an advancing cumulative-hierarchical way. 
There has been no systematic goal of studying the basic be- 
havioral repertoires that are important to humans. Although 
there are research fields that study language, emotion, and 
sensorimotor behavior, these fields do not systematically ad- 
dress how these behaviors are important for human adjust- 
ment. Studies should be conducted that indicate how such 
repertoires function to (a) change the individual’s experience, 
(b) change the individual’s behavior, and (c) change the indi- 
vidual’s ability to learn. Such knowledge is needed to provide 
foundations for advancing the study of personality. For con- 
structing theory, personology needs fundamental knowledge 
of cumulative-hierarchical learning, the BBRs, their content, 
and how the BBRs work to affect experience, learning, and 
behavior. 


Developmental Psychology 


Some of the theories of personality include reference to how 
personality develops in childhood. Freud’s psychoanalytic 
theory initiated this and has had great influence on some 
other personality theories in this respect. But Freud’s theory 
of learning was lacking: He had no understanding of human 
learning principles or what is learned via those principles, no 
concepts of the BBRs, how they are learned, how important 
they are for further learning of personality, and so on. So his 
treatment (and others in this tradition) of child development 
in personality formation had to be limited and lacking. 

The PB position is that the learning experiences of child- 
hood set the individual’s basic personality (BBRs) to a great 
extent so that what follows typically continues in the same 
line of development. This conceptual position and its empiri- 
cal findings indicate that the field of child development should 
be an essential study. The focus of the field in the PB view 
should become the study of the central BBRs that are learned 
in childhood—a large agenda. This position recognizes the 
value of traditional research, such as longitudinal study of 


behavioral development, but also sets new avenues of re- 
search. To illustrate, it is important to know that children stand 
unaided at the age of 6 months and walk at the age of 1 year. 
But that type of knowledge needs to be joined with a behav- 
ioral analysis of the behavior involved, how the behavior 
is learned, and what the function of the behavior is in later 
development. 

Moreover, research needs to be conducted with respect to 
how repertoires are learned in a cumulative-hierarchical 
manner to constitute progressively more complex entities 
that constitute personality. Language development, for exam- 
ple, needs progressive study from the time when the reper- 
toires are simple to the time when they are more complex, 
both in their features that are general to most children as well 
as in features personal to individuals. The manner in which 
different repertoires in language provide the springboards for 
later learning needs study. To illustrate, the verbal-motor 
repertoire (by which the child follows directions) is elabo- 
rated throughout childhood. How is that BBR basic in the 
learning of elements in other language, sensorimotor, and 
emotional repertoires? Such very essential subject matter is 
not being studied today. 

This is to say that the theory of personality as BBRs pro- 
jects anew framework for research in developmental psychol- 
ogy that will make developmental psychology fundamental 
for the fields of personality and personality measurement (see 
Staats, 1966). 


Social Psychology 


The basic principles of learning behavior and the human 
learning principles pertain to single individuals. But much 
learning of humans and much human behavior occur in social 
interaction. While learning and behavior follow the basic 
principles, principles of social interaction can be abstracted 
that are useful in understanding personality formation and 
function. 

Take the child’s learning of the personality repertoires. 
Very central elements are formed in the parent-child interac- 
tions. And that process will be influenced greatly by the 
BBRs the child learns to the parent (as a stimulus object), as 
well as the reverse. To illustrate, the parent ordinarily pro- 
vides for the child’s needs, which means the presentation of 
positive emotional stimuli (food, warmth, caresses) paired 
with the parent. The parent comes thereby to elicit a very pos- 
itive emotional response (love) in the child. And that is im- 
portant to the child’s further learning, for the more positive 
emotion the parent elicits, the more effective the parent will 
be in promoting the child’s learning. That follows from PB’s 
social psychological principle that the stronger a person 


Personality Theory for the Twenty-First Century 153 


elicits a positive emotional response in another individual the 
more effective the person will be as a reinforcing and direc- 
tive stimulus for the individual. That means that the parent 
who is more loved will be more effective in rewarding the 
child for a desired behavior or in admonishing the child for 
an undesirable behavior. The more loved parent will also be a 
stronger “incentive” for the child to follow in learning via im- 
itation. Moreover, generalization will occur to other people 
so the child has learned a general personality trait. 

The point is that the PB framework calls for research that 
concerns how social interaction principles (see Staats, 1996) 
are involved in personality formation and function. 


Personality Tests and Measurement 


There is not room in this chapter to deal with the nature of the 
field of psychological measurement as a science. However, it 
shares the same weakness as the field of personality already 
described and repairing those weaknesses calls for many stud- 
ies of different types, including linking psychological mea- 
surement to other fields of psychology, such as that of learning. 
Traditional behaviorism never made sense of how the concepts 
and methods of psychological testing are related to behavior- 
ism concepts and methods (see Skinner, 1969, pp. 77-78). The 
conceptual gap between the two sets of knowledge is just too 
wide. To understand tests and test construction methods in be- 
havioral terms, it is necessary to have the concepts and princi- 
ples of a behavioral theory of personality, so the developments 
made by PB are necessary for bridging the gap. PB introduces 
the position that tests can provide information about behavior 
and personality. 

Let me begin by making a behavioral analysis of test 
construction methods, in a manner that answers the question 
of why psychological tests can predict later behavior. Tradi- 
tionally, tests are thought to predict behavior because they 
measure an unobservable process-entity of personality. 
Rather, tests can predict behavior because that is what they 
are constructed to do. That is, the test constructor first gathers 
a group of items. But in test construction only those items that 
do predict the behavior of interest are retained. Sometimes the 
test constructor first selects items without any justifying ra- 
tionale. Sometimes, however, the test constructor first selects 
items that are believed to be measures of the personality trait. 
But this selection difference does not matter, for in both cases 
the test constructor discards and retains items on the basis of 
which ones relate to (predict) the behavior of interest. 

The next question is why items are related to behaviors. 
Some, influenced by radical behaviorism, have assumed that 
the test item and the predicted behavior are, and should be, the 
same. However, in most cases that is not true. One real reason 
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for item-behavior relationships is that the test measures an 
element of a BBR or the verbal labeling of that repertoire. For 
example, we would find that a group of people who affirma- 
tively answered the item “I am an excellent athlete” would also 
display more athletic ability than would a group who answered 
negatively. The two behaviors are in the same repertoire. Peo- 
ple generally learn to describe their own behavior with some 
accuracy (but there are variations in that respect). 

It can also be the case that a test, because of how it was 
constructed, measures a BBR that is necessary for the learning 
of the predicted behavior. Intelligence tests are a prime exam- 
ple. Behavioral analysis of IQ test items reveals that many test 
whether the child has the language-cognitive elements. Most 
of the items, for example, test for the child’s verbal-motor 
repertoire that is necessary for following instructions. Others 
test the number concept repertoire, the counting and other 
arithmetic repertoires, and the verbal-labeling repertoire. The 
manner in which items on the WPPSI (Wechsler, 1967) mea- 
sure aspects of the sensorimotor repertoire has been described 
earlier. Why do such items predict later school performance? 
The answer is that the items measure basic behavioral reper- 
toires the child needs to be successful in learning materials 
that are later presented in school. So the items correlate with 
school performance. 

Other tests measure the emotional-motivational BBR. Con- 
sider an interest test. Constructing the test involves gathering a 
number of items together that are thought to represent a range 
of interests that are occupationally relevant. But the important 
part involves the standardization procedures. The items are 
given to different occupational groups, and those that distin- 
guish the groups are retained and organized (keyed). When the 
test is used, it can be ascertained whether the individual an- 
swers the items in a manner that is like some particular occu- 
pational group. What does this mean in the PB analysis? The 
answer is that the items measure emotional responses (indi- 
cated, e.g., by like-dislike) to different life stimuli. So the indi- 
vidual’s test responses reveal life stimuli to which the 
individual has positive and negative emotional responses. Re- 
member that those life stimuli the individual likes or dislikes 
will also serve as positive or negative reinforcers and incen- 
tives. Thus, if the individual has emotional responses to life 
stimuli that are like people who are successful in some occu- 
pation, then the individual should be happy in that same work 
situation. Moreover, the individual should be reinforced by 
that work and be attracted to it incentively. That means that 
other things equal, the individual should work harder in the 
job, study relevant material more, and so on. That is why in- 
terest tests predict job success. 

It is important to bridge the psychological testing— 
behaviorism gap, for unifying the two traditions produces 


new knowledge. For example, in terms of the present theory 
of personality, the various existent psychological tests are 
an invaluable source of knowledge for defining the basic 
behavioral repertoires. PB’s basic experimental studies, de- 
velopmental studies, and behavior therapy studies have 
been important avenues of definition. But the manner in 
which psychological tests have been constructed means that 
their items measure elements of BBRs that constitute as- 
pects of personality. The extensive work of behaviorally an- 
alyzing the items of psychological tests can be expected to 
tell us much about the content of personality (see Staats, 
1996). And, as indicated, those analyses will then yield di- 
rectives for conducting research on how the BBRs involved 
are learned and how they function in producing the individ- 
ual’s behavior. We have already trained children to be more 
intelligent (Staats & Burns, 1981) by training them in basic 
behavioral repertoires. In addition, interest and values (see 
Staats, 1996) tests have been shown to measure aspects of 
the learned emotional-motivational BBR. Those findings 
merely open the way. 

Other positive avenues of development emerge from the 
conceptual unification of tests and PB theory. For one thing, 
the unified theory enables us to understand what tests are. 
That should be valuable in constructing tests. The approach 
provides an avenue for defining in objective, stipulable 
terms just what personality is. That should be valuable in 
using tests, namely, that test items—not just total scores— 
when analyzed behaviorally, describe the content of person- 
ality traits of the individual. This conception of tests, 
moreover, says that tests can yield more than prediction; 
they can describe the contents of personality traits and thus 
the nature of the individual’s BBR being measured. With 
study of how people come to learn those personality traits 
we will have knowledge on how to avoid doing things that 
will give children undesirable traits, while doing things to 
give them desirable traits. And behaviorally-analyzed tests 
will also give specific information regarding what remedial 
treatment needs to do. 

Many studies are needed that analyze existing tests in 
terms of the behavioral repertoires they assess, as already 
demonstrated in PB experiments. With that knowledge tests 
could be compared to one another in a way that would make 
sense of the field. At present tests are independent entities; 
they are not related to each other. Many tests of different as- 
pects of personality are actually redundant and share types of 
items (e.g., interest, values, and needs tests, on the one hand, 
and fears, anxiety, and stress tests, on the other). 

The field of testing does not relate itself to the content areas 
of psychology or to personality theories. The analysis of tests 
in terms of BBRs provides a means for doing so. Studying 


how the repertoires on tests are learned and function will lead 
to studies that are relevant in different areas of psychology. 
There is a vast amount of research to be conducted within this 
framework. The results of that research will help organize the 
presently chaotic knowledge of the field. That research will 
help relate the field of testing to the other fields of psychology. 
That research will render theoretically meaningful many 
works that exist in this field. And the knowledge produced 
should also enable the field to construct better tests. 


Abnormal Psychology 


The PB position is that a theory of personality should contain 
principles and concepts for formulating a theory of abnormal 
psychology. Freud’s psychoanalytic theory was composed to 
have that potentiality, and this was elaborated in others 
works. Radical behaviorism has not produced such a theory, 
nor has the traditional behavioral field. 

Psychological behaviorism, however, began a new devel- 
opment in behaviorism when it analyzed the opposite speech 
of the schizophrenic patient (Staats, 1957). Not only was the 
abnormal symptom considered as a behavior, but the analysis 
also indicated how the symptom was learned and how it 
could be extinguished and replaced with normal behavior. In 
the early presentation of PB (see Staats, 1963) one chapter 
was devoted to further formulation of its theory of abnormal 
behavior. This theory was employed in the social learning 
theory of abnormal behavior (Bandura, 1968) and in later be- 
havioral works of various kinds. However, the PB theory was 
developed a good deal further after its theory of personality 
was systematically formulated (Staats, 1975) and then further 
extended (Staats, 1989, 1996). 

The PB theory of abnormal behavior follows the theory of 
personality schematized in Figure 6.1. However, each term in 
the causal circumstances can be normal or abnormal and result 
in abnormal behavior. With respect to biological conditions, 
O,, O,, and O, may be abnormal in some way. For example, 
because of organic conditions a child with Down syndrome 
does not learn normally and will display deficits in the BBRs 
and thus not behave normally in various life situations, such as 
school. The same is true of the OH and O°. When they are ab- 
normal, they will produce abnormal behavior. 

In addition, the behavioral variables in the schematized 
theory of personality can be either normal or abnormal. In 
this case abnormal can mean either deficits in what should be 
or inappropriate conditions that should not occur. The origi- 
nal learning, for example, S,, may be deficit or inappropriate 
and produce deficit or inappropriate BBRs that will result in 
deficit or inappropriate behavior in later situations such that 
the individual will be diagnosed as abnormal. The deficit or 
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inappropriate conditions can also occur in S, and produce be- 
havior that will be judged as abnormal. 

The task is to analyze, for the various diagnostic categories, 
these various behavioral or organic conditions that produce 
abnormal behavior. Each such analysis constitutes a theory of 
the disorder involved that can be employed by therapists or 
parents. For example, if the deficit or inappropriate conditions 
occur at S,, the analysis can be used to instruct parents how to 
see to it that the child does not develop abnormal BBRs. The 
analysis will also provide the practitioner with knowledge 
about how to correct the abnormal conditions and treat the be- 
havior disorder after it has occurred. For example, PB works 
have presented analyses of developmental disorders, develop- 
mental reading disorder, autism, and mental retardation 
(Staats, 1996). In addition, PB theories of depression, the anx- 
iety disorders (phobic disorder, generalized anxiety disorder, 
panic disorder, obsessive-compulsive disorder, and posttrau- 
matic stress disorder) have also been presented. Various other 
behavior therapists have produced other analyses of behavior 
disorders that use elements from PB theory. However, typi- 
cally they do not employ the full approach, and there remains 
a general need to stipulate the elements of abnormal BBRs fur- 
ther, how they are learned, and how they have their effects 
in a general theory of abnormal behavior. 

The PB theory of abnormal behavior takes the position 
that traditional descriptions of categories of abnormal behav- 
ior (see DSM-IV; American Psychiatric Association, 1994) 
are valuable. PB analyses of behavior disorders yield exten- 
sive implications for research that PB suggests for the 
twenty-first century. So, in addition to those already made, 
many analyses of the various behavior disorders are needed. 
Such analyses need to be empirically verified, their implica- 
tions for prevention and treatment assessed, and their impli- 
cations for test construction exploited. Centrally, research is 
needed that gathers observations of the development of ab- 
normal behavior through learning that, strangely enough, 
have never been made. 


Application of the Personality Theory 


From the beginning the PB position has been that basic and 
applied work should be closely related in psychology but 
presently are not. For example, the field of animal learning has 
ceased providing useful information to the various areas of 
human study because the field needs input from those areas 
concerning important things to study. As an example in the 
other direction, a personality theory in the PB view should 
have implications for the improvement of psychology’s fields 
of practice. To illustrate, my own personal experience has 
exposed me to cases of disadvantageous parenting that re- 
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sulted from following psychoanalytic theory or traditional 
developmental (biologically oriented) theory. Such cases 
of applied failure represent disconfirmation of the theory. A 
good theory should yield good applications. An important part 
of PB’s development, thus, has been directed toward practice, 
as will be briefly mentioned. 


Clinical Applications 


The analysis of the opposite speech of the schizophrenic pa- 
tient contained clinical directives. The analysis said that the 
opposite speech was learned and maintained via the inadver- 
tent reinforcement provided by the professional staff (Staats, 
1957). That analysis led directly to applications (Ayllon & 
Michael, 1959). As another example, PB’s token reinforcer 
system was employed as the token economy in dealing with 
hospitalized psychotic patients (Ayllon & Azrin, 1968). Psy- 
chological behaviorism analyses and reinforcement methods 
have been used to train mentally retarded children (Bijou, 
1965; Birnbrauer, Bijou, Wolf, & Kidder, 1965) and autistic 
children (Lovaas, 1977), to toilet train children (Azrin & 
Foxx, 1974), and to treat juvenile delinquents in different set- 
tings (see Staats & Butterfield, 1965; Wolf, et al., 1976). Wolf, 
Risely, and Mees (1964) used the PB approach of working in 
the naturalistic situation, including PB’s time-out procedure, 
in their seminal study to treat an autistic child’s behavior prob- 
lems. Many of the other extensions of PB’s methods, as sug- 
gested for a wide variety of children’s problems (see Staats, 
1963; Staats & Butterfield, 1965; Staats & Staats, 1962) were 
accomplished by others, creating the body of works con- 
tributed to the establishment of the field of behavior analysis. 
As another example, the PB theory of language provided a 
basis for understanding why traditional verbal psychotherapy 
could be used to change behavior therapeutically laying a 
foundation for the field of cognitive behavior therapy (Staats, 
1972). Radical behaviorism, however, rejected for some 
16 years. Finally, verbal therapy was later accepted as though 
it were a derivative of radical behaviorism (Hamilton, 1988; 
Hayes & Wilson, 1994). Additional projections of clinical 
research and treatment have been outlined based on the addi- 
tional developments of PB (Staats, 1996, chap. 8). 


Educational Psychology Applications 


The PB research on reading and treatment of nonreading has 
already been mentioned. Reading was conceptualized as a 
later elaboration of the language-cognitive BBR. Learned on 
the foundation of the repertoires of language, it is a complex 
repertoire that requires long-term training and a huge number 
of training trials. The subrepertoires of reading, when they 


have been acquired, serve various learning functions for the 
individual in later school learning (Staats, 1975). The PB 
theory of reading focuses on this extensive learning and de- 
nies the existence of biological defects responsible for learn- 
ing disabilities such as dyslexia because the children have 
normal intelligence, which means normal language BBRs. 
PB research and analysis thus states the definitive principle 
that if the child has developed normal language, then the 
child has all the cognitive ability needed to learn reading per- 
fectly well because no additional abilities are required for 
reading (see Staats, 1975). 

Dyslexia arises because there is inadequate reinforcement 
to maintain the child’s attention and participation in the long 
task. I designed the token reinforcer system to solve the moti- 
vation problem by providing reinforcement for the child’s at- 
tention and participation. The system works widely, as shown 
by its use in the multitude of studies and programs designed to 
treat reading and other developmental academic disorders (see 
Burns & Kondrick, 1998; Sulzer-Azeroff & Mayer, 1986). The 
Sylvan Learning Centers enterprise by its use of the token re- 
inforcer system validates the system as well as the PB theories 
of developmental academic disorders (see Staats, 1963, 1968, 
1975, 1996). The PB theories of the various academic reper- 
toires (reading, writing, counting, number operations, math) 
provide the foundation for deriving a large body of additional 
research to understand school learning and to solve the prob- 
lems of school learning. The educational field’s absorption 
with cognitive psychology stands in the way of the vast re- 
search and application that would advance education so much. 


CONCLUSION 


The PB theory of personality is set in a general theory that goes 
from the study of basic learning, including the biology of that 
learning, through the multiple levels of study that provide 
its principles and concepts. The theory of personality, thus, is 
sunk into general psychology, making connections to various 
fields in psychology. It is specific, objective, and empirical. It 
draws widely on various areas of study, and it has implications 
for conducting large amounts of additional research and appli- 
cation in various areas and fields of study. The theory provides 
a philosophy of science and methodology of theory construc- 
tion. This is the only theory of personality that claims it can be 
employed to establish or to change personality, a claim that if 
fulfilled would have enormous importance. It is the only the- 
ory that is unified and has comprehensive scope—sorely 
needed developments for the field and psychology generally. 
It is a theory that ties together personality and personality 
measurement on a broad front. And it projects new areas and 


topics of research. An important need for the twenty-first cen- 
tury is to compare this theory with others as part of the general 
comparison and evaluation of personality theories called for 
by PB. Another is to exploit the theory in the various areas of 
theoretical analysis and empirical research it suggests. 
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Cognitive-experiential self-theory (CEST) is a broadly inte- 
grative theory of personality that is compatible with a variety 
of other theories, including psychodynamic theories, learn- 
ing theories, phenomenological self-theories, and modern 
cognitive scientific views on information processing. CEST 
achieves its integrative power primarily through three as- 
sumptions. The first is that people process information by two 
independent, interactive conceptual systems, a preconscious 
experiential system and a conscious rational system. By intro- 
ducing a new view of the unconscious in the form of an expe- 
riential system, CEST is able to explain almost everything that 
psychoanalysis can and much that it cannot, and it is able to do 
so in a scientifically much more defensible manner. The sec- 
ond assumption is that the experiential system is emotionally 
driven. This assumption permits CEST to integrate the pas- 
sionate phallus-and-tooth unconscious of psychoanalysis 
with the “kinder, gentler” affect-free unconscious of cognitive 
science (Epstein, 1994). The third assumption is that four 
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basic needs, each of which is assumed in other theories to be 
the one most fundamental need, are equally important accord- 
ing to CEST. 

In this chapter, I review the basic assumptions of CEST, 
summarize the research conducted to test the theory, and note 
the implications of the theory for research and psychotherapy. 


TWO INFORMATION-PROCESSING SYSTEMS 


According to CEST, humans operate by two fundamental 
information-processing systems: a rational system and an 
experiential system. The two systems operate in parallel and 
are interactive. CEST has nothing new to say about the ratio- 
nal system, other than to emphasize the degree to which it is 
influenced by the experiential system. CEST does have a 
great deal to say about the experiential system. In effect, 
CEST introduces a new system of unconscious processing in 
the experiential system that is a substitute for the unconscious 
system in psychoanalysis. Although like psychoanalysis, 
CEST emphasizes the unconscious, it differs from psycho- 
analysis in its conception of how the unconscious operates. 
Before proceeding further, it should be noted that the word 
rational as used in the rational system refers to a set of 
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analytical principles and has no implications with respect to 
the reasonableness of the behavior, which is an alternative 
meaning of the word. 

It is assumed in CEST that everyone, like it or not, auto- 
matically constructs an implicit theory of reality that includes 
a self-theory, a world-theory, and connecting propositions. 
An implicit theory of reality consists of a hierarchical organi- 
zation of schemas. Toward the apex of the conceptual struc- 
ture are highly general, abstract schemas, such as that the self 
is worthy, people are trustworthy, and the world is orderly 
and good. Because of their abstractness, generality, and their 
widespread connections with schematic networks throughout 
the system, these broad schemas are normally highly stable 
and not easily invalidated. However, should they be invali- 
dated, the entire system would be destabilized. Evidence that 
this actually occurs is provided by the profound disorganiza- 
tion following unassimilable experiences in acute schizo- 
phrenic reactions (Epstein, 1979a). At the opposite end of the 
hierarchy are narrow, situation-specific schemas. Unlike the 
broad schemas, the narrower ones are readily susceptible to 
change, and their changes have little effect on the stability of 
the personality structure. Thus, the hierarchical structure of 
the implicit theory allows it to be stable at the center and flex- 
ible at the periphery. It is important to recognize that unlike 
other theories that propose specific implicit or heuristic rules 
of information processing, it is assumed in CEST that the ex- 
periential system is an organized, adaptive system, rather 
than simply a number of unrelated constructs or so-called 
cognitive shortcuts (e.g., Tversky & Kahneman, 1974). As it 
is assumed in CEST that the experiential system in humans is 
the same system by which nonhuman animals adapt to their 
environments, it follows that nonhuman animals also have an 
organized model of the world that is capable of disorganiza- 
tion. Support for this assumption is provided by the wide- 
spread dysfunctional behavior that is exhibited in animals 
when they are exposed to emotionally significant unassimil- 
able events (e.g., Pavlov, 1941). 

Unlike nonhuman animals, humans have a conscious, ex- 
plicit theory of reality in their rational system in addition to 
the model of reality in their experiential system. The two the- 
ories of reality coincide to different degrees, varying among 
individuals and situations. 


Comparison of the Operating Principles 
of the Two Systems 


The experiential system in humans is the same system with 
which other higher order animals have adapted to their envi- 
ronments over millions of years of evolution. It adapts by 
learning from experience rather than by logical inference, 


TABLE 7.1 Comparison of the Experiential and Rational Systems 


Experiential System Rational System 


1. Holistic. 1. Analytic. 


2. Emotional; pleasure-pain 2. Logical; reason oriented 
oriented (what feels good). (what is sensible). 


3. Associationistic connections. 3. Cause-and-effect connections. 


4. Outcome oriented. 4. Process oriented. 


5. Behavior mediated by vibes 5. Behavior mediated by con- 
from past experience. scious appraisal of events. 

6. Encodes reality in concrete 6. Encodes reality in abstract 
images, metaphors, and symbols, words, and 
narratives. numbers. 

7. More rapid processing; oriented 7. Slower processing; oriented 
toward immediate action. toward delayed action. 

8. Slower to change; changes with 8. Changes more rapidly; 
repetitive or intense experience. changes with speed of 

thought. 
9. More crudely differentiated; 9. More highly differentiated; 


broad generalization gradient; dimensional thinking. 


categorical thinking. 
10. More crudely integrated; 10. More highly integrated. 
dissociative, organized 
in part by emotional 
complexes (cognitive- 
affective modules). 


11. Experienced passively and 11. Experienced actively and 
preconsciously; seized by consciously; in control of 
our emotions. our thoughts. 

12. Self-evidently valid: “Seeing is 12. Requires justification via 


believing.” logic and evidence. 


Note. Adapted from Cognitive-experiential self-theory: An integrative the- 
ory of personality by S. Epstein, 1991, in R. C. Curtis, editor, The relational 
self: Theoretical convergences in psychoanalysis and social psychology, 
New York: Guilford. Adapted by permission. 


which is the exclusive domain of the rational system. The 
experiential system operates in a manner that is preconscious, 
automatic, rapid, effortless, holistic, concrete, associative, 
primarily nonverbal, and minimally demanding of cognitive 
resources (see Table 7.1 for a more complete comparison of 
the two systems). It encodes information in two ways: as 
memories of individual events, particularly events that were 
experienced as highly emotionally arousing, and also in a 
more abstract, general way. 

Although the experiential system is a cognitive system, its 
operation is intimately related to the experience of affect. It is, 
in fact, inconceivable that a conceptual system that learns 
from experience would not be used to facilitate positive affect 
and avoid negative affect. According to CEST, the experien- 
tial system both influences and is influenced by affect. Not 
only does the experiential system direct behavior in a manner 
anticipated to achieve pleasurable outcomes and to avoid un- 
pleasurable ones, but the cognitions themselves are influenced 
by affect. As noted previously, the experiential conceptual 


system, according to CEST, is emotionally driven. After this 
is recognized, it follows that the affect-free unconscious 
proposed by cognitive scientists is untenable. The automatic, 
preconscious experiential conceptual system that regulates 
everyday behavior is of necessity an emotionally driven, dy- 
namic unconscious system. Because affect determines what is 
attended to and what is reinforced, without affect there would 
be neither schemas nor motivation in the experiential system, 
and, therefore, no experiential system. It follows that CEST is 
as much an emotional as a cognitive theory. 

In contrast to the experiential system, the rational system 
is an inferential system that operates according to a person’s 
understanding of the rules of reasoning and of evidence, 
which are mainly culturally transmitted. The rational system, 
unlike the experiential system, has a very brief evolutionary 
history. It operates in a manner that is conscious, analytical, 
effortful, relatively slow, affect-free, and highly demanding 
of cognitive resources (see Table 7.1). 

Which system is superior? At first thought, it might seem 
that it must be the rational system. After all, the rational 
system, with its use of language, is a much more recent evo- 
lutionary development than is the experiential system, and it 
is unique to the human species. Moreover, it is capable 
of much higher levels of abstraction and complexity than is 
the experiential system, and it makes possible planning, long- 
term delay of gratification, complex generalization and 
discrimination, and comprehension of cause-and-effect rela- 
tions. These attributes of the rational system have been the 
source of humankind’s remarkable scientific and technologi- 
cal achievements. Moreover, the rational system can under- 
stand the operation of the experiential system, whereas the 
reverse is not true. 

On the other side of the coin, carefully consider the fol- 
lowing question: If you could have only one system, which 
would you choose? Without question, the only reasonable 
choice is the experiential system. You could exist with an ex- 
periential system without a rational system, as the existence 
of nonhuman animals testifies, but you could not exist with 
only a rational system. Even mundane activities such as 
crossing a street would be excessively burdensome if you had 
to rely exclusively on conscious reasoning. Imagine having 
to estimate your walking speed relative to that of approaching 
vehicles so that you could determine when to cross a street. 
Moreover, without a system guided by affect, you might not 
even be able to decide whether you should cross the street. 
Given enough alternative activities to consider, you might re- 
main lost in contemplation at the curb forever. 

The experiential system also has other virtues, including 
the ability to solve some kinds of problems that the rational 
system cannot. For example, by reacting holistically, the 
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experiential system can respond adaptively to real-life prob- 
lems that are too complex to be analyzed into their compo- 
nents. Also, there are important lessons in living that can be 
learned directly from experience and that elude articulation 
and logical analysis. Moreover, as our research has demon- 
strated, the experiential system is more strongly associated 
with the ability to establish rewarding interpersonal relation- 
ships, with creativity, and with empathy than is the rational 
system (Norris & Epstein, 2000b). Most important is that the 
experiential system has demonstrated its adaptive value over 
millions of years of evolution, whereas the rational system has 
yet to prove itself and may yet be the source of the destruction 
of the human species as well as all other life on earth. 

Fortunately, there is no need to choose between the sys- 
tems. Each has its advantages and disadvantages, and the ad- 
vantages of one can offset the limitations of the other. Besides, 
we have no choice in the matter. We are they, and they are us. 
Where we do have a choice is in improving our ability to use 
each and to use them in a complementary manner. As much as 
we might wish to suppress the experiential system in order to 
be rational, it is no more possible to accomplish this than to 
stop breathing because the air is polluted. Rather than achiev- 
ing control by denying the experiential system, we lose con- 
trol when we attempt to do so: By being unaware of its 
operation, we are unable to take its influence into account. 
When we are in touch with the processing of the experiential 
system, we can consciously decide whether to heed or dis- 
count its influence. Moreover, if, in addition, we understand 
its operation, we can begin to take steps to improve it by 
providing it with corrective experiences. 


How the Experiential System Operates 


As noted, the operation of the experiential system is intimately 
associated with the experience of affect. For want of a better 
word, I shall use the word vibes to refer to vague feelings that 
may exist only dimly (if at all) in a person’s consciousness. 
Stating that vibes often operate outside of awareness is not 
meant to imply that people cannot become aware of them. 
Vibes are a subset of feelings, which include other feelings 
that are more easily articulated than vibes, such as those that 
accompany standard emotions. Examples of negative vibes 
are vague feelings of agitation, irritation, tension, disquietude, 
queasiness, edginess, and apprehension. Examples of positive 
vibes are vague feelings of well-being, gratification, positive 
anticipation, calmness, and light-heartedness. 

When a person responds to an emotionally significant 
event, the sequence of reactions is as follows: The experiential 
system automatically and instantaneously searches its mem- 
ory banks for related events. The recalled memories and 
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feelings influence the course of further processing and of be- 
havioral tendencies. If the recalled feelings are positive, the 
person automatically thinks and has tendencies to act in ways 
anticipated to reproduce the feelings. If the recalled feelings 
are negative, the person automatically thinks and has tenden- 
cies to act in ways anticipated to avoid experiencing the feel- 
ings. As this sequence of events occurs instantaneously and 
automatically, people are normally unaware of its operation. 
Seeking to understand their behavior, they usually succeed in 
finding an acceptable explanation. Insofar as they can manage 
it without too seriously violating reality considerations, they 
will also find the most emotionally satisfying explanation 
possible. This process of finding an explanation in the rational 
system for what was determined primarily by the experiential 
system and doing so in a manner that is emotionally acceptable 
corresponds to what is normally referred to as rationalization. 
According to CEST, such rationalization is a routine process 
that occurs far more often than is generally recognized. Ac- 
cordingly, the influences of the experiential system on the ra- 
tional system and its subsequent rationalization are regarded, 
in CEST, as major sources of human irrationality. 


The Four Basic Needs 


Almost all of the major theories of personality propose a sin- 
gle, most basic need. CEST considers the four most often 
proposed needs as equally basic. It is further assumed in 
CEST that their interaction plays an important role in behav- 
ior and can account for paradoxical reactions that have 
eluded explanation by other theoretical formulations. 


Identification of the Four Basic Needs 


In classical Freudian theory, before the introduction of the 
death instinct, the one most basic need was the pleasure prin- 
ciple, which refers to the desire to maximize pleasure and min- 
imize pain (Freud, 1900/1953). Most learning theorists make 
a similar implicit assumption in their view of what constitutes 
reinforcement (e.g., Dollard & Miller, 1950). For other theo- 
rists, such as object-relations theorists, most notably Bowlby 
(1988), the most fundamental need is the need for relatedness. 
For Rogers (1951) and other phenomenological psycholo- 
gists, itis the need to maintain the stability and coherence of a 
person’s conceptual system. For Allport (1961) and Kohut 
(1971), it is the need to enhance self-esteem. (For a more thor- 
ough discussion of these views, see Epstein, 1993.) Which 
of these views is correct? From the perspective of CEST, they 
are all correct, because each of the needs is basic—but they are 
also all incorrect because of their failure to recognize that 
the other needs are equally fundamental. They are equally 
fundamental in the sense that each can dominate the others. 


Moreover, there are equally serious consequences, including 
disorganization of the entire personality structure, when any 
one of the needs is insufficiently fulfilled. 


Interactions Among the Basic Needs 


Given four equally important needs that can operate simulta- 
neously, it follows that behavior is determined by the com- 
bined influence of those needs that are activated in a particular 
situation. An important adaptive consequence of such influ- 
ence is that the needs serve as checks and balances against 
each other. When any need is fulfilled at the expense of the 
others, the intensity of the others increases, thereby increasing 
the motivation to satisfy the other needs. However, under 
certain circumstances the frustration of a need may be so great 
that frustration of the other needs is disregarded, which can 
have serious maladaptive consequences. As is shown next, 
these assumptions about the interaction of basic needs can 
resolve some important, otherwise paradoxical findings. 

The finding that normal people characteristically have 
unrealistic self-enhancing and optimistic biases (Taylor & 
Brown, 1988) has evoked considerable interest because it ap- 
pears to contradict the widely held assumption that reality 
awareness is an important criterion of mental health. From the 
perspective of CEST, this finding does not indicate that reality 
awareness is a false criterion of mental health, but only that it 
is not the only criterion. According to CEST, a compromise 
occurs between the need to realistically assimilate the data of 
reality into a stable, coherent conceptual system and the need 
to enhance self-esteem. The result is a modest self-enhancing 
bias that is not unduly unrealistic. It suggests that normal in- 
dividuals tend to give themselves the benefit of the doubt in 
situations in which the cost of slight inaccuracy is outweighed 
by the gain in positive feelings about the self. Note that this as- 
sumes that the basic need for a favorable pleasure-pain bal- 
ance is also involved in the compromise. 

There are more and less effective ways of balancing basic 
needs. A balance that is achieved among equally unfulfilled 
competing needs is a prescription for chronic distress—not 
good adjustment. Whereas poorly adjusted people tend to ful- 
fill their basic needs in a conflictual manner, well-adjusted 
people fulfill their basic needs in a synergistic manner, in 
which the fulfillment of one need contributes to rather than 
conflicts with the fulfillment of the other needs. They thereby 
maintain a stable conceptual system, a favorable pleasure- 
pain balance, rewarding interpersonal relationships, and a 
high level of self-esteem. 

Let us first consider an example of a person who balances 
her basic needs in a synergistic manner and then consider an 
opposite example. Mary is an emotionally stable, happy per- 
son with high self-esteem who establishes warm, rewarding 


relationships with others. She derives pleasure from helping 
others. This contributes to her self-esteem, as she is proud of 
her helpful behavior and others admire and appreciate her for 
it. As a result, Mary’s behavior also contributes to favorable 
relationships with others. Thus, Mary satisfies all her basic 
needs in a harmonious manner. 

Now, consider a person who fulfills his basic needs in a 
conflictual manner. Ralph is an unhappy, unstable person 
with low self-esteem who establishes poor relationships with 
others. Because of his low self-esteem, Ralph derives plea- 
sure from defeating others and behaving in other ways that 
make him feel momentarily superior. Not surprisingly, this 
alienates people, so he has no close friends. Because of his 
low self-esteem and poor relationships with others, he antici- 
pates rejection, from which he protects himself by maintain- 
ing a distance from people. His low self-esteem and poor 
relationships with others contribute to feelings of being un- 
worthy of love as well as to an unfavorable pleasure-pain bal- 
ance. Because his conceptual system is failing to fulfill its 
function of directing his behavior in a manner that fulfills his 
basic needs, it is under the stress of potential disorganization, 
which he experiences in the form of anxiety. The more his 
need for enhancing his self-esteem is thwarted, the more he 
acts in a self-aggrandizing manner, which exacerbates his 
problems with respect to fulfilling his other basic needs. 


Imbalances in the Basic Needs as Related 
to Specific Psychopathologies 


Specific imbalances among the basic needs are associated 
with specific mental disorders. For present purposes, it will 
suffice to present some of the more obvious examples. 

Paranoia with delusions of grandeur can be understood as 
a compensatory reaction to threats to self-esteem. In a des- 
perate attempt to buoy up self-esteem, paranoid individuals 
disregard their other needs. They sacrifice their need to main- 
tain a favorable pleasure-pain balance because their desperate 
need to maintain their elevated self-esteem is continuously 
threatened. They sacrifice their need to maintain relationships 
because their grandiose behavior alienates others who do not 
appreciate being treated as inferiors and who are repelled by 
their unrealistic views. The situation is somewhat more com- 
plicated with respect to their need to realistically assimilate 
the data of reality into a coherent, stable, conceptual system. 
They sacrifice the reality aspect of this need but not the co- 
herence aspect. In both of these respects they are similar to 
paranoid individuals with delusions of persecution, consid- 
ered in the next example. 

Paranoia with delusions of persecution can be understood 
as a desperate attempt to defend the stability of a person’s con- 
ceptual system and, to a lesser extent, to enhance self-esteem. 
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By viewing their problems in living as resulting from persecu- 
tion by others, paranoid people with delusions of persecution 
can focus all their attention and resources on defending them- 
selves. Such focus and mobilization provide a highly unifying 
state that serve as an effective defense against disorganization. 
Delusions of persecution also contribute to self-esteem be- 
cause the perception of the persecutors as powerful or presti- 
gious, which is invariably the case, implies that the target of 
the persecution must also be important. The basic needs that 
are sacrificed are the pleasure principle, as being persecuted 
is a terrifying experience, and the need for relatedness, as 
others are either viewed as enemies or repelled by the unreal- 
istic behavior. 

Schizophrenic disorganization can be understood as the 
best bargain available for preventing extreme misery under 
desperate circumstances in which fulfillment of the basic 
needs is seriously threatened. Ultimate disorganization is a 
state devoid of conceptualization and (relatedly) therefore of 
feelings. Although its anticipation is dreaded, its occurrence 
corresponds to a state of nonbeing, a void in which there are 
neither pleasant nor unpleasant feelings (Jefferson, 1974). 
Thus, what is gained is a net improvement in the pleasure- 
pain balance (from a negative to a zero value). What is sacri- 
ficed are the needs to maintain the stability of the conceptual 
system, to maintain relatedness, and to enhance self-esteem. 


The Four Basic Beliefs 


The four basic needs give rise to four corresponding basic 
beliefs, which are among the most central constructs in a per- 
sonal theory of reality. They therefore play a very important 
role in determining how people think, feel, and behave in the 
world. Moreover, as previously noted, because of their domi- 
nant and central position and their influence on an entire 
network of lower-order beliefs, should any of them be invali- 
dated, the entire conceptual system would be destabilized. An- 
ticipation of such disorganization would be accompanied by 
overwhelming anxiety. The disorganization, should it occur 
(as previously noted) would correspond to an acute schizo- 
phrenic reaction. 

The question may be raised as to how the four basic needs 
give rise to the development of four basic beliefs. Needs, or 
motives, in the experiential system, unlike those in the rational 
systems, always include an affective component. They there- 
fore determine what is important to a person at the experiential 
level and what a person is spontaneously motivated to pursue 
or avoid. Positive affect is experienced whenever a need is 
fulfilled, and negative affect is experienced whenever the 
fulfillment of a need is frustrated. Because people wish to 
experience positive affect and to avoid negative affect, they 
automatically attend to whatever is associated with the 
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fulfillment or frustration of a basic need. As a result, they de- 
velop implicit beliefs associated with each of the basic needs. 
Let us examine this idea in greater detail. 

Depending on a person’s history in fulfilling the need to 
maximize pleasure and minimize pain, a person tends to de- 
velop a basic belief about the world along a dimension vary- 
ing from benign to malevolent. Thus, if a person experienced 
an environment that was predominantly a source of pleasure 
and security, the person will most likely develop the basic be- 
lief that the world is a good place in which to live. If a person 
has the opposite experiences, the person will tend to develop 
the opposite basic belief. The basic belief about the benignity 
versus malevolence of the world is the core of a network of 
related beliefs, including optimistic versus pessimistic views 
about future events. 

Corresponding to the basic need to represent the data of 
reality in a stable and coherent conceptual system is a basic 
belief about the world that varies along a dimension of mean- 
ingful versus meaningless. Included in the network of related 
beliefs are beliefs about the predictability, controllability, and 
justness of the world versus its unpredictability, uncontrolla- 
bility, and lack of justice. Corresponding to the basic need for 
relatedness is a basic belief about people that varies along a 
dimension from helpful and trustworthy to dangerous and 
untrustworthy. Included in the network of related beliefs are 
beliefs about the degree to which people are loving versus 
rejecting and trustworthy versus untrustworthy. Correspond- 
ing to the basic need for self-enhancement is a basic belief 
about the self that varies along a dimension from worthy to 
unworthy. Included in the network of related beliefs are be- 
liefs about how competent, moral, worthy of love, and strong 
the self is compared to how incompetent, immoral, unworthy 
of love, and weak it is. 


Interaction of the Experiential and Rational Systems 


As previously noted, according to CEST, the experiential and 
rational systems operate in parallel and are interactive. 


The Influence of the Experiential System 
on the Rational System 


As the experiential system is the more rapidly reacting sys- 
tem, it is able to bias subsequent processing in the rational 
system. Because it operates automatically and precon- 
sciously, its influence normally occurs outside of awareness. 
As noted previously, this prompts people to search for an 
explanation in their conscious rational system, which often 
results in rationalization. Thus, even when people believe 
their thinking is completely rational, it is often biased by their 
experiential processing. 


The biases that influence conscious, rational thinking in 
everyday life are, for the most part, adaptive, as the experien- 
tial system operates according to schemas learned from past 
experience. In some situations, however, the experientially 
determined biases and their subsequent rationalizations are 
highly maladaptive. An extreme case is the life-long pursuit 
of “false goals.” Such goals are false in the sense that their 
achievement is followed by disappointment and sadness, 
rather than by the anticipated happiness, enhanced self- 
esteem, or security that was the reason for their pursuit. It is 
noteworthy that the achievement of a false goal is experien- 
tially disappointing although at the rational level, it is viewed 
as a significant achievement about which the individual is 
proud. The following passage from Tolstoi (1887), in which he 
describes his thoughts during a period of depression, provides 
a poignant example of such a reaction: 


When I thought of the fame which my works had gained me, I 
used to say to myself, ‘Well, what if I should be more famous 
than Gogol, Pushkin, Shakespeare, Moliere—than all the writers 
of the world—well, and what then? I could find no reply. Such 
questions demand an answer, and an immediate one; without one 
it is impossible to live, but answer there was none. 

My life had come to a sudden stop. I was able to breathe, to 
eat, to drink, to sleep. I could not, indeed, help doing so; but 
there was no real life in me. I had not a single wish to strive for 
the fulfillment of what I could feel to be reasonable. If I wished 
for something, I knew beforehand, that were I to satisfy the wish, 
nothing would come of it, I should still be dissatisfied. 

Such was the condition I had come to, at the time when all the 
circumstances of my life were preeminently happy ones, and 
when I had not yet reached my fiftieth year. I had a good, a lov- 
ing, and a well-beloved wife, good children, a fine estate, which, 
without much trouble on my part, continually increased my 
income; I was more than ever respected by my friends and 
acquaintances; I was praised by strangers, and could lay claim to 
having made my name famous... 

The mental state in which I then was seemed to me summed 
up into the following: my life was a foolish and wicked joke 
played on me by I knew not whom... 

Had I simply come to know that life has no meaning, I could 
have quietly accepted it as my allotted position. I could not, how- 
ever, remain thus unmoved. Had I been like a man in a wood, 
about which he knows that there is no issue, I could have lived 
on; but I was like a man lost in a wood, and, who, terrified by the 
thought, rushes about trying to find a way out, and though he 
knows each step can only lead him farther astray, can not help 
running backwards and forwards. 


Two features of Tolstoi’s situation are of particular interest. 
One is that he experiences deep despair after achieving his life 
goals. This suggests that his achievements, although viewed 


as successes in his rational system, failed to fulfill a basic need 
or needs in his experiential system. His success, therefore, can 
be said to be success at the rational level but failure at the ex- 
periential level. This raises the question of what the deeply 
frustrated need in his experiential system might be. In the ab- 
sence of additional information, it is, of course, impossible to 
know, and one can only speculate. One possibility within the 
framework of CEST is that the frustrated need was for uncon- 
ditional love in early childhood. Such a need, of course, can- 
not be satisfied by material rewards or accomplishments. 

The other interesting observation is that Tolstoi is dis- 
tressed not only because of his feelings of emptiness and 
meaninglessness, but that, try as he might, he cannot solve the 
problem of why he should be unhappy when all the conditions 
of his life suggest that he should be happy. It follows from 
CEST that the reason he cannot solve his problem, despite his 
considerable intelligence and motivation, is that he believes it 
exists in his rational system when in fact it exists in his expe- 
riential system. Moreover, assuming the speculation about 
frustration of unconditional love in childhood is true, its early, 
preverbal occurrence and its remoteness from the kinds of 
motives normally present in the rational systems of adults can 
help account for Tolstoi’s inability to articulate the source of 
his distress. 

The influence of the experiential system on the rational sys- 
tem can be positive as well as negative. As an associative sys- 
tem, the experiential system can be a source of creativity by 
suggesting ideas that would not otherwise be available to the 
linear-processing rational system. Because the experiential 
system is a learning system, it can be a source of useful infor- 
mation, which can be incorporated into the rational system. 
Most important is that the experiential system can provide a 
source of passion for the rational system that it would other- 
wise lack. The result is that intellectual pursuits can be pursued 
with heart, rather than as dispassionate intellectual exercises. 


The Influence of the Rational System 
on the Experiential System 


As the slower system, the rational system is in a position to 
correct the experiential system. It is common for people to re- 
flect on their spontaneous, impulsive thoughts, recognize they 
are inappropriate, and then substitute more constructive ones. 
For example, in a flash of anger an employee may have the 
thought that he would like to tell off his boss, but on further re- 
flection may decide this course of action would be most un- 
wise. To investigate this process, we conducted an experiment 
in which people were asked to list the first three thoughts that 
came to mind in response to reading a variety of provocative 
situations. The first thought was often counterproductive and 
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in the mode of the experiential system, whereas the third 
thought was usually corrective and in the mode of the rational 
system. 

The rational system can also influence the experiential 
system by providing the understanding that allows a person 
to train the experiential system so that its initial reactions are 
more appropriate. That is, by understanding the operating 
principles of the experiential system as well as its schemas, it 
is possible to determine how that system can be improved; 
this can be accomplished in a variety of ways, the most ob- 
vious of which is by disputing the maladaptive thoughts in 
the experiential system, a procedure widely utilized by cog- 
nitive therapists. As the experiential system learns directly 
from experience, another procedure is to provide real-life 
corrective experiences. A third procedure is to utilize im- 
agery, fantasy, and narratives for providing corrective expe- 
riences vicariously. 

The rational system can influence the experiential system 
in automatic, unintentional ways as well as by its intentional 
employment. As the experiential system operates in an asso- 
ciative manner, thoughts in the rational system can trigger as- 
sociations and thereby emotions in the experiential system. 
For example, a student attempting to solve a mathematics 
word problem may react to the content with conscious 
thoughts that produce associations in the experiential system; 
the associations then elicit emotional reactions that interfere 
with performance. In this illustration, we have an interesting 
cycle of the rational system’s influencing the experiential 
system, which in turn influences the rational system. 

Another unintentional way in which the rational system 
can influence the experiential system is through repetition of 
thoughts or behavior in the rational system. Through such rep- 
etition, thoughts and behavior that were originally under ratio- 
nal control can become habitualized or proceduralized, with 
the control shifting from the rational to the experiential system 
(Smith & DeCoster, 2000). An obvious advantage to this shift 
in control is that the thought and behavior require fewer cog- 
nitive resources and can occur without conscious awareness. 
Potential disadvantages are that the habitual thoughts and be- 
havior are under reduced volitional control and are more dif- 
ficult to change. Although this can be desirable for certain 
constructive thoughts and behaviors, it is problematic when 
the thoughts and behavior are counterproductive. 


The Lower and Higher Reaches 
of the Experiential System 


The experiential system operates at different levels of com- 
plexity. Classical conditioning is an example of the operation 
of the experiential system at its simplest level. In classical 
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conditioning, a conditioned, neutral stimulus (the CS), such as 
atone, precedes an unconditioned stimulus (the UCS), such as 
food. Over several trials, a connection is formed between the 
conditioned and unconditioned stimulus, so that the condi- 
tioned stimulus evokes a conditioned response (the CR), such 
as salivation, that originally occurred only to the UCS. This 
process illustrates the operation of several of the attributes of 
the experiential system, including associative processing, au- 
tomatic processing, increased strength of learning over trials, 
affective influence (e.g., emotional significance of the UCS), 
and arbitrary outcome-orientation (e.g., reacting to the CS 
independent of its causal relation to the UCS). The CS is also 
responded to holistically, as the animal reacts not only to the 
tone, but to the entire laboratory context. 

Amore complex operating level of the experiential system 
is exhibited in heuristic processing. In an article that has had a 
widespread influence on understanding decisional processes, 
Tversky and Kahneman (1974) introduced the concept of 
heuristics, which they defined as cognitive shortcuts that peo- 
ple use naturally in making decisions in conditions of uncer- 
tainty. They and other cognitive psychologists have found 
such processing to be a prevalent source of irrational reactions 
in a wide variety of situations. For example, people typically 
report that the protagonists in specially constructed vignettes 
would become more upset following arbitrary outcomes pre- 
ceded by acts of commission than by acts of omission, by near 
than by far misses, by free than by constrained behavior, and 
by unusual than by usual acts. As they respond as if the pro- 
tagonist’s behavior were responsible for the arbitrary out- 
comes, their thinking is heuristic in the sense that it is based 
on simple associative reasoning rather than on cause-and- 
effect analysis. 

A vast amount of research on heuristic processing (see 
review in Fiske & Taylor, 1991) has produced results that are 
highly consistent with the principles of experiential process- 
ing. Although the data-driven views on heuristic processing 
derived from social-cognitive research and the theory-driven 
views of CEST have much in common, the two approaches 
differ in three important respects. One is that CEST attributes 
heuristics to the normal mode of operation of an organized 
conceptual system, the experiential system, that is contrasted 
with an alternative organized conceptual system, the rational 
system. The second is that heuristic processing and the expe- 
riential system in CEST are embedded in a global theory of 
personality. The third is that heuristic processing, according 
to CEST, has withstood the test of time over millions of years 
of evolution, and is considered to be primarily adaptive. In 
contrast to these views, social cognitive psychologists, such 
as Kahneman and Tversky (1973) and Nisbett and Ross 
(1980), regard heuristics as individual “cognitive tools” that 


are employed within a single conceptual system that includes 
both associative (experiential) and analytical (rational) rea- 
soning. These theorists further regard heuristics as quirks in 
thinking that although sometimes advantageous are common 
sources of error in everyday life, and therefore are usually de- 
sirable to eliminate. It is of interest in this respect to note how 
resistant some of these blatantly nonrational ways of process- 
ing have been to elimination by training. From the perspec- 
tive of CEST, given the intrinsically compelling nature of 
experiential processing and its highly adaptive value in most 
situations in everyday life, such resilience is to be expected. 

Although the experiential system encodes events con- 
cretely and holistically, it is nevertheless able to generalize, 
integrate, and direct behavior in complex ways, some of 
which very likely involve a contribution by the rational sys- 
tem. It does this through prototypical, metaphorical, sym- 
bolic, and narrative representations in conjunction with the 
use of analogy and metaphor. Representations in the experi- 
ential system are also related and generalized through their 
associations with emotions. It is perhaps through processes 
such as these that the experiential system is able to make its 
contributions to empathy, creativity, the establishment of re- 
warding interpersonal relationships, and the appreciation of 
art and humor (Norris & Epstein, 2000b). 


PSYCHODYNAMICS 


Psychodynamics, as the term is used here, refers to the inter- 
actions of implicit motives and of implicit beliefs and their 
influence on conscious thought and behavior. The influence 
on conscious thought and behavior is assumed to be mediated 
primarily by vibes. Two major sources of vibes that are im- 
portant sources of maladaptive behavior are early-acquired 
beliefs and needs. 


The Influence of Early-Acquired Beliefs 
on Maladaptive Behavior 


As you will recall, according to CEST, the implicit beliefs in a 
person’s experiential system consist primarily of generaliza- 
tions from emotionally significant past experiences. These 
affect-laden implicit beliefs correspond to schemas about 
what the self and other people are like and how one should 
relate to them. Particularly important sources of such schemas 
are experiences with mother and father figures and with sib- 
lings. The schemas exist in varying degrees of generality. At 
the broadest level is the basic belief about what people in gen- 
eral are like, as previously discussed. At a more specific level 
are views about particular categories of people, such as 


authority figures, maternal figures, mentors, and peers. Such 
implicit beliefs, both broader and narrower ones, exert a strong 
influence on how people relate to others, particularly to those 
who provide cues that are reminders of the original general- 
ization figures. The influence of the schemas is mediated by 
the vibes automatically activated in cue-relevant situations. 

It is understandable why implicit beliefs that contribute to 
a person’s happiness and security are maintained. But why 
should implicit beliefs that appear to contribute only to 
misery also be maintained? Why do they not extinguish as a 
result of the negative affect following their retrieval? Ac- 
cording to the pleasure principle, they should, of course. 
They do not because of the influence of the need to maintain 
the stability of one’s conceptual system (Epstein & Morling, 
1995; Hixon & Swann, 1993; Morling & Epstein, 1997; 
Swann, 1990). Depending on circumstances, the need for 
stability can override the pleasure principle. But how exactly 
does this operate? What do people actually do that prevents 
their maladaptive beliefs acquired in an earlier period from 
being extinguished when they are exposed to corrective 
experiences in adulthood? 

There are three things people do or fail to do that serve to 
maintain their maladaptive implicit beliefs. First, they tend to 
perceive and interpret events in a manner that is consis- 
tent with their biasing beliefs. Biased perceptions and inter- 
pretations allow individuals to experience events as verifying 
a belief even when on an objective basis they should be dis- 
confirming it. For example, an offer to help or an expression 
of concern can be perceived as an attempt to control one, and 
an expression of love can be viewed as manipulative. Second, 
people often engage in self-verifying behavior, such as by 
provoking counterbehavior in others that provides objective 
confirmation of the initial beliefs. For example, a person who 
fears rejection in intimate relationships may behave with ag- 
gression or withdrawal whenever threatened by relationships 
advancing toward intimacy. This predictably provokes the 
other person to react with counteraggression or withdrawal, 
thereby providing objective evidence confirming the belief 
that people are rejecting. Third, people fail to recognize the 
influence of their implicit beliefs and associated vibes on their 
behavior and conscious thoughts, which prevents them from 
identifying and correcting their biased interpretations and 
self-verifying behavior. As a result, they attribute the conse- 
quences of their maladaptive behavior to unfavorable circum- 
stances or, more likely, to the behavior of others. In the event 
that after repeated failed relationships, they should consider 
the possibility that their own behavior may play a role, they 
are at a loss to understand in what way this could be true, as 
they can cite objective evidence to support their biased views. 
You will recall that an important maxim in CEST is that a 
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failure to recognize the operation of one’s experiential system 
means that one will be controlled by it. 

There is an obvious similarity between the psychoanalytic 
concept of transference and the view in CEST that people’s 
relationships are strongly influenced by generalizations from 
early childhood experiences with significant others. Psycho- 
analysts have long emphasized the importance of transfer- 
ence relations in psychotherapy. They have observed that 
their patients, after a period in therapy, react to the analyst as 
if the analyst were a mother or a father figure. They encour- 
age the development of such transference reactions with the 
aim of providing a corrective emotional experience. Through 
the use of this procedure as well as by interpreting the trans- 
ference, the analyst hopes to eliminate the tendency of the pa- 
tient to establish similar relationships with others. Although 
this procedure is understandable from the perspective of 
CEST, it is fraught with danger, as the patient may become 
overly dependent on the therapist and the therapist, despite 
the best of intentions, may provide a destructive rather than a 
corrective experience. Moreover, working through a transfer- 
ence relationship—even when successful—may not be the 
most efficient way of treating inappropriate generalizations. 
Nevertheless, for present purposes, it illustrates how general- 
izations from early childhood tend to be reproduced in later 
relationships, including those with therapists, and how 
appropriate emotional experiences can correct maladaptive 
generalizations. 

Although there are obvious similarities between the con- 
cepts of transference in psychoanalysis and of generalization 
in CEST, there are also important differences. Generalization 
is a far broader concept, which, unlike transference, is not re- 
stricted to the influence of relationships with parents. Rather, 
it refers to the influence of all significant childhood relation- 
ships, including in particular those with siblings as well as 
with parents. Schemas derived from childhood experiences 
are emphasized in CEST because later experiences are as- 
similated by earlier schemas. Also, generalizations acquired 
from childhood experiences are likely to be poorly articu- 
lated (if articulated at all) in the rational system. Their influ- 
ence, therefore, is likely to continue to be unrecognized into 
adulthood. 


The Influence of Early-Acquired Motives 
on Maladaptive Behavior 


Much of what has been said about implicit beliefs in the ex- 
periential system can also be applied to implicit needs. Like 
implicit beliefs, implicit needs or motives are acquired from 
emotionally significant experiences. They are also main- 
tained for similar reasons. As previously noted, when people 
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experience a positive or negative event, they automatically 
acquire a behavioral tendency or motive to reproduce the ex- 
perience if it was favorable and to avoid experiencing it if it 
was unfavorable. The stronger the emotional response and 
the more often it occurs in the same or similar situations, the 
greater the strength of the motive. Although this learning pro- 
cedure is adaptive most of the time, it is maladaptive when 
past conditions are unrepresentative of present ones. One 
such condition is when a child has experiences involving the 
deep thwarting of one or more basic needs. For example, if 
the need to maintain self-esteem is deeply frustrated in child- 
hood, the child will acquire a sensitivity to threats to self- 
esteem and a corresponding compulsion to protect himself or 
herself from such threats in the future. Sensitivities, in CEST, 
refer to areas of particular vulnerability, and compulsions 
refer to rigid, driven behavioral tendencies with the aim of 
protecting oneself from sensitivities. Such sensitivities and 
compulsions are considered in CEST to be major sources of 
maladaptive behavior. 

The following case history illustrates the operation of a 
sensitivity and compulsion. In this and other case histories, 
names, places, and details are altered to protect the anonymity 
of the protagonists. Ralph was the oldest child in a family that 
included three other children. He was extremely bright and far 
outshone his siblings in academic performance. However, 
rather than being appreciated for it, he was resented by both 
his parents and siblings. When he eagerly showed his mother 
the excellent grades on his report card, she would politely tell 
him that she was busy at the moment and would like to look at 
it later, when she had more time. Not infrequently, she would 
forget to do so. It gradually became evident to Ralph that she 
was more upset than pleased with his accomplishments, so he 
stopped informing her about them. 

The mother’s behavior can be understood in terms of her 
own background. She had been deeply resentful, as a child, 
when her mother expressed admiration for the accomplish- 
ments of her brighter sibling and ignored her own accomplish- 
ments. Thus, her automatic reaction to cues that reminded her 
of such experiences was to have unpleasant vibes accompa- 
nied by resentful thoughts. Consequently, although she meant 
to be a good mother to Ralph, her experiential reactions un- 
dermined her conscious intent. Being unaware of her underly- 
ing experiential reactions, she could not help but react as she 
did. Moreover, over time she found objective reasons for con- 
sidering him as her least favored child. Little did she realize 
that his resentful and reticent attitude toward her and others 
were reactions to her own behavior toward him. She simply re- 
garded him as a stubborn, difficult child by nature. 

As a result of his experience in the family, Ralph devel- 
oped feelings of being unlovable and unworthy and felt 


depressed much of the time. As an adult, he devoted his en- 
ergy to bolstering his self-esteem by working extremely hard 
at becoming a successful businessman. He succeeded at this 
to aremarkable extent, becoming wealthy at an early age. Yet 
despite his success and accumulation of material things that 
other people admired, happiness eluded him. He continued to 
feel unlovable and depressed no matter what his possessions 
were and no matter that he had a wife and children who tried 
hard to please him. When his wife praised the children for 
their accomplishments, he became resentful toward her and 
the children. He spent less and less time with his family and 
increasingly immersed himself in his business. He also began 
to accuse his wife and children of not loving him and said that 
was the reason he was spending so little time with them. In 
his eyes, he was the victim of rejection, not its perpetrator. 
The result was that he increasingly alienated his family, 
which verified for him that they did not love him. He became 
convinced that his wife would ask him for a divorce, and 
rather than be openly rejected by her, he asked her for a di- 
vorce first. He was sure she would be pleased to oblige, and 
he was extremely relieved when she protested that she did not 
want a divorce. She said that she wanted more than anything 
else for them to work together to improve their relationship. 
This gave a great boost to Ralph, and he tried to the best of 
his ability to be a more attentive husband and father. This was 
no easy task for him, particularly as he had no insight into the 
role his own behavior played in his distressing relationships 
with his family. It remains to be seen if he will succeed. From 
the perspective of CEST, it is doubtful that he will unless he 
gains insight into the influence of his experiential system. 

This case illustrates the development, operation, and con- 
sequences of a sensitivity and compulsion. Of further interest 
is that it illustrates the transference of sensitivities and com- 
pulsions across generations. The mother’s sensitivity was to 
being outshone intellectually, and her compulsion was to get 
back in some way or other at whomever activated the sensi- 
tivity. In this case it was her own son, who provided cues 
reminiscent of her childhood experiences with her brighter 
sibling. Lest you blame the mother, consider that her reac- 
tions occurred automatically, outside of her awareness, and 
that she was no less a victim than was Ralph. 

Ralph had three related sensitivities: threat to his self- 
esteem, lack of appreciation for his accomplishments, and re- 
jection by a loved one. His compulsive reaction in response 
to the first sensitivity was to attempt to increase his self- 
esteem by becoming an outstanding success in business and 
thereby gaining the admiration of others. His compulsive re- 
action to the second sensitivity was again to gain the admira- 
tion of others for his success and material possessions. His 
compulsive reaction to the third sensitivity was to withdraw 


from and reject the members of his family before they re- 
jected him. Not surprisingly, his compulsive reactions inter- 
fered with rather than facilitated gaining the love he so 
desperately desired. 


RESEARCH SUPPORT FOR THE CONSTRUCT 
VALIDITY OF CEST 


Research generated by a variety of dual-process theories other 
than CEST has produced many findings consistent with the 
assumptions in CEST (see review in Epstein, 1994, and arti- 
cles in Chaiken & Trope, 1999). As a review of this extensive 
literature is beyond the scope of this chapter, here I confine the 
discussion to studies my associates and I specifically designed 
to test assumptions in CEST. Three kinds of research are 
reviewed: research on the operating principles of the experi- 
ential system, research on the interactions within and between 
the two systems, and research on individual differences in the 
extent and efficacy in the use of the two systems. 


Research on the Operating Principles 
of the Experiential System 


For some time, my associates and I have been engaged in a 
research program for testing the operating principles of the 
experiential system. One of our approaches consisted of 
adapting procedures used by Tversky and Kahneman and 
other cognitive and social-cognitive psychologists to study 
heuristic, nonanalytical thinking through the use of specially 
constructed vignettes (for examples of this research by oth- 
ers, see Fiske & Taylor, 1991; Tversky & Kahneman, 1974, 
1983; Kahneman, Slovic, & Tversky, 1982). 


Irrational Reactions to Unfavorable Arbitrary Outcomes 


People in everyday life often react to arbitrary, unintended 
outcomes as if they were intentionally and causally deter- 
mined. Thus, they view more favorably the proverbial bearer 
of good than of evil tidings despite knowing full well that the 
messenger is not responsible for the message. Such behavior 
is an example of outcome-oriented processing. It is the typi- 
cal way the experiential system reacts to events—by associ- 
ating outcomes with the stimuli that precede the outcomes, as 
in classical conditioning. 

As an example of the kinds of vignettes we used, one of 
them described a situation in which two people, as the result of 
unanticipated heavy traffic, arrive at an airport 30 minutes 
after the scheduled departure of their flights. One learns that 
her flight left on time, and the other learns that her flight just 
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left. Tversky and Kahneman (1983) found that people typi- 
cally reported that the one who barely missed her flight would 
be more upset than the other protagonist would be, although 
from a rational perspective it should not matter at all as both 
were equally inconvenienced and neither was responsible for 
the outcome. We modified Tversky and Kahneman’s experi- 
ment by having the participants respond from three perspec- 
tives: how they believed most people would react; how they 
themselves would react based on how they have reacted to 
similar situations in the past, and how a completely logical 
person would react (Epstein, Lipson, Holstein, & Huh, 1992). 
The first two perspectives were considered to be mainly under 
the jurisdiction of the experiential system and the third to be 
mainly under the jurisdiction of the rational system. In order to 
control for and examine the influence of each of the perspec- 
tives on the effect of subsequent perspectives, we counterbal- 
anced the order of presentation of the perspectives. 

The findings supported the following hypotheses: There 
are two different modes of information processing, experi- 
ential and rational; the experiential system is an associative 
system that automatically relates outcomes to preceding situ- 
ations and behavior, treating them as if they are causally 
related, even when the relation is completely arbitrary; the 
rational system is an analytical system that judges cause-and- 
effect relations according to logical rules; and the systems are 
interactive, with each influencing the other. Support for the 
last hypothesis is of particular interest, as it supports the im- 
portant assumption in CEST that the prevalence of irrational 
thinking in humans can be attributed largely to the influence 
of their automatic, preconscious experiential processing on 
their conscious analytical thinking. 

In research on arbitrary outcomes in which we varied the 
affective consequences of the outcomes, the results supported 
the assumption in CEST that the degree of experiential relative 
to rational influence varies directly with the intensity of the 
affect that is implicated (Epstein et al., 1992). What we found 
is that the greater the emotional intensity of the outcomes, 
the more the responses reflected experiential (vs. rational) 
processing. 


The Ratio-Bias Phenomenon 


Imagine that you are told that on every trial in which you 
blindly draw a red jellybean from a bowl containing red and 
white jellybeans, you will receive two dollars. To make mat- 
ters more interesting, you are given a choice between drawing 
from either of two bowls that offer the same 10% odds of 
drawing a winning bean. One contains one red jellybean and 
nine white ones; the other contains 10 red jellybeans and 90 
white ones. Which bowl would you choose to draw from, and 
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how much would you pay for the privilege of drawing 
from the bowl of your choice, rather than having the choice 
decided by the toss of a coin? When people are simply 
asked how they would behave, almost all say they would 
have no preference and would not pay a cent for a choice 
between two equal probabilities. Yet when they are placed 
in a real situation, most willingly pay small sums of money 
for the privilege of drawing from the bowl with more red 
jellybeans (Kirkpatrick & Epstein, 1992). This difference 
in response to the verbally presented and the real situation can 
be explained by the greater influence of the experiential than 
the rational system in real situations with emotionally signif- 
icant consequences compared to simulated situations with- 
out consequences. According to CEST, the experiential 
system is particularly reactive to real experience, whereas 
the rational system is uniquely responsive to abstract, verbal 
representations. 

This jellybean experimental situation, otherwise referred 
to as the ratio-bias experimental paradigm, is particularly in- 
teresting with respect to CEST because it pits experiential 
against rational processing. The conflict between the two 
modes of processing arises because the experiential system is 
a concrete system that is less responsive to abstractions such 
as ratios than to the numerousness of objects. Comprehension 
of numerousness, unlike comprehension of ratios, is an ex- 
tremely fundamental ability that is within the capacity of 
3-year-old children and nonhuman animals (Gallistel & 
Gelman, 1992). 

Even more impressive than the irrational behavior exhib- 
ited by people paying for the privilege of choosing between 
bowls that offer equal probabilities are the results obtained 
when unequal probabilities are offered by the bowls. If our 
reasoning is correct, a conflict between the two systems can 
be established by having one bowl probability-advantaged 
and the other numerousness-advantaged. In one study, the 
probability-advantaged bowl always contained | in 10 red 
jellybeans, whereas the numerousness-advantaged bowl of- 
fered between 5 and 9 red jellybeans out of 100 jellybeans, 
depending on the trial (Denes-Raj & Epstein, 1994). Under 
these circumstances, many adults made nonoptimal re- 
sponses by selecting the numerousness-advantaged bowl 
against the better judgment of their rational thinking. For ex- 
ample, they often chose to draw from the bowl that contained 
8 of 100 (8%) in preference to the one that contained 1 of 10 
(10%) red jellybeans. Some sheepishly commented that they 
knew it was foolish to go against the probabilities, but some- 
how they felt they had a better chance of drawing a red jelly- 
bean when there were more of them. Of additional interest, 
participants made nonoptimal responses only to a limited 
degree, thereby suggesting a compromise between the two 


systems. Thus, although many selected a numerousness- 
advantaged 8% option (8 of 100 red jellybeans) over a 10% 
probability-advantaged one (1 of 10 red jellybeans), almost 
no one selected a 5% numerousness-advantaged option (5 
of 100 red jellybeans) over a 10% probability-advantaged 
option (1 of 10 red jellybeans). Apparently, most people pre- 
ferred to behave according to their experiential processing 
only up to a point of violating their rational understanding. To 
be sure, there were participants who always responded ratio- 
nally. What was impressive about the study, however, was the 
greater number who responded irrationally despite knowing 
better (in their rational systems). 

To determine whether children who have not had formal 
training in ratios have an intuitive understanding of ratios, we 
conducted a series of studies in which we examined chil- 
dren’s responses to the ratio-bias experimental paradigm 
(Yanko & Epstein, 2000). We were also interested in these 
studies in determining whether children who have only an in- 
tuitive understanding of ratios exhibit compromises between 
the two systems. We found that children without formal 
knowledge of ratios had only a rudimentary comprehension 
of ratios. They responded appropriately to differences be- 
tween ratios only when the magnitude of the differences 
was large. Like adults, children exhibited compromises, but 
their compromises were more in the experiential direction. 
For example, many children but no adults selected a 5% 
numerousness-advantaged bowl over a 10% probability- 
advantaged one. However, very few of the same children 
selected a 2% numerousness-advantaged bowl over a 10% 
probability-advantaged one. 

We also used the ratio-bias experimental paradigm to test 
the assumption in CEST that the experiential system re- 
sponds to visual imagery in a way similar to the way it does 
to real experience (Epstein & Pacini, 2001). We presented 
participants in an experimental group with a verbal descrip- 
tion of the ratio-bias experimental paradigm after training 
them to vividly visualize the situation. Participants in the 
control group were given only the verbal description. In sup- 
port of the assumption, the visual-imaging group but not the 
control group exhibited the ratio-bias phenomenon in a man- 
ner similar to what we have repeatedly found in real situa- 
tions but not in simulated situations. 

The overall results from the many studies we conducted 
with the ratio-bias paradigm (Denes-Raj & Epstein, 1994; 
Denes-Raj, Epstein, & Cole, 1995; Kirkpatrick & Epstein, 
1992; Pacini & Epstein, 1999a, 1999b; Yanko & Epstein, 
2000) provided support for the following assumptions and 
hypotheses derived from CEST. There are two independent 
information-processing systems. Sometimes they conflict 
with each other, but more often they form compromises. With 


increasing maturation from childhood to adulthood, the bal- 
ance of influence between the two processing systems shifts 
in the direction of increased rational dominance. The experi- 
ential system is more responsive than is the rational system to 
imagery and to other concrete representations than the ratio- 
nal system, whereas the rational system is more responsive 
than is the experiential system to abstract representations. 
Engaging the rational system in children who do not have 
formal knowledge of ratios by asking them to give the rea- 
sons for their responses interferes with the application of 
their intuitive understanding of ratios, resulting in a deterio- 
ration of performance. 

We have also used the ratio-bias phenomenon to elucidate 
the thinking of people with emotional disorders. In a study of 
depressed college students (Pacini, Muir, & Epstein, 1998), 
the ratio-bias phenomenon helped to clarify the paradoxical 
depressive-realism phenomenon (Alloy & Abramson, 1988). 
The phenomenon refers to the finding that depressed partici- 
pants are more rather than less accurate than are nondepressed 
participants in judging contingencies between events. We 
found that the depressed participants made more optimal re- 
sponses than did their nondepressed counterparts only when 
the stakes for nonoptimal responding were inconsequential. 
When we raised the stakes, the depressed participants re- 
sponded more experientially and the control participants re- 
sponded more rationally, so that the groups converged and no 
longer differed. We concluded that the depressive-realism 
phenomenon can be attributed to an overcompensatory reac- 
tion by subclinically depressed participants in trivial situa- 
tions to a more basic tendency to behave unrealistically in 
emotionally significant situations. We further concluded that 
normal individuals tend to rely on their less demanding expe- 
riential processing when incentives are low, but increasingly 
engage their more demanding rational processing as incen- 
tives are increased. 


The Global-Evaluation Heuristic 


The global-person-evaluation heuristic refers to the ten- 
dency of people to evaluate others holistically as either good 
or bad people rather than to restrict their judgments to 
specific behaviors or attributes. Because the global-person- 
evaluation heuristic is consistent with the assumption that 
holistic evaluation is a fundamental operating principle of the 
experiential system (see Table 7.1), it follows that global- 
person-evaluations tend to be highly compelling and not eas- 
ily changed. The heuristic is particularly important because 
of its prevalence and because of the problems that arise from 
it—such as when jurors are influenced by the attractiveness 
of a defendant’s appearance or personality in judging his or 
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her guilt. An interesting example of this phenomenon was 
provided in the hearing of Clarence Thomas for appointment 
to the United States Supreme Court. The testimony by Anita 
Hill about the obscene sexual advances she alleged he made 
to her was discredited in the eyes of several senators because 
of the favorable testimony by employees and acquaintances 
about his character and behavior. It seemed inconceivable to 
the senators that an otherwise good person could be sexually 
abusive. 

We studied the global-person-evaluation heuristic (re- 
ported in Epstein, 1994) by having participants respond to a 
vignette adapted from a study by Miller and Gunasegaram 
(1990). In the vignette, a rich benefactor tells three friends 
that if each throws a coin that comes up heads, he will give 
each $100. The first two throw a heads, but Smith, the third, 
throws a tails. When asked to rate how each of the protago- 
nists feels, most participants indicated that Smith would feel 
guilty and the others would feel angry with him. In an alter- 
native version with reduced stakes, the ratings of guilt and 
anger were correspondingly reduced. When asked if the other 
two would be willing, as they previously had intended, to in- 
vite Smith to join them on a gambling vacation in Las Vegas, 
where they would share wins and losses, most partici- 
pants said they would not “because he is a loser.” These re- 
sponses were made both from the perspective of how the 
participants reported they themselves would react in a real 
situation and how they believed most people would react. 
When responding from the perspective of how a completely 
logical person would react, most participants said a logical 
person would recognize that the outcome of the coin tosses 
was arbitrary, and they therefore would not hold it against 
Smith. They further indicated that a logical person would 
invite him on the gambling venture. 

This study indicates that people tend to judge others holis- 
tically by outcomes, even arbitrary ones. It further indicates 
that people intuitively recognize that there are two systems of 
information processing that operate in a manner consistent 
with the principles of the experiential and rational systems. It 
also supports the hypotheses that experiential processing be- 
comes increasingly dominant with an increase in emotional 
involvement and that people overgeneralize broadly in judg- 
ing others on the basis of outcomes over which the person 
has no control, even though they know better in their rational 
system. 


Conjunction Problems 


The Linda conjunction problem is probably the most 
researched vignette in the history of psychology. It has evoked 
a great deal of interest among psychologists because of its 
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paradoxical results. More specifically, although the solution 
to the Linda problem requires the application of one of the 
simplest and most fundamental principles of probability 
theory, almost everyone—including people sophisticated in 
statistics —gets it wrong. How is this to be explained? As you 
might suspect by now, the explanation lies in the operating 
principles of the experiential system. 

Linda is described as a 31-year-old woman who is single, 
outspoken, and very bright. In college she was a philosophy 
major who participated in antinuclear demonstrations and 
was concerned with issues of social justice. How would you 
rank the following three possibilities: Linda is a feminist, 
Linda is a bank teller, and Linda is a feminist and a bank 
teller? If you responded like most people, you ranked Linda 
as being a feminist and a bank teller ahead of Linda’s being 
just a bank teller. In doing so, you made what Tversky & 
Kahneman (1982) refer to as a conjunction fallacy, and which 
we refer to as a conjunction error (CE). It is an error or fallacy 
because according to the conjunction rule, the occurrence of 
two events cannot be more likely than the occurrence of only 
one of them. 

The usual explanation of the high rate of CEs that people 
make is that they either do not know the conjunction rule or 
they do not think of it in the context of the Linda vignette. 
They respond instead, according to Tversky and Kahneman, 
by the representativeness heuristic, according to which being 
both a bank teller and a feminist is more representative of 
Linda’s personality than being just a bank teller. 

In a series of studies on conjunction problems, including 
the Linda problem (Donovan & Epstein, 1997; Epstein, 
Denes-Raj, & Pacini, 1995; Epstein & Donovan, 1995; 
Epstein, Donovan, & Denes-Raj, 1999; Epstein & Pacini, 
1995), we concluded that the major reason for the difficulty of 
the Linda problem is not an absence of knowledge of the con- 
junction rule or a failure to think of it. We demonstrated that al- 
most all people have intuitive knowledge of the conjunction 
rule, as they apply it correctly in natural contexts, such as in 
problems about lotteries. Nearly all of our participants, 
whether or not they had formal knowledge of the conjunction 
rule, reported that winning two lotteries, one with a very low 
probability of winning and the other with a higher probability, 
is less likely than is winning either one of them (Epstein et al., 
1995). This finding is particularly interesting from the 
perspective of CEST because it indicates that the experiential 
system (which knows the conjunction rule intuitively) 
is sometimes smarter than the rational system (which may not 
be able to articulate the rule). We also found that when we pre- 
sented the conjunction rule among other alternatives, thereby 
circumventing the problem of whether people think of it in the 
context of the Linda problem, most people selected the wrong 


rule. They made the rule fit their responses to the Linda prob- 
lem rather than the reverse, thereby demonstrating the com- 
pelling nature of experiential processing and its ability to 
dominate analytical thinking in certain situations. 

The conclusions from our series of studies with the Linda 
problem can be summarized as follows: 


e The difficulty of the Linda problem cannot be fully ac- 
counted for by the misleading manner in which it is pre- 
sented, for even with full disclosure about the nature of the 
problem and the request to treat it purely as a probability 
problem, a substantial number of participants makes CEs. 
Apparently, people tend to view the Linda problem as a per- 
sonality problem rather than as a probability problem, no 
matter what they are told. 


e The difficulty of the Linda problem can be explained by the 
rules of operation of the experiential system, which is the 
mode employed by most people when responding to it. 
Thus, people tend to reason associatively, concretely, holis- 
tically, and in a narrative manner rather than abstractly and 
analytically when responding to the problem. For example, 
a number of participants explained their responses that vio- 
lated the conjunction rule by stating that Linda is more 
likely to be a bank teller and a feminist than just a feminist 
because she has to make a living. 


e The essence of the difficulty of the Linda problem is that it 
involves an unnatural, concrete presentation, where an un- 
natural presentation is defined as one that differs from the 
context in which a problem is normally presented. We 
found that concrete presentations facilitate performance in 
natural situations (in which the two processing systems 
operate in synchrony) and interfere with performance in 
unnatural situations (in which the two systems operate in 
Opposition to each other). 

e Processing in the experiential mode is intrinsically highly 
compelling and can override processing in the rational 
mode even when the latter requires no more effort. Thus, 
many people, despite knowing and thinking of the conjunc- 
tion rule, nevertheless prefer a representativeness solution. 


e Priming intuitive knowledge in the experiential system can 
facilitate the solution to problems that people are unable 
initially to solve intellectually. 


Interaction Between the Two Processing Systems 


An important assumption in CEST is that the two systems 
are interactive. Interaction occurs simultaneously as well as 
sequentially. Simultaneous interaction was demonstrated in 
the compromises between the two systems observed in the 


studies of the ratio-bias phenomenon. Sequential interaction 
was demonstrated in the study in which people listed their first 
three thoughts and in the studies of conjunction problems, in 
which presenting concrete, natural problems before abstract 
problems facilitated the solution of the abstract problems. 

There is also considerable evidence that priming the expe- 
riential system subliminally can influence subsequent re- 
sponses in the rational system (see review in Bargh, 1989). 
Other evidence indicates that the form independent of the 
content of processing in the rational system can be influenced 
by priming the experiential system. When processing in 
the experiential mode is followed by attempts to respond 
rationally, the rational mode itself may be compromised by 
intrusions of experiential reasoning principles (Chaiken & 
Maheswaren, 1994; Denes-Raj, Epstein, & Cole, 1995; 
Edwards, 1990; Epstein et al., 1992). 

Sequential influence does not occur only in the direction 
of the experiential system influencing the rational system. As 
previously noted, in everyday life sequential processing 
often proceeds in the opposite direction, as when people react 
to their irrational, automatic thoughts with corrective, ratio- 
nal thoughts. In a study designed to examine this process, we 
instructed participants to list the first three thoughts that 
came to mind after imagining themselves in various situa- 
tions described in vignettes (reported in Epstein, 1994). The 
first response was usually a maladaptive thought consistent 
with the associative principle of the experiential system, 
whereas the third response was usually a more carefully rea- 
soned thought in the mode of the rational system. As an ex- 
ample, consider the responses to the following vignette, 
which describes a protagonist who fails to win a lottery be- 
cause she took the advice of a friend rather than follow her 
own inclination to buy a ticket that had her lucky number on 
it. Among the most common first thoughts were that the 
friend was to blame and that the participant would never take 
her advice again. By the third thought, however, the partici- 
pants were likely to state that the outcome was due to chance 
and no one was to blame. 


Interaction Between the Basic Needs 


You will recall that a basic assumption in CEST is that behav- 
ior often represents a compromise among multiple basic 
needs. This process is considered to be particularly important, 
as it provides a means by which the basic needs serve as checks 
and balances against each other, with each need constrained by 
the influence of the other needs. To test the assumption about 
compromises, we examined the combined influence of the 
needs for self-enhancement and self-verification. Swann and 
his associates had previously demonstrated that the needs for 
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enhancement and verification operate sequentially, with the 
former tending to precede the latter (e.g., Swann, 1990; 
Hixon & Swann, 1993). We wished to demonstrate that they 
also operate simultaneously, as manifested by compromises 
between them. Our procedure consisted of varying the favor- 
ableness of evaluative feedback and observing whether partic- 
ipants had a preference for feedback that matched or was more 
favorable to various degrees than their self-assessments 
(Epstein & Morling, 1995; Morling & Epstein, 1997). In 
support of our hypotheses, participants preferred feedback 
that was only slightly more favorable than their own self- 
assessments, consistent with a compromise between the need 
for verification and the need for self-enhancement. 


Research on Individual Differences 


Individual Differences in the Intelligence 
of the Experiential System 


If there are two different systems for adapting to the environ- 
ment, then it is reasonable to suspect that there are individual 
differences in the efficacy with which people employ each. It 
is therefore assumed in CEST that each system has its own 
form of intelligence. The question remains as to how to mea- 
sure each. The intelligence of the rational system can be mea- 
sured by intelligence tests, which are fairly good predictors of 
academic performance. To a somewhat lesser extent, they 
also predict performance in a wide variety of activities in the 
real world, including performance in the workplace, particu- 
larly in situations that require complex operations (see re- 
views in Gordon, 1997; Gottfredson, 1997; Hunter, 1983, 
1986; Hunter & Hunter, 1984). However, intelligence tests do 
not measure other kinds of abilities that are equally important 
for success in living, including motivation, practical intelli- 
gence, ego strength, appropriate emotions, social facility, and 
creativity. 

Until recently, there was no measure of the intelligence of 
the experiential system; one reason for this is that the concept 
of an experiential system was unknown. Having established 
its theoretical viability, the next step was to construct a way of 
measuring it, which resulted in the Constructive Thinking In- 
ventory (CTI; Epstein, 2001). The measurement of experien- 
tial intelligence is based on the assumption that experiential 
intelligence is revealed by the adaptiveness of the thoughts 
that tend to spontaneously occur in different situations or 
conditions. 

People respond to the CTI by reporting on a 5-point scale 
the degree to which they have certain common adaptive and 
maladaptive automatic or spontaneous thoughts. An example 
of an item is / spend a lot of time thinking about my mistakes, 
even if there is nothing I can do about them (reverse scored). 
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The CTI provides a Global Constructive Thinking scale and 
six main scales, most of which have several facets, or sub- 
scales. The six main scales are Emotional Coping, Behavioral 
Coping, Categorical Thinking, Esoteric Thinking, Naive Op- 
timism, and Personal Superstitious Thinking. The main scales 
all have high internal-consistency reliability coefficients and 
evidence for their validity in numerous studies. They are pre- 
dictive of a wide variety of criteria related to success in living. 
A review of the extensive literature supporting the construct 
validity of the CTI is beyond the scope of this chapter, but is 
available elsewhere (Epstein, 2001). For present purposes, it 
will suffice to note that favorable CTI scores have been found 
to be significantly associated with performance in the work- 
place and in the classroom, social competence, leadership 
ability, ability to cope with stress, emotional adjustment, 
physical well-being, and an absence of drug and alcohol 
abuse. 

The relation of constructive thinking to intellectual intelli- 
gence is of considerable interest for theoretical as well as 
practical reasons. According to CEST, the experiential and ra- 
tional systems operate independently, each by its own set of 
principles (see Table 7.1). One would therefore expect the in- 
telligence or efficacy of the two processing systems to be in- 
dependent. This is exactly what we have repeatedly found in 
several studies that have compared scores on the Global CTI 
scale with measures of intellective intelligence (Epstein, 
2001). Of additional interest, constructive thinking and intel- 
lectual intelligence were found to exhibit opposite courses of 
development across the life span. Constructive thinking is at 
its nadir in adolescence, when intellectual intelligence is at its 
peak, and it gradually increases throughout most of the adult 
years when intellectual intelligence is gradually declining. 
Unlike intellectual intelligence, constructive thinking is only 
negligibly related to academic achievement tests. Yet it adds 
significant variance in addition to the contribution of intellec- 
tual intelligence to the prediction of performance in the class- 
room, asindicated by grades received and class rank (Epstein, 
2001). Apparently, good constructive thinkers are able to 
capitalize on their knowledge and obtain appropriate recogni- 
tion for their achievements, whereas poor constructive 
thinkers are more likely to engage in counterproductive be- 
havior such as antagonizing their teachers, resulting in their 
being downgraded. 


Individual Differences in Rational and Experiential 
Thinking Styles 


If people process information by two different systems, the 
extent to which they employ each should be an important 
personality variable. To investigate this aspect of personality, 
we constructed a self-report test, the Rational-Experiential 


Inventory (REI). The REI has main scales of rational and ex- 
periential processing. Each of the main scales has subscales 
of self-assessed effectiveness and of frequency in use of the 
thinking style. 

The REI scales have internal-consistency reliabilities of 
.87-.90 for the main scales and .79-.84 for the subscales. 
There is considerable evidence in support of their construct 
validity. The major findings from several studies (Epstein 
et al., 1996; Norris & Epstein, 2000a, 2000b; Pacini & 
Epstein, 1999b; Pacini, Muir, & Epstein, 1998; Rosenthal & 
Epstein, 2000) can be summarized as follows: 


e In support of the assumption in CEST of independent ra- 
tional and experiential processing systems, the two main 
scales are independent. 


e In support of the inclusion of the subscales, they exhibit 
factorial, discriminant, and convergent validity. 


e The rational and experiential scales are coherently associ- 
ated with objective measures of heuristic processing. As 
expected, the relation of the rational scale with heuristic 
processing is inverse, and the relation of the experiential 
scale with heuristic processing is direct. 


e Although the rational and experiential main scales are 
uniquely associated with some variables, they make inde- 
pendent, supplementary contributions to the prediction of 
other variables. The rational scale is more strongly posi- 
tively associated than is the experiential scale with intel- 
lectual performance, as measured by SAT scores and 
grade point average, and with adjustment, including mea- 
sures of ego strength and self-esteem, and with measures 
of openness, conscientiousness, favorable beliefs about 
the self and the world, and physical well-being. The ratio- 
nal scale is more strongly negatively associated than the 
experiential scale with measures of neuroticism, depres- 
sion, anxiety, stress in college life, subtle racism, extreme 
conservatism, alcohol abuse, and naive optimism. The ex- 
periential scale is more strongly positively associated than 
the rational scale with measures of extroversion, agree- 
ableness, favorable interpersonal relationships, empathy, 
creativity, emotionality, sense of humor, and art apprecia- 
tion, and it is more strongly negatively associated than the 
rational system with distrust and intolerance. 


When introducing a new measure, it is important to 
demonstrate that the measure provides information that is not 
readily available from existing instruments. In order to deter- 
mine whether the REI is redundant with more standard per- 
sonality measures, we conducted a study (Pacini & Epstein, 
1999b) in which we compared the REI to the NEO Five- 
Factor Inventory (NEO-FFI; Costa & McCrae, 1989), the 
most popular measure of the Big Five personality traits. 


The two inventories contributed independent, supplementary 
variance to the prediction of many of the same variables and 
unique variance to the prediction of other variables. More- 
over, when the five NEO-FFI scales were entered into a re- 
gression equation as predictors of the REI scales, they 
accounted for only 37% of the variance of the Rationality 
scale and 11% of the variance of the Experientiality scale. 
This is of interest not only because it demonstrates that the 
REI is mainly independent of the NEO-FFI, but also because 
of the information it provides about the NEO-FFI. It suggests 
that the NEO-FFI mainly measures attributes associated with 
the rational system and is relatively deficient in measuring at- 
titudes and behavior associated with preconscious, automatic 
information processing. 

Consistent with gender stereotypes, women report signifi- 
cantly greater appreciation of and engagement in experiential 
processing than men, and men report greater appreciation of 
and engagement in rational processing. However, the mean 
gender differences are small, and there is a great deal of over- 
lap between the groups. 

Given fundamentally different ways of processing infor- 
mation, it might reasonably be expected that people with dif- 
ferent thinking styles would be receptive to different kinds of 
messages. To test this hypothesis, Rosenthal and Epstein 
(2000) conducted a study with the REI in which they com- 
pared the reactions of women with high scores on rationality 
and low scores on experientiality with women with the oppo- 
site pattern. The groups were subdivided according to 
whether they received messages on the danger of breast can- 
cer and the importance of self-examination in the form of 
information designed to appeal to the rational or the experi- 
ential mode of information processing. The rational mes- 
sage emphasized actuarial and other objective information, 
whereas the experiential message included personal appeals 
and vivid individual cases. The dependent variable was the 
intention to regularly conduct breast self-examinations. Both 
groups reported a greater intention to conduct breast exami- 
nations when the message they received matched their own 
thinking style. 


Individual Differences in Basic Beliefs About 
the Self and the World 


The Basic Beliefs Inventory (BBI; Catlin & Epstein, 1992) is 
a self-report questionnaire that measures the four basic be- 
liefs proposed in CEST. It includes a global scale of overall 
favorability of basic beliefs and separate scales for measuring 
each of the basic beliefs. The internal-consistency reliabili- 
ties (coefficients alpha) of the scales are between .77 to .91. 
The scales are moderately intercorrelated with a median cor- 
relation of .42, thereby justifying combining them into an 
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overall scale of favorability of beliefs as well as considering 
them individually. 

You will recall that according to CEST, a person’s basic 
beliefs are primarily derived from emotionally significant 
personal experiences. To test this hypothesis, Catlin and 
Epstein (1992) examined the relations of scores on the BBI 
and self-reports of two kinds of highly significant emotional 
experiences. The two kinds of experiences were extreme life 
events, such as loss of a loved one, and the quality of rela- 
tionships with parents during early childhood. In support of 
hypothesis, both kinds of experiences were significantly and 
coherently related to basic beliefs. Often, the two kinds of ex- 
perience made independent, supplementary contributions to 
the prediction of the same basic belief. Of additional interest, 
the self-reported quality of childhood relationships with par- 
ents moderated the influence of extreme life events on basic 
beliefs. 


Summary and Conclusions Regarding Research 
Support for CEST 


In summary, the program of research on CEST has provided 
impressive support for its construct validity. The following 
basic assumptions of CEST have all been verified: There 
are two independent information-processing systems that op- 
erate in parallel by different rules. The systems are interac- 
tive, with each influencing the other, and the interaction 
occurs both sequentially and simultaneously. The influence 
of experiential processing on rational processing is of partic- 
ular importance, as it identifies a process by which people’s 
automatic, preconscious, experiential processing routinely 
biases their conscious rational thinking. The experiential sys- 
tem is an associative, rapid, concretist, primarily nonverbal 
system that is intrinsically highly compelling to the extent 
that it can override the rational system, leading people to “be- 
have against their better judgment.” 

When people are aware of the maladaptive thoughts gen- 
erated by their automatic experiential processing, they often 
correct the thoughts through more deliberative reasoning in 
their rational systems. There are reliable individual differ- 
ences in the efficacy or intelligence of the experiential system. 
The intelligence of the experiential system is independent of 
the intelligence of the rational system and is more strongly as- 
sociated with a variety of indexes of success in living than is 
the intelligence of the rational system. Included are work suc- 
cess, social facility, absence of drug and alcohol abuse, and 
mental and physical well-being. There are reliable individual 
differences in experiential and rational thinking styles. The 
two thinking styles exhibit coherent patterns of relations 
with a variety of criterion variables. There are also reliable 
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individual differences in the four basic beliefs proposed by 
CEST. As the basic beliefs influence behavior simultaneously 
in the form of compromises, they serve as checks and bal- 
ances against each other. 


IMPLICATIONS OF COGNITIVE-EXPERIENTIAL 
SELF-THEORY FOR PSYCHOTHERAPY 
AND RESEARCH 


Implications for Psychotherapy 


For psychotherapy to be effective, it is necessary according to 
CEST for changes to occur in the experiential system. This is 
not meant to imply that changes in the rational system are 
of no importance, but rather to suggest that changes in the 
rational system are therapeutic only to the extent that they 
facilitate changes in the experiential system. 

There are three basic ways of producing changes in the ex- 
periential system. These include the use of the rational system 
to correct and train the experiential system, the provision of 
emotionally significant corrective experiences, and commu- 
nicating with the experiential system in its own medium— 
namely fantasy, imagery, metaphor, concrete representations, 
and narratives. These three approaches provide a unifying 
framework for a wide variety of approaches in psychother- 
apy, including insight approaches, cognitive-behavioral ap- 
proaches, and experiential approaches, including gestalt 
therapy and psychosynthesis (Epstein, 1994, 1998). 


Using the Rational System to Correct 
the Experiential System 


The rational system has an important advantage over the ex- 
periential system in that it can understand the experiential 
system, whereas the reverse is not true. Thus, one way the 
rational system can be used to improve the functioning of the 
experiential system is by teaching people to understand 
the operation of their experiential systems. Almost everyone 
is aware of having conflicts between the heart and the head as 
well as having unbidden distressing thoughts that they can 
not consciously control. These are not deep, dark, inaccessi- 
ble thoughts, but rather ones of which people are acutely 
aware. Beginning with a discussion of such reactions, it 
should not be difficult to convince people that they operate by 
two independent systems. The next step is to teach them 
about the operating principles of the experiential system and 
the manner in which it influences their behavior and biases 
their conscious thought. They then can be helped to under- 
stand that their problems are almost always in their automatic 
experiential processing, not in their conscious thinking. Not 


only is such knowledge useful for correcting and training the 
experiential system, but it also provides a useful foundation 
for the other two approaches. 

One of the important advantages of clients’ recognizing 
that their problems lie primarily in their experiential and not 
their rational systems is that it reduces resistance and defen- 
siveness because they no longer have to defend the reason- 
ableness of their behavior. For example, if a client engages in 
excessive rational discourse and feels compelled to defend 
his or her behavior as reasonable, the therapist can remind the 
client that the experiential system does not operate by logic. 
Rather, what is important is to uncover the maladaptive be- 
liefs and needs in the experiential system and ultimately 
change them in a constructive way. 

Uncovering implicit beliefs in the experiential system can 
be accomplished in several ways. One way is by noting repet- 
itive behavior patterns, and in particular becoming aware of 
sensitivities, compulsions, and ego-alien behavior, and be- 
coming aware of the situations in which they arise. A second 
way is by using fantasy to vicariously explore reactions to 
different situations. A third way is by attending to emotional 
reactions, vibes, and the kinds of automatic thoughts that 
instigate them. 

Emotional reactions are particularly revealing according to 
CEST because they provide a royal road to the important 
schemas in people’s implicit theories of reality. They do this in 
two ways. First, whenever an event elicits a strong emotional 
response, it indicates that a significant schema in a person’s 
implicit theory of reality has been implicated. Accordingly, by 
noting the events that elicit emotional responses, some of 
the more important schemas in a person’s theory of reality 
can be determined. Second, emotions can be used to infer 
schemas through knowledge of the relation between specific 
thoughts and specific emotions (e.g., Averill, 1980; Beck, 
1976; Ellis, 1973; Epstein, 1983, 1984; Lazarus, 1991). This 
relation has been well documented by the clinical observa- 
tions of cognitive-behavioral therapists (e.g., Beck, 1976; 
Ellis, 1973) and by research that has examined the relation of 
thoughts and emotions in everyday life (e.g., Averill, 1980; 
Epstein, 1983). It follows from the relation of automatic 
thoughts to emotions that people who characteristically have 
certain emotions characteristically spontaneously think in 
certain ways. For example, angry people can be assumed to 
have the implicit belief that people often behave badly and de- 
serve to be punished, frightened people can be assumed to 
have the implicit belief that the world is dangerous and they 
should be prepared for flight, and sad people can be assumed 
to have the implicit belief that they have sustained an irre- 
placeable loss, or that they are inadequate, bad, or unlovewor- 
thy people, and there is nothing they can do about it. 
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The most obvious way in which the rational system can be 
used to correct maladaptive feelings and behavior is by detect- 
ing and disputing the automatic thoughts that precede the 
feelings and behavior, a technique widely practiced by 
cognitive-behavioral therapists (e.g., Beck, 1976; Ellis, 1973). 
Clients can be taught to attend to the automatic thoughts that 
immediately precede troublesome emotions and behavior. By 
recognizing these thoughts as destructive and repeatedly sub- 
stituting more constructive ones, they often can change the 
maladaptive emotions and behavior that had been instigated 
by the thoughts. 

Another way that people can employ their rational system 
to correct their experiential processing is by understanding 
the value of real-life corrective emotional experiences. 
Clients can be helped to understand how their biased inter- 
pretations and habitual reaction tendencies—particularly 
those involving sensitivities and compulsions—have served 
to maintain their maladaptive reactions in the past, and how 
changing them can allow them to have and learn from poten- 
tially corrective experiences. 

The rational system can also be employed to teach people 
about the rules of operation of the two systems, the weak- 
nesses and strengths of each system, and the importance of 
using the two systems in a supplementary manner. They 
should understand that neither system is superior to the other, 
and that each has certain advantages and limitations. They 
should appreciate that each processing mode can provide use- 
ful guidance and each can lead one astray when not checked 
by the other. As an example of how the two systems can be 
used together when making an important decision, a client can 
be told to ask him- or herself, “How do I feel about doing this, 
what do I think about doing it, and considering both, what 
should I do?” In evaluating the wisdom of behaving according 
to one’s feelings, it is helpful to consider the influence of past 
experiences on current feelings (particularly when sensitivi- 
ties are implicated), and to consider how appropriate the past 
experiences are as a guide for reacting to the present situation. 


Learning Directly from Emotionally 
Significant Experiences 


As its name implies, the essence of the experiential system is 
that it is a system that learns from experience. It follows that 
the most direct route for changing maladaptive schemas in the 
experiential system is to provide corrective experiences. One 
way to accomplish this is through the relationship between 
client and therapist. This procedure is particularly emphasized 
in psychoanalytic transference relationships. Another way 
to learn directly from experience is by having corrective 
emotionally significant experiences in everyday life. As 


previously noted, it can be very useful in this respect for 
clients to gain insight into their biasing interpretations and 
self-verifying behavior. In the absence of such insight, poten- 
tially corrective experiences can be misinterpreted in a way 
that makes them contribute to the reinforcement rather than 
extinction of their destructive thoughts and behavior patterns. 
Having emphasized the contribution of insight, a caveat is in 
order concerning valuing it too highly and considering it a 
necessary condition for improvement. Although insight can be 
very useful, it is not a necessary condition for improvement. It 
is quite possible for change to occur in the experiential system 
in the absence of intellectual understanding of the process, 
which, of course, is the way nonhuman animals as well as peo- 
ple who are not in therapy normally learn from experience. 
Many a novel has been written about cures through love. In 
fact, for clients who are nonintellectual, corrective experi- 
ences in the absence of insight may be the only way to proceed 
in therapy. In the absence of recognizing the limited value of 
intellectual insight, there is the danger that therapists will in- 
sufficiently attend to the experiential aspects of therapy. 


Communicating with the Experiential System 
in Its Own Medium 


Communicating with the experiential system in its own 
medium refers to the use of association, metaphor, imagery, 
fantasy, and narrative. Within the scope of this chapter, it is im- 
possible to discuss all of these procedures or even to discuss 
any in detail. It is important to recognize in this regard that 
there is no single kind of therapy that is specific to CEST. 
Rather, CEST is an integrative personality theory that pro- 
vides a framework for placing into broad perspective a variety 
of therapies. For present purposes, it will suffice to present 
both a simple and a more complicated example of how com- 
munication with the experiential system in its own medium 
can be used therapeutically. 

The simple example concerns a person who under the 
guidance of a therapist visualizes a situation to learn how he 
might react to the situation in real life. The procedure is based 
on the assumption that the experiential system reacts to visu- 
alized events in a similar way as to real events, an assumption 
supported by research expressly designed to test it (Epstein & 
Pacini, 2001). 

Robert exhibited a life pattern of ambivalence about get- 
ting married. Recently, the woman he had been dating for 
several years gave him an ultimatum. She demanded that 
either he pronounce his intention to marry her or she would 
leave him. Robert loved her dearly, but he did not feel ready 
for marriage. He had always assumed he would settle down 
and raise a family, but somehow whenever he came to the 
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point of committing himself, something went wrong with the 
relationship, and he and his partner parted ways. At first, 
Robert attributed the partings to failings in his partners, but 
after repeated reenactments, it occurred to him that he might 
be ambivalent about marriage. Because this made no sense to 
him, he decided to seek the help of a therapist. The therapist 
instructed and trained Robert to vividly imagine being mar- 
ried and coming home to his wife and children after work. 
When he had the scene clearly in mind, he was asked to care- 
fully attend to his feelings. To his surprise, he felt irritated 
and burdened when his wife greeted him at the door and the 
children eagerly began relating the events of the day. The 
therapist then instructed Robert to imagine another scene in 
which he had the very same feelings. His mind turned to his 
childhood, and he had an image of taking care of his younger 
siblings when his parents went out for entertainment. He 
deeply resented having to take care of them frequently and 
not being able to play with his peers. The result was that he 
learned to dislike interacting with children at the experiential 
level, but had never articulated this feeling at the rational 
level. 

As an adult, although Robert believed in his conscious, ra- 
tional mind that he wanted to get married and raise a family, 
in his experiential mind, the thought of being in the company 
of children produced unpleasant vibes. He and his therapist 
discussed whether he should follow his heart or his mind. In 
order to help him to decide, the therapist pointed out that fol- 
lowing his heart would be the path of least resistance. He 
added that if Robert decided to follow his mind, it would be 
important for him to work on overcoming his negative feel- 
ings toward children. When Robert decided that is what he 
wanted to do, he was given an exercise to practice in fantasy 
that consisted of scenes in which Robert engaged in enjoy- 
able activities with children. He was also encouraged to visu- 
alize whatever occasions he could remember from his 
childhood in which he enjoyed being with his siblings. He 
was given other scenes to imagine, including being pleased 
with himself for behaving as a better parent to his imaginary 
children than his parents had behaved to him. 

The more complex example is taken from a book by Alice 
Epstein (1989) in which she described her use of fantasy and 
other procedures designed to communicate with her experi- 
ential system. She attributed a surprisingly rapid reorganiza- 
tion of her personality to this procedure. She also believed 
that the change in her feelings that accompanied the change 
in her personality contributed to a dramatic recovery from a 
life-threatening illness against all odds. 

Alice began psychotherapy after receiving a diagnosis of 
terminal cancer and being informed that she would not likely 


live more than three months. The statistics at that time of her 
diagnosis on the outcome of a metastasized hypernephroma, 
the form of kidney cancer that she had, indicated that no more 
than 4 in 1000 cases experienced remission from the disease, 
let alone cure. Now, many years after that diagnosis, Alice has 
no detectable signs of cancer and has been considered cured 
for more than 15 years. Whether her belief that the psychother- 
apy actually saved her life is correct is not at issue here. What 
is of primary interest is the rapid resolution of deep-seated 
problems through the use of fantasy that usually require a pro- 
longed period of intensive psychotherapy. However, given in- 
creasing evidence of the relation of emotions to the immune 
system, it would be unwise to summarily reject her belief that 
her psychological recovery contributed to her physical recov- 
ery. It is possible that the experiential system has a relation to 
physical well-being much stronger than orthodox medicine 
recognizes. 

The following is one of the early fantasies described by 
Alice in her book: In the session preceding the fantasy, she 
had expressed hostility toward her mother for her mother’s 
behavior to her during a period of extended turmoil in the 
household. During that period, the mother surprisingly gave 
birth to Alice’s younger sister after denying being pregnant 
and attributing the change in her appearance to a gain in 
weight from eating too much. During the same period, the 
mother’s mother, who shared the household with the family, 
and to whom the mother was deeply attached, was dying of 
cancer. After the session in which Alice Epstein (1989) ex- 
pressed her hostility to her mother, she experienced a pro- 
longed feeling of isolation and loneliness that lasted until 
she reported and discussed the following fantasy with her 
therapist. 


My therapist and I decided to try the same technique to try to 
understand my intense discomfort at being alone. Visualizing 
isolation was much more difficult than visualizing pain. After 
many attempts that we both rejected as trivial, I finally caught the 
spirit of what I was experiencing. I saw some figures with 
shrouds—very unclear. Then as they took on a more distinct 
form, I saw that they were witches standing around a fire. My 
therapist told me to ask them to come over to talk to us. They 
were frightening to me in the light of the fire, but they were more 
horrible as they came closer. They laughed at me and started to 
poke at me with their sticks. The visualization was so real and 
their presence was so chilling to me that I burst into tears over 
the interaction with them. 

My therapist told me to ask them what I could do to get 
rid of the awful fear of isolation. Finally they revealed their 
price. It was that I make a sacrifice so that they could be- 
come beautiful and mingle with other people. When I heard 
their price I began to tremble. In an almost inaudible voice I 
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whispered, ‘They want my children so they can turn them 
into witches like them, but I’ll never do it. P’ l never give them 
my children!’ 

My therapist then told me to destroy them, but I told him that 
I couldn’t possibly do it. He urged me to try to turn my fear to 
wrath, to try to imagine a creature that could help me. The image 
that came to me was a white winged horse. He told me to mount 
the horse and to supply myself with a weapon that would destroy 
them. I refused to kill them myself, but said that the wings of the 
horse would fan the flames of their fire, which would turn back 
on them and destroy them. 

There was only one problem with this scenario—the horse 
and I were one now and I couldn’t get airborne. The wings were 
so heavy that I couldn’t flap them hard enough to catch the 
breeze. The harder I tried, the more I failed and the more the 
witches laughed at me. My therapist... told me that another 
horse who loved the first horse very much would join her and to- 
gether they would destroy the witches. The other horse flew 
above me and made a vacuum into which I could take off. Once 
in the air, I flew effortlessly and fanned the fire into a huge blaze. 
The witches ran here and there trying to avoid the flames but in 
the end they were consumed by the fire. 

I practiced the scene over and over again until it became 
easy, but I never enjoyed it. I liked to fly, but I felt sorry for the 
witches, no matter how mean they were to me. My therapist felt 
that it was a mistake to feel sympathy for them because they 
would take advantage of any mercy that I displayed. He felt they 
would use any deception and illusion they could to control me. 
I was not so sure but I did agree with him that I must assume the 
right to soar into the world and be free of their influence. After 
the session, my therapist and I discussed the meaning of the im- 
ages. Although I had begun with the concept of isolation in 
mind, I knew that the witches related to my mother, particularly 
the way she would poke at me and shame me. They probably 
represented my fear of isolation if I did not acquiesce to her de- 
mands. My therapist added that in destroying the witches I was 
only destroying the hostile part of our relationship, the witch 
part of it, and leaving the loving part intact. This was necessary 
for me to be free, autonomous, and no longer ensnared by fear 
of abandonment. 

The concept that I had a great deal of conflict between the 
need for association and the need for autonomy was not new. I 
believed I had to buy affection and that no one would love me if 
I were myself, i.e., if I attended to my own wants. I knew also 
that I felt that I had to carry the burden of being responsible for 
my mother’s well-being, that she would die at some level if I 
broke the bond with her. (pp. 45-47) 


There are several aspects of this fantasy that warrant further 
comment. First, it is noteworthy that the only aspect that 
reached awareness before the fantasy was an enduring feel- 
ing of loneliness and isolation. The source of the feeling and 


its associations remained unconscious until they were dealt 
with at the experiential level and perhaps assimilated at the 
rational level. 

Second, the insight represented in the fantasy—namely, 
that Alice had a conflict between autonomy and related- 
ness—was not new to her. As she noted, she had been con- 
sciously aware of this conflict before. What, then, did the 
fantasy accomplish? What it accomplished was to produce a 
vicarious corrective emotional experience that had a pro- 
found effect at the experiential level. The previous intellec- 
tual insight in the absence of involvement of the experiential 
system had accomplished little. To make a therapeutic contri- 
bution, the same information had to be felt and processed 
experientially. 

Third, the fantasy provided useful diagnostic clues for the 
psychotherapist. Alice, apparently, could not free herself from 
the hold of the bad mother figure until a loving figure sup- 
ported her independence, after which she could soar freely. 
This suggested that what she needed to resolve her conflict 
was to be convinced at a compelling experiential level that it 
is possible to be autonomous and loved at the same time. This 
was duly noted by her therapist, who made a point of encour- 
aging its implementation in her family, as well as supporting it 
himself in the therapeutic relationship. 

Fourth, the fantasy illustrates the usefulness of vicarious 
symbolic experience as a therapeutic tool. Alice sponta- 
neously began to practice in fantasy enjoying the feeling of 
soaring freely into space, and as a result she was able to gain 
a newfound freedom without guilt or fear of abandonment. 
What she learned through the fantasy at a deep experiential 
level suggests a therapeutic technique that may be more gen- 
erally useful—namely, the practice in symbolic form of cop- 
ing with a deep-seated problem that cannot be resolved by 
intellectual insight. Of additional general value of this exam- 
ple of a spontaneous fantasy is that it indicates how such fan- 
tasies can provide diagnostic information that can be useful 
in therapy. 

There is, of course, no way of knowing the extent to which 
the use of fantasy relative to other factors, such as having 
a highly supportive environment, played in Alice’s rapid 
progress. It is very likely that both contributed. However, it 
should be considered in this respect that the equally favor- 
able environment before the therapy was insufficient to re- 
solve Alice’s conflict between autonomy and relatedness. As 
she reported, the love and affection that were abundantly 
available to her from her husband, her children, her extended 
family, and her deeply caring friends could not penetrate, so 
long as she felt that the price of love was the sacrifice of au- 
tonomy. Having developed a lifelong pattern of self-sacrifice 
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in order to maintain relationships, she had no way of learning 
before therapy that it was unnecessary. 


Implications for Research 


If there are two different information-processing systems, it 
can only be a source of confusion to conduct research as if 
there were only one, which is the customary practice. As an 
example, given the existence of two different systems, it is 
meaningless to investigate “the” self-concept because a per- 
son’s self-concept in one system may not conform to the self- 
concept in the other system. Moreover, the difference between 
the two self-concepts can be of considerable importance in its 
own right. The problem of treating the two self-concepts as if 
there were only one has been particularly evident in research 
on self-esteem, in which individuals are typically classified as 
high or low in self-esteem based on self-report questionnaires. 
Yet if there are two self-concepts, then it is quite possible for 
people to be high in self-esteem in one system and low in the 
other. For example, a person might be high in self-esteem in 
the rational system, as measured by a self-report test, yet low 
in self-esteem in the experiential system, as inferred from be- 
havior (Savin-Williams & Jaquish, 1981). 

There has been much disagreement concerning whether 
elevating students’ self-esteem by treating them as successful 
no matter what their performance is desirable or undesirable. 
In order to resolve this issue, from the perspective of CEST it 
is necessary to recognize that high self-esteem at the con- 
scious, rational level may coexist with low self-esteem at the 
experiential level. It is one thing to teach students to con- 
sciously believe they have high self-regard and another to 
have them acquire the quiet confidence that comes from feel- 
ings of mastery and competence that are a consequence of 
real accomplishment. The former in the absence of the latter 
can be considered to be no more than self-deception and a po- 
tential source of disillusionment in the future. It follows that 
not only is it important to examine self-esteem separately in 
each of the two systems, but it is equally important to conduct 
research on their convergence. What is obviously true of self- 
esteem in this respect is equally true of other personality vari- 
ables, including basic needs and beliefs. 

Although the importance of four basic needs and corre- 
sponding beliefs is emphasized in CEST, this is not meant to 
imply that lower-level beliefs and needs are not also very im- 
portant. Recently, social and personality psychologists have 
emphasized midlevel motivational constructs (e.g., Emmons, 
1986; Markus & Nurius, 1986; Mischel & Shoda, 1995). It is 
assumed in CEST that personality is hierarchically organized, 
with broad, basic needs subsuming midlevel motives, which 
in turn subsume narrower, situation-specific motives. It would 


therefore be desirable to examine the organization of such 
needs and beliefs, and to determine in particular the kinds of 
relations the different levels establish with each other, as well 
as with other variables. It might reasonably be expected that 
the lowest-order needs and beliefs are most strongly associ- 
ated with situationally specific behaviors, and the higher-order 
beliefs and needs are more strongly associated with broad dis- 
positions, or traits. The higher-order beliefs and needs can also 
be expected to be more weakly but more extensively associ- 
ated with narrow behavioral tendencies. Midlevel motives can 
be expected to have relations that fall in between those of the 
higher- and lower-order needs. A particularly important hy- 
pothesis with regard to CEST is that higher-order needs and 
beliefs are more resistant to change than are lower-order needs 
and beliefs, but should they be changed they have greater ef- 
fects on the overall personality structure. Moreover, any major 
changes, including positive changes, are disorganizing and 
anxiety-producing because of the basic need to maintain the 
stability of the conceptual system. 

Although considerable research has recently been con- 
ducted on midlevel needs that has demonstrated their theoret- 
ical importance and predictive value (e.g., Emmons, 1986; 
Markus & Nurius, 1986; Mischel & Shoda, 1995), the ques- 
tion remains as to how the midlevel needs can best be desig- 
nated and measured. The most thorough and compelling list 
of midlevel needs to date still appears to be the list proposed 
by Henry A. Murray (1938) many years ago. It is interesting 
from the perspective of CEST that Murray measured mid- 
level needs both explicitly via direct self-report and implic- 
itly through the use of the Thematic Apperception Test (TAT; 
Murray, 1943). Amore psychometrically advanced procedure 
for measuring the Murray midlevel needs at the explicit level 
has since become available in the form of the Edwards Per- 
sonal Preference Schedule (Edwards, 1959). 

There is a need for research to further explore the TAT as 
a measure of implicit needs and to also examine additional 
measures of implicit needs. Included could be older proce- 
dures such as word association and sentence completion, as 
well as promising new procedures such as priming tech- 
niques and subthreshold measures (see Bargh & Chartrand, 
in press, for a review of such techniques). It would be inter- 
esting to relate the various implicit measures to each other to 
determine whether they have enough in common to combine 
them into an overall measure. The implicit measure (or mea- 
sures) of needs could then be related to explicit measures of 
needs, and both could be related to external criteria. Through 
such procedures it should be possible to determine in what 
ways implicit and explicit measures are similar and different. 
It could also be determined whether they contribute in a sup- 
plementary way to the prediction of the same variables and 


whether the degree to which they coincide in individuals is an 
important personality variable, as assumed in CEST. It would 
be informative to determine what kinds of combinations of 
implicit needs usually result in compromises, what kinds usu- 
ally result in conflict, and how this differs among individuals. 
Such research would not only be of theoretical importance, 
but would also have important implications for the diagnosis 
of sources of distress and for therapy. 

Although considerable research has been done with the 
CTI that has supported its construct validity (see review in 
Epstein, 2001), there are many areas that could profit from fur- 
ther research with it. One such area is the predictive value of 
the CTI for success in a variety of work situations that have not 
yet been investigated. It would be interesting, for example, to 
conduct a study comparing the contribution of intellectual in- 
telligence, as measured by a standard intelligence test, and ex- 
periential intelligence, as measured by the CTI, for predicting 
performance in graduate school and beyond. A hypothesis 
derived from CEST and consistent with previous research 
(Epstein, 2001) is that intellectual intelligence is a stronger 
predictor of grades and scores on paper-and-pencil tests, 
whereas constructive thinking is a stronger predictor of practi- 
cal performance. The latter could be indicated by demonstra- 
tions of research productivity and creativity, by length of time 
to complete the PhD degree, and by successful professional 
performance after obtaining the PhD degree. 

As noted previously, with the aid of a newly constructed 
instrument, the Rational-Experiential Inventory (REI; 
Epstein et al., 1996; Norris & Epstein, 2000a, 2000b; Pacini & 
Epstein, 1999b), it is possible to study the effects of individual 
differences in processing in each of the two modes. Of partic- 
ular interest is the independent contribution of each of the 
modes for predicting well-being and performance in different 
kinds of activities. Although a promising beginning has been 
made in this area, there is a need for more extensive research, 
particularly with the use of objective rather than self-reported 
dependent variables. 

An important area of research with both practical and theo- 
retical implications is the relation of the two thinking styles to 
receptivity to different kinds of messages. The one research 
project that has been completed on this issue (Rosenthal & 
Epstein, 2000) has produced interesting results consistent with 
CEST and suggests that it is a promising area for further re- 
search. It remains to be determined how each of the processing 
styles—separately and in combination—is related to receptiv- 
ity to messages regarding politics, advertising, and health- 
related behaviors such as smoking and sexual risk-taking. 

An area of particular theoretical and practical importance 
is the influence of the experiential system on the rational sys- 
tem. As previously noted, this relation can account for the 
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paradoxical irrationality exhibited by humans despite their 
unique capacity for rational reasoning. The influence of ex- 
periential on rational processing is assigned an extremely im- 
portant role in CEST, equivalent to the influence of the 
Freudian unconscious in psychoanalysis. It is therefore 
important from the perspective of CEST to conduct further 
research to demonstrate the influence of experiential on ratio- 
nal processing under various conditions. Relatedly, it is im- 
portant to test the hypotheses that such influence is often 
mediated by feelings, the identification of which, accord- 
ingly, can be helpful as a first step in controlling the influence 
of the experiential on the rational system. 

Research is needed on the positive contributions of expe- 
riential processing to creativity, wisdom, and physical and 
mental well-being. It is important in this respect to determine 
how people can most effectively influence and learn from 
their experiential systems by communicating with these 
systems in their own medium, as illustrated in the case his- 
tory that was presented. You will recall that Alice, by practic- 
ing soaring freely and unaided in fantasy, was able to accept 
the belief, at a deep experiential level, that it is possible to be 
an autonomous being without fear of rejection in a way that 
intellectual insight was unable to accomplish. It will be inter- 
esting to determine how effective such symbolic rehearsal is 
more generally as a way of resolving deep-seated conflicts at 
the experiential level. 


SUMMARY AND CONCLUSIONS 


Cognitive-experiential self-theory (CEST) is a psychody- 
namic global theory of personality that substitutes a different 
kind of unconscious processing for the Freudian unconscious. 
Unlike the maladaptive Freudian unconscious, the uncon- 
scious of CEST is an adaptive, associative learning system. It 
is the same system with which higher-order animals have in- 
creasingly effectively adapted to their environments over mil- 
lions of years of evolution. Because it is a system that learns 
from experience, it is referred to as the experiential system. In 
addition to an experiential system, humans uniquely have a 
rational system. The rational system is a logical, inferential 
system that operates with the aid of language. The experien- 
tial system can account for the widespread irrationality in the 
thinking of humans despite their unique capacity for reason- 
ing rationally by recognizing that it biases conscious thinking 
automatically and outside of awareness. 

The operating principles of the experiential system were 
described and contrasted with those of the rational system. 
Although the systems are independent in the sense that 
they operate by different rules, they nevertheless are highly 
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interactive. The two systems usually operate in synchrony and 
produce compromises between them, but sometimes they 
conflict with each other, resulting in what are commonly re- 
ferred to as conflicts between the heart and the head. A re- 
search program was described that provided support for many 
of the assumptions in CEST. The implications of CEST were 
discussed for psychotherapy and psychological research. 

It was noted that neither system is superior to the other. 
They are simply different ways of understanding the world 
and behaving in it. The experiential system is intimately as- 
sociated with emotions and adapts by learning from out- 
comes. The rational system is a affect-free and adapts by 
logical inference. Each has its advantages and disadvantages. 
Although the rational system is responsible for remarkable 
achievements in science and technology, it is less well suited 
for everyday living than is the experiential system. Moreover, 
the experiential system can intuitively and holistically solve 
some problems that are beyond the capacity of the analytical, 
rule-based reasoning of the rational system (Hammond, 
1996). The experiential system is also a source of some of hu- 
mankind’s most desirable attributes, including the capacity 
for passion, compassion, love, creativity, and appreciation of 
aesthetics. However, it is also a source of serious difficulties, 
including superstitious thinking, prejudice, violence, and— 
perhaps most important—undermining people’s ability to 
think rationally. Thus, the experiential system is a mixed 
blessing; it is difficult to live with it, but it would be impossi- 
ble to live without it. 


REFERENCES 


Alloy, L. B., & Abramson, L. Y. (1988). Depressive realism: Four 
theoretical perspectives. In L. B. Alloy (Ed.), Cognitive processes 
in depression (pp. 167-232). New York: Guilford Press. 


Allport, G. W. (1961). Pattern and growth in personality. New 
York: Holt, Rinehart, and Winston. 


Averill, J. R. (1980). A constructionist view of emotion. In R. 
Plutchik & H. Kellerman (Eds.), Emotion, theory, research, and 
experience: Vol. 1. Theories of emotion. New York: Academic 
Press. 


Bargh, J. A. (1989). Conditional automaticity: Varieties of auto- 
matic influence in social perception and cognition. In J. S. 
Uleman & J. A. Bargh (Eds.), Unintended thought (pp. 3-51). 
New York: Guilford Press. 


Bargh, J. A., & Chartrand, T. L. (in press). Studying the mind in the 
middle: A practical guide to priming and automaticity research. 
In H. Reis & C. Judd (Eds.), Research methods in the social 
sciences. New York: Cambridge University Press. 

Beck, A. T. (1976). Cognitive therapy and the emotional disorders. 
New York: International Universities Press. 


Bowlby, J. (1988). A secure base. New York: Basic Books. 


Catlin, G., & Epstein, S. (1992). Unforgettable experiences: The 
relation of life-events to basic beliefs about the self and world. 
Social Cognition, 10, 189-209. 

Chaiken, S., & Maheswaren, D. (1994). Heuristic processing can 
bias systematic processing: Effects of resource credibility, argu- 
ment ambiguity, and task importance on attitude adjustment. 
Journal of Personality and Social Psychology, 66, 460-473. 


Chaiken, S., & Trope, Y. (1999). Dual-process theories in social 
psychology. New York: Guilford Press. 


Costa, P. T., & McCrae, R. R. (1989). NEO/FFI: Manual Supple- 
ment. Odessa, FL: Psychological Assessment Resources. 


Denes-Raj, V., & Epstein, S. (1994). Conflict between experiential 
and rational processing: When people behave against their better 
judgment. Journal of Personality and Social Psychology, 66, 
819-829. 


Denes-Raj, V., Epstein, S., & Cole, J. (1995). The generality of the 
ratio-bias phenomenon. Personality and Social Psychology Bul- 
letin, 10, 1083-1092. 


Dollard, J., & Miller, N. E. (1950). Personality and psychotherapy: 
An analysis in terms of learning, thinking, and culture. New 
York: McGraw-Hill. 


Donovan, S., & Epstein, S. (1997). The difficulty of the Linda con- 
junction problem can be attributed to its simultaneous concrete 
and unnatural presentation, and not to conversational implica- 
ture. Journal of Experimental Social Psychology, 33, 1-20. 


Edwards, A. L. (1959). Manual for Edwards Personal Preference 
Schedule (Rev. ed). New York: Psychological Corporation. 


Edwards, K. (1990). The interplay of affect and cognition in attitude 
formation and change. Journal of Personality and Social Psy- 
chology, 59, 202-216. 


Ellis, A. (1973). Humanistic psychotherapy. New York: McGraw- 
Hill. 


Emmons, R. A. (1986). Personal strivings: An approach to personal- 
ity and subjective wellbeing. Journal of Personality and Social 
Psychology, 51, 1058-1068. 


Epstein, A. (1989). Mind, fantasy, and healing. New York: Dela- 
corte. (This book is out of print, but copies can be obtained from 
Amazon.com or from Balderwood Books, 37 Bay Road, 
Amherst, MA 01002 by enclosing a check for $18.00, which 
includes postage.) 


Epstein, S. (1979a). Natural healing processes of the mind: Pt. 1. 
Acute schizophrenic disorganization. Schizophrenic Bulletin, 5, 
313-321. 


Epstein, S. (1983). A research paradigm for the study of personality 
and emotions. In M. M. Page (Ed.), 1982 Nebraska Symposium 
on motivation: Personality—Current theory and research (pp. 
91-154). Lincoln: University of Nebraska Press. 

Epstein, S. (1984). Controversial issues in emotion theory. In 
P. Shaver (Ed.), Annual review of research in personality and 
social psychology (pp. 64-87). Beverly Hills, CA: Sage. 


Epstein, S. (1987). Implications of cognitive self-theory for psy- 
chopathology and psychotherapy. In N. Cheshire & H. Thomae 
(Eds.), Self, symptoms and psychotherapy (pp. 43-58). New 
York: Wiley. 

Epstein, S. (1990). Cognitive-experiential self-theory. In L. Pervin 
(Ed.), Handbook of personality theory and research: Theory and 
research (pp. 165-192). New York: Guilford. 


Epstein, S. (1993). Emotion and self-theory. In M. Lewis & J. 
Haviland (Eds.), The handbook of emotions (pp. 313-326). New 
York: Guilford. 


Epstein, S. (1994). Integration of the cognitive and the psychody- 
namic unconscious. American Psychologist, 49, 709-724, 

Epstein, S. (1998). Constructive thinking: The key to emotional 
intelligence. Westport, CT: Praeger. 


Epstein, S. (2001). Manual for the Constructive Thinking Inventory. 
Lutz, FL: Psychological Assessments Resources. 

Epstein, S., Denes-Raj, V., & Pacini, R. (1995). The Linda problem 
revisited from the perspective of cognitive-experimental self- 
theory. Personality and Social Psychology Bulletin, 11, 1124- 
1138. 

Epstein, S., & Donovan, S. (1995). [Is the Linda problem a true con- 
junction problem? Implications from frequency versus probabil- 
ity versions and individual differences in analytical style.] 
Unpublished raw data. 

Epstein, S., Donovan, S., & Denes-Raj, V. (1999). The missing link 
in the paradox of the Linda conjunction problem: Beyond know- 
ing and thinking of the conjunction rule, the intrinsic appeal 
of heuristic processing. Personality and Social Psychology 
Bulletin, 25, 204-214. 

Epstein, S., Lipson, A., Holstein, C., & Huh, E. (1992). Irrational re- 
actions to negative outcomes: Evidence for two conceptual sys- 
tems. Journal of Personality and Social Psychology, 62, 328-339. 

Epstein, S., & Morling, B. (1995). Is the self motivated to do more 
than enhance and verify itself? In M. H. Kernis (Ed.), Efficacy, 
agency, and self-esteem (pp. 9-29). New York: Plenum Press. 

Epstein, S., & Pacini, R. (1995). [The Linda problem: A true con- 
junction problem, or does the conjunction rule not apply to mul- 
tiple attributes in a single person?] Unpublished raw data. 


Epstein, S., & Pacini, R. (2001). A comparison of the influence of 
imagined and unimagined verbal information on intuitive and 
analytical information processing. Imagination, Cognition, and 
Personality, 20, 195-216. 

Epstein, S., Pacini, R., Denes-Raj, V., & Heier, H. (1996). Individ- 
ual differences in intuitive-experiential and analytical-rational 
thinking styles. Journal of Personality and Social Psychology, 
71, 390-405. 

Fiske, S. T., & Taylor, S. E. (1991). Social cognition (2nd ed.). New 
York: McGraw-Hill. 

Freud, S. (1953). The interpretation of dreams. In J. Strachey (Ed. & 
Trans.), The standard edition of the complete psychological 
works of Sigmund Freud (Vols. 4 & 5). London: Hogarth. (Orig- 
inal work published 1900) 


References 183 


Gallistel, C. R., & Gelman, R. (1992). Preverbal and verbal count- 
ing and computation. Cognition, 44, 43-74. 


Gordon, R. A. (1997). Everyday life as an intelligence test [Special 
issue]. Intelligence, a Multidisciplinary Journal, 24, 203-320. 


Gottfredson, L. S. (1997). Why g matters: The complexity of every- 
day life [Special issue]. Intelligence, a Multidisciplinary Jour- 
nal, 24, 79-132. 

Hammond, K. R. (1996). Human judgment and Social policy. New 
York: Oxford University Press. 


Hixon, J. G., & Swann, W. B. (1993). When does introspection bare 
fruit? Self-reflection, self-insight, and interpersonal choices. 
Journal of Personality and Social Psychology, 64, 35-43. 


Hunter, J. E. (1983). Overview of validity generalization for the U.S. 
Employment Service (USES Test Research Report No. 43). 
Washington, DC: U.S. Department of Labor, Employment, and 
Training Administration. 

Hunter, J. E. (1986). Cognitive ability, cognitive aptitudes, job 
knowledge, and job performance. Journal of Vocational Behav- 
ior, 29, 340-362. 

Hunter, J. E., & Hunter, R. F., (1984). Validity and utility of alterna- 
tive predictors of job performance. Psychological Bulletin, 96, 
72-98. 

Jefferson, L. (1974). These are my sisters. Garden City, NY: Anchor 
Press. 

Kahneman, D., Slovic, P., & Tversky, A. (1982). Judgment under 
uncertainty: Heuristics and biases. New York: Cambridge 
University Press. 

Kahneman, D., & Tversky, A. (1973). On the psychology of predic- 
tion. Psychological Review, 80, 237-251. 

Kirkpatrick, L. A., & Epstein, S. (1992). Cognitive-experiential 
self-theory and subjective probability: Further evidence for two 
conceptual systems. Journal of Personality and Social Psychol- 
ogy, 63, 534-544. 

Kohut, H. (1971). The analysis of the self. New York: International 
Universities Press. 

Lazarus, R. (1991). Emotion and adaptation. New York: Oxford 
University Press. 

Markus, H., & Nurius, P. (1986). Possible selves. American Psy- 
chologist, 41, 954-969. 

Miller, D. T., & Gunasegaram, S. (1990). Temporal order and the per- 
ceived mutability of events: Implications for blame assignment. 
Journal of Personality and Social Psychology, 59, 1111-1118. 

Mischel, W., & Shoda, Y. (1995). A cognitive-affective system 
theory of personality: Reconceptualizing situations, dispositions, 
dynamics, and invariance in personality structure. Psychological 
Review, 102, 246-268. 

Morling, B., & Epstein, S. (1997). Compromises produced by the 
dialectic between self-verification and self-enhancement. Jour- 
nal of Personality and Social Psychology, 73, 1268-1283. 

Murray, H. A. (1938). Explorations in personality. New York: 
Oxford University Press. 


184 Cognitive-Experiential Self-Theory of Personality 


Murray, H. A. (1943). Thematic Apperception Test manual. 
Cambridge, MA: Harvard University Press. 


Nisbett, R., & Ross, L. (1980). Human inference: Strategies and 
shortcomings of social judgment. Englewood Cliffs, NJ: Prentice 
Hall. 

Norris, P., & Epstein, S. (2000a). The measurement of analytical 
and intuitive thinking styles with a short form of the Rational- 
Experiential Inventory. Manuscript submitted for publication. 


Norris, P., & Epstein, S. (2000b). [Objective and subjective corre- 
lates of rational and experiential thinking styles]. Unpublished 
raw data. 


Pacini, R., & Epstein, S. (1999a). The interaction of three facets of 
concrete thinking in a game of chance. Thinking and Reasoning, 
5, 303-325. 


Pacini, R., & Epstein, S. (1999b). The relation of rational and expe- 
riential information processing styles to personality, basic 
beliefs, and the ratio-bias phenomenon. Journal of Personality 
and Social Psychology, 76, 972-987. 

Pacini, R., Muir, F., & Epstein, S. (1998). Depressive realism from 
the perspective of cognitive-experiential self-theory. Journal of 
Personality and Social Psychology, 74, 1056-1068. 

Pavlov, I. P. (1941). Conditioned reflexes in psychiatry (W. H. Gannt, 
Trans.). Madison, CT: International Universities Press. 

Rogers, C. R. (1951). Client-centered therapy: Its current practice, 
applications, and theory. Boston: Houghton Mifflin. 

Rosenthal, L., & Epstein, S. (2000). Rational and experiential think- 
ing styles as related to receptivity to messages syntonic and dys- 
tonic with thinking style. Unpublished manuscript, University of 
Massachusetts at Amherst. 


Savin- Williams, R. C., & Jaquish, G. A. (1981). The assessment of 
adolescent self-esteem: A comparison of methods. Journal of 
Personality, 49, 324-335. 


Smith, E. R., & DeCoster, J. (2000). Dual-process models in social 
and cognitive psychology: Conceptual integration and links to 
underlying memory systems. Personality and Social Psychology 
Review, 4, 108-131. 

Swann, W. B., Jr. (1990). To be known or to be adored: The interplay 
of self-enhancement and self-verification. In R. M. Sorrentino & 
E. T. Higgins (Eds.), Handbook of motivation and cognition: 
Foundations of social behavior (Vol. 2, pp. 408-448). New 
York: Guilford. 

Taylor, S. E., & Brown, J. D. (1988). Illusion and wellbeing: A social 
psychological perspective on mental health. Psychological 
Bulletin, 103, 193-210. 


Tolstoi, L. (1887). My confession. New York: Crowell. 


Tversky, A., & Kahneman, D. (1974). Judgment under uncertainty: 
Heuristics and biases. Science, 185, 1124-1131. 

Tversky, A., & Kahneman, D. (1982). Judgments of and by repre- 
sentativeness. In D. Kahneman, P. Slovic, & A. Tversky (Eds.), 
Judgment under uncertainty: Heuristics and biases. New York: 
Cambridge University Press. 

Tversky, A., & Kahneman, D. (1983). Extensional versus intuitive 
reasoning: the conjunction fallacy in probability judgment. 
Psychological Review, 90, 293-315. 

Yanko, J., & Epstein, S. (2000). [The intuitive knowledge of 
ratios in children without formal training in ratios and the 
interference produced by requesting explanations.] Unpublished 
raw data. 


CHAPTER 8 


Self-Regulatory Perspectives on Personality 


CHARLES S. CARVER AND MICHAEL F. SCHEIER 


BEHAVIOR AS GOAL DIRECTED 
AND FEEDBACK CONTROLLED 
Feedback Processes 186 
Re-emergent Interest in Approach and Avoidance 

HIERARCHICALITY AMONG GOALS 188 
Differentiating Goals by Levels of Abstraction 188 
Multiple Paths to High-Level Goals, Multiple Meanings 

from Concrete Acts 190 
Goals and the Self 190 

FEEDBACK LOOPS AND CREATION OF AFFECT 
Velocity Control 190 
Research Evidence 191 
Cruise Control Model 191 
Affect from Discrepancy-Enlarging Loops 
Merging Affect and Action 192 
Comparison with Biological Models of Bases of Affect 

RESPONDING TO ADVERSITY: PERSISTENCE 
AND GIVING UP 194 
Behavioral Manifestations 194 
Is Disengagement Good or Bad? 194 
Hierarchicality and Importance Can 

Impede Disengagement 195 
Watersheds, Disjunctions, and Bifurcations 
Among Responses 195 


186 


188 


190 


192 


193 


Personality is a difficult concept to pin down. By necessity it 
is a very broad concept because personality impinges on vir- 
tually all aspects of human behavior. This breadth is viewed 
differently by different theorists, however. As a result, many 
different approaches have been taken to thinking about and 
conceptualizing personality. The diversity in focus among the 
chapters in the first part of this volume attests very clearly to 
that fact. 

We were both trained as personality psychologists. 
Throughout our careers, however, our research interest has 
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focused on a set of issues regarding the structure of behavior. 
These issues link the concept of personality and its function- 
ing to a set of themes that might be regarded as representing 
the psychology of motivation. Our interest in how behav- 
ior occurs has taken us into a number of specific research 
domains—most recently health-related behavior and respon- 
ses to stress (Carver & Scheier, 2001; Scheier, Carver, & 
Bridges, 2001). However, these specific explorations have 
almost always occurred in service to a more general interest 
in the structure of behavior. 

What we mean by “the structure of behavior” is reflected 
in the issues underlying questions such as these: What is the 
most useful way to think about how people create actions 
from their intentions, plans, and desires? Once people have 
decided to do something, how do they stay on course? What 
is the relation between people’s values and their actions? 
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What processes account for the existence of feelings as 
people make their way through the world? 

As we have tried to address such questions, we have con- 
sistently returned to the idea that people are self-regulatory 
entities. That is, human behavior is an attempt to make some- 
thing occur in action that is already held in mind. Similarly, 
affects serve as self-regulatory controls on what actions take 
place and with how much urgency. 

The self-regulatory principles we emphasize in our writ- 
ings were not conceived as being a model of personality. 
However, the principles do turn out to provide an interest- 
ing perspective on personality. They suggest some implica- 
tions about how personality is organized and expressed in 
people’s actions. These principles also point to some of the 
issues that are involved in successfully negotiating the world. 
The principles we emphasize deal most explicitly with 
the “process” aspect of personality—the functions that make 
everyone a little bit alike—but they can also be seen to have 
implications for the individual differences that are part of 
personality psychology. 

This chapter is organized as a series of conceptual themes 
that reflect this self-regulatory perspective on personality. We 
start with basic ideas about the nature of behavior and some 
of the processes by which we believe behavior is regulated. 
We then turn to emotion—how we think it arises and a way in 
which two classes of affects differ from each other. This leads 
to a discussion of the fact that people sometimes are unable 
to do what they set out to do and of what follows from that 
problem. The next sections are more speculative and reflect 
emerging themes in thinking about behavior. They deal with 
dynamic systems, connectionism, and catastrophe theory as 
models for behavior and how such models may influence 
how people such as ourselves view self-regulation. 


BEHAVIOR AS GOAL DIRECTED AND 
FEEDBACK CONTROLLED 


The view we take on behavior begins with the concept of goal 
and the process of feedback control, ideas we see as inti- 
mately linked. Our focus on goals is in line with a growing re- 
emergence of goal constructs in personality psychology (e.g., 
Austin & Vancouver, 1996; Elliott & Dweck, 1988; Miller & 
Read, 1987; Pervin, 1989), constructs known by a variety 
of labels such as current concern (Klinger, 1975, 1977), 
personal strivings (Emmons, 1986), life task (Cantor & 
Kihlstrom, 1987), and personal project (Little, 1983). The 
goal construct is at its core very simple. Yet these theories all 
emphasize that it has room for great diversity and individual- 
ization. For example, any life task can be achieved in diverse 


ways. People presumably choose paths for achieving a given 
life task that are compatible with other aspects of their life sit- 
uation (e.g., many concerns must usually be managed simul- 
taneously) and with other aspects of their personality. 

Two goal constructs that differ somewhat from those 
named thus far are the possible self (Markus & Nurius, 1986) 
and the self-guide (Higgins, 1987, 1996). These constructs 
were intended to bring a dynamic quality to conceptualization 
of the self-concept. In contrast to traditional views, but 
consistent with other goal frameworks, possible selves are 
future oriented. They concern how people think of their as- 
yet-unrealized potential, the kind of people they might be- 
come. Self-guides similarly reflect dynamic aspects of the 
self-concept. 

Despite differences among these various constructs (see 
Austin & Vancouver, 1996; Carver & Scheier, 1998), they are 
the same in many ways. All include the idea that goals ener- 
gize and direct activities; all implicitly convey the sense that 
goals give meaning to people’s lives (cf. Baumeister, 1989). 
Each theory emphasizes the idea that understanding the per- 
son means in part understanding the person’s goals. Indeed, 
the view represented by these theories often implies that the 
self consists partly of the person’s goals and the organization 
among them. 


Feedback Processes 


How are goals used in behaving? We believe that goals serve 
as reference values for feedback loops (Wiener, 1948). A 
feedback loop, the unit of cybernetic control, is a system of 
four elements in a particular organization (cf. MacKay, 1956; 
Miller, Galanter, & Pribram, 1960). The elements are an 
input function, a reference value, acomparator, and an output 
function (see Figure 8.1). 


Goal, 
standard, 
reference value 


Comparator 


Output function 


Input function 


Effect on 
environment 


Figure 8.1 Schematic depiction of a feedback loop, the basic unit of 
cybernetic control. In such a loop a sensed value is compared to a reference 
value or standard, and adjustments are made in an output function (if neces- 
sary) to shift the sensed value in the appropriate direction. 


An input function is a sensor. Think of it as perception. The 
reference value is a bit of information specified from within 
the system. Think of it as a goal. A comparator is something 
that makes continuous or repeated comparisons between the 
input and the reference value. The comparison yields one of 
two outcomes: values being compared either are or are not dis- 
criminably different from one another. Following the compar- 
ison is an output function. Think of this as behavior (although 
the behavior sometimes is internal). If the comparison yielded 
“no difference,” the output function remains whatever it was. 
If the comparison yielded “discrepancy,” the output changes. 

There are two kinds of feedback loops, corresponding to 
two kinds of goals. In a discrepancy-reducing loop (a nega- 
tive feedback loop), the output function is aimed at diminish- 
ing or eliminating any detected discrepancy between input 
and reference value. It yields conformity of input to refer- 
ence. This conformity is seen in the attempt to approach or at- 
tain a valued goal. 

The other kind of feedback loop is a discrepancy-enlarging 
loop (a positive feedback loop). The reference value here is 
not one to approach, but one to avoid. Think of it as an “anti- 
goal.” An example is a feared possible self. Other examples 
would be traffic tickets, public ridicule, and the experience of 
being fired from your job. This loop senses present condi- 
tions, compares them to the anti-goal, and tries to enlarge the 
discrepancy. For example, a rebellious adolescent who wants 
to be different from his parents senses his own behavior, com- 
pares it to his parents’ behavior, and tries to make his own 
behavior as different from theirs as possible. 

The action of discrepancy-enlarging processes in living 
systems is typically constrained in some way by discrepancy- 
reducing loops (Figure 8.2). To put it differently, avoidance 
behaviors often lead into approach behaviors that are com- 
patible with the avoidance. An avoidance loop creates pres- 
sure to increase distance from the anti-goal. The movement 
away occurs until it is captured by the influence of an ap- 
proach loop. This loop then serves to pull the sensed input 
into its orbit. The rebellious adolescent, trying to be different 
from his parents, soon finds other adolescents to conform to, 
all of whom are actively deviating from their parents. 

Our use of the word orbit in the last paragraph suggests a 
metaphor that may be useful for those to whom these con- 
cepts do not feel very intuitive. You might think of feedback 
processes as metaphorically equivalent to gravity and anti- 
gravity. The discrepancy-reducing loop exerts a kind of grav- 
itational pull on the input it is controlling, pulling that input 
closer to its ground zero. The discrepancy-enlarging loop has 
a kind of antigravitational push, moving sensed values ever 
farther away. Remember, though, that this is a metaphor. 
More is involved here than a force field. 
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Figure 8.2 The effects of discrepancy-enlarging feedback systems are often 
constrained by discrepancy-reducing feedback systems. A value moves away 
from an undesired condition in an avoidance loop and then comes under the 
influence of an approach loop, moving toward its goal value. Source: From 
C. S. Carver and M. F. Scheier, On the Self-Regulation of Behavior, copyright 
1998, Cambridge University Press. Reprinted with permission. 


Note that situations are often more complex than the one 
in Figure 8.2 in that there often are several potential values to 
move toward. Thus, if several people try to deviate from a 
mutually disliked reference point, they may diverge from one 
another. For example, one adolescent trying to escape from 
his parents’ values may gravitate toward membership in a 
rock band, whereas another may gravitate toward the army. 
Presumably, the direction in which the person moves will de- 
pend in part on the fit between the available reference values 
and the person’s preexisting values, and in part on the direc- 
tion the person takes initially to escape from the anti-goal. 

Feedback processes have been studied for a long time in a 
variety of physical systems (cf. Wiener, 1948). With respect 
to living systems, they are commonly invoked regarding 
physiological systems, particularly those that maintain the 
equilibriums that sustain life. We all know of the existence of 
homeostatic systems that regulate, for example, temperature 
and blood pressure. It is a bit of a stretch to go from homeo- 
static maintenance processes to intentional behavior, but the 
stretch is not as great as some might think (see Miller et al., 
1960; MacKay, 1956; Powers, 1973). 

One key to this extrapolation is the realization that refer- 
ence values for feedback loops need not be static. They can 
change gradually over time, and one can be substituted 
quickly for another. Thus, a feedback system need not be 
purely homeostatic. It can be highly dynamic—chasing (and 
avoiding) moving targets and changing targets. This is not 
too far from a description (albeit a very abstract one) of the 
events that make up human life. 

Some years ago we argued that the comparator of a psy- 
chological feedback process is engaged by self-focused 
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attention (Carver, 1979; Carver & Scheier, 1981). Indeed, the 
similarity between self-focus effects and feedback effects 
was one thing that attracted us to the feedback model in the 
first place. Self-focused attention leads to more comparisons 
with salient standards (Scheier & Carver, 1983) and to 
greater conformity to those standards. On the avoidance side, 
self-focus has led to rejection of attitudinal positions held by 
a negative reference group (Carver & Humphries, 1981) and 
to stronger reactance effects (Carver & Scheier, 1981). 

The literature of self-awareness is not the only one in 
personality-social psychology that fits well the structure of 
the feedback loop. Another good example (Carver & Scheier, 
1998) is the literature of social comparison. People use 
upward comparisons to help them pull themselves toward de- 
sired goals. People use downward comparisons to help them 
force themselves farther away from (upward from) those who 
are worse off than they are. 


Re-emergent Interest in Approach and Avoidance 


Our interest in the embodiment of these two different kinds 
of feedback processes in behavior is echoed in the recent 
emergence of interest in two modes of regulation in several 
other literatures. One of the most prominent of these litera- 
tures stems from a group of theories that are biological in 
focus. Their research base ranges from animal conditioning 
and behavioral pharmacology (Gray, 1982, 1987b) to studies 
of human brain activity (Davidson, 1992a, 1992b; Tomarken, 
Davidson, Wheeler, & Doss, 1992). These theories assume 
that two core biological systems (sometimes more) are in- 
volved in regulating behavior. 

One system, managing approach behavior, is called the be- 
havioral activation system (Cloninger, 1987; Fowles, 1980), 
behavioral approach system (Gray, 1987a, 1990), behavioral 
engagement system (Depue, Krauss, & Spoont, 1987), or 
behavioral facilitation system (Depue & Iacono, 1989). The 
other, dealing with withdrawal or avoidance, is usually called 
the behavioral inhibition system (Cloninger, 1987; Gray, 
1987a, 1990), and sometimes a withdrawal system (Davidson, 
1992a, 1992b). The two systems are generally regarded as rel- 
atively independent, with different portions of the brain being 
most involved in their functioning. 

Another literature with a dual-motive theme derives from 
self-discrepancy theory (Higgins, 1987, 1996). This theory 
holds that people relate their perceptions of their actual selves 
to several self-guides, particularly ideals and oughts. Ideals 
are qualities the person desires to embody: hopes, aspira- 
tions, positive wishes for the self. Living up to an ideal means 
attaining something desired. An ideal is clearly an approach 
goal. 


Oughts, in contrast, are defined by a sense of duty, respon- 
sibility, or obligation. An ought is a self that one feels com- 
pelled to be, rather than intrinsically desires to be. The ought 
self is a positive value, in the sense that people try to conform 
to it. However, living up to an ought also implies acting to 
avoid a punishment—self-disapproval or the disapproval of 
others. In our view, oughts are more complex structurally 
than ideals. Oughts intrinsically imply both an avoidance 
process and an approach process. Their structure thus resem- 
bles what was illustrated earlier in Figure 8.2. Recent work 
has demonstrated the avoidance aspect of the dynamics 
behind the ought self (Carver, Lawrence, & Scheier, 1999; 
Higgins & Tykocinski, 1992). 

A similar theme can be seen in the literature of self- 
determination theory (e.g., Deci & Ryan, 1985, 2000; Ryan & 
Deci, 2000). That theory focuses on the importance of having 
a sense of self-determination in one’s actions. Actions that 
are self-determined are engaged in because they are of intrin- 
sic interest or because they reflect values that are incorpo- 
rated within the self. Such behavior clearly represents a 
voluntary approach of positive goal values. In contrast to this 
is what is termed controlled behavior, meaning that the be- 
havior occurs in response to some sort of coercive force. The 
coercion can be from outside, or it can be self-coercion. An il- 
lustration of the latter is doing something because you feel 
you have to do it in order not to feel guilty. Such introjected 
values are very similar to the oughts of self-discrepancy 
theory, and we have suggested that they similarly involve 
an avoidance process along with the approach (Carver & 
Scheier, 1999a, 2000). 


HIERARCHICALITY AMONG GOALS 


Another key issue in the translation of goals into behavior re- 
flects the obvious fact that some goals are broader in scope 
than others. How to think about the difference in breadth is not 
always easy to put your finger on. Sometimes it is a difference 
in temporal commitment. Sometimes, though, it’s more than 
that: It’s a difference in the goal’s level of abstraction. 


Differentiating Goals by Levels of Abstraction 


The notion that goals differ in their level of abstraction is easy 
to illustrate. You may have the goal of being an honorable 
person or a self-sufficient person—goals at a fairly high level 
of abstraction. You may also have the goal of avoiding a per- 
son at work who gossips or of making dinner for yourself, 
which are at a lower level of abstraction. The first set con- 
cerns being a particular kind of person, whereas the second 


set concerns completing a particular kind of action. You can 
also think of goals that are even more concrete, such as the 
goal of walking quietly to your office and closing the door 
without being noticed, or the goal of slicing vegetables into a 
pan. These goals (which some would call plans or strategies) 
are closer to specifications of individual acts than was the 
second set, which consisted more of summary statements 
about the desired outcomes of intended action patterns. 

As you may have noticed, the examples used to illustrate 
concrete goals relate directly to the examples of abstract 
goals. We did this to show how abstract goals join with con- 
crete goals in a hierarchy of levels of abstraction. In 1973 
William Powers argued that a hierarchical organization of 
feedback loops underlies the self-regulation of behavior, thus 
proposing a model of hierarchicality among goals. 

His line of thought ran as follows: In a hierarchical organi- 
zation of feedback systems, the output of a high-level system 
consists of the resetting of reference values at the next-lower 
level of abstraction. To put it differently, higher order systems 
“behave” by providing goals to the systems just below them. 
The reference values are more concrete and restricted as one 
moves from higher to lower levels. Each level regulates a qual- 
ity that contributes to (though not entirely defining) the quality 
controlled at the next-higher level. Each level monitors input 
at the level of abstraction of its own functioning, and each level 
adjusts output to minimize its discrepancies. Structures at var- 
ious levels presumably handle their concerns simultaneously. 

Powers (1973) focused particularly on low levels of ab- 
straction. He said much less about the levels we’re most inter- 
ested in, though he did suggest labels for several of them. 
What he called sequences are strings of action that run off 
directly once cued. Programs, the next-higher level, are activ- 
ities involving conscious decisions at various points. Pro- 
grams are webs of sequences with an overall purpose that 
synthesizes the goals of the constituent sequences. The next 
level is principles, qualities that are abstracted from (or imple- 
mented by) programs. These are the kinds of qualities that are 
represented by trait labels. Powers gave the name system con- 
cepts to the highest level he considered. Goals there include 
the idealized sense of self, relationship, or group identity. 

A simple way of portraying this hierarchy is shown in 
Figure 8.3. This diagram omits the loops of feedback 
processes, using lines to indicate only the links among goal 
values. The lines imply that moving toward a particular lower 
goal contributes to the attainment of some higher goal (or 
even several at once). Multiple lines to a given goal indicate 
that several lower level action qualities can contribute to its 
attainment. As indicated previously, there are goals to be a 
particular way and goals to do certain things (and at lower 
levels, goals to create physical movement). 
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Figure 8.3 A hierarchy of goals (or of feedback loops). Lines indicate the 
contribution of lower level goals to specific higher level goals. They can also 
be read in the opposite direction, indicating that a given higher order goal 
specifies more-concrete goals at the next-lower level. The hierarchy de- 
scribed in text involves goals of “being” particular ways, which are attained 
by “doing” particular actions. Source: From C. S. Carver and M. F. 
Scheier, On the Self-Regulation of Behavior, copyright 1998, Cambridge 
University Press. Reprinted with permission. 


Although the Powers hierarchy per se has not been studied 
empirically, research has been done from the perspective of 
another theory that strongly resembles it—Vallacher and 
Wegner’s (1985) action identification theory. This model is 
framed in terms of how people think about their actions, but 
it also conveys the sense that how people think about their 
actions is informative about the goals by which they are 
guiding the actions. People can identify a given action in 
many different ways, and the identifications can vary in 
level of abstraction. High-level identifications are abstract, 
whereas lower level identifications are more concrete. Low- 
level identifications tend to convey a sense of how an activity 
is done, whereas high-level identifications tend to convey a 
sense of why. 

The Vallacher and Wegner (1985) model does not specify 
what qualities define various levels but simply assumes that 
where there is a potential emergent property, there is the po- 
tential for differing levels of identification. However, the ex- 
amples used to illustrate the theory tend to map onto levels of 
the Powers hierarchy: sequences of acts, programs of actions 
(with variations of smaller scale and larger scale programs), 
and principles of being. Thus, work on action identification 
tends to suggest the reasonableness of these particular levels 
of abstraction in thinking about behavior. 

Step back from this hierarchy for a moment to consider its 
broader implications. Our present interest is in linking these 
ideas to the construct of personality. It should be clear that 
this model provides a way to talk about how the values that 
are embedded in a person’s personality are manifested in 
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that person’s actions. Values are the source of intentions to 
take certain patterns of actions, and those programmatic 
action plans are realized in an extended series of sequences of 
movement. This view also provides for a mechanism by 
which the actions themselves take place, which is not typi- 
cally the case in models of personality. 


Multiple Paths to High-Level Goals, Multiple Meanings 
from Concrete Acts 


This hierarchy also has implications for several further issues 
in thinking about behavior (for more detail see Carver & 
Scheier, 1998, 1999a). In this view, goals at a given level can 
often be attained by a variety of means at lower levels. This 
addresses the fact that people sometimes shift radically the 
manner in which they try to reach a goal when the goal itself 
has not changed. This happens commonly when the emergent 
quality that is the higher order goal is implied in several 
lower order activities. For example, a person can be helpful 
by writing a donation check, picking up discards for a recy- 
cling center, volunteering at a charity, or holding a door open 
for someone else. 

Just as a given goal can be obtained via multiple path- 
ways, so can a specific act be performed in the service of di- 
verse goals. For example, you could buy someone a gift to 
make her feel good, to repay a kindness, to put her in your 
debt, or to satisfy a perceived holiday-season role. Thus, a 
given act can have strikingly different meanings depending 
on the purpose it’s intended to serve. This is an important 
subtheme of this view on behavior: Behavior can be under- 
stood only by identifying the goals to which behavior is ad- 
dressed. This is not always easy to do, either from an 
observer’s point of view (cf. Read, Druian, & Miller, 1989) 
or from the actor’s point of view. 


Goals and the Self 


Another point made by the notion of hierarchical organiza- 
tion concerns the fact that goals are not equivalent in their 
importance. The higher you go into the organization, the 
more fundamental to the overriding sense of self are the qual- 
ities encountered. Thus, goal qualities at higher levels would 
appear to be intrinsically more important than those at lower 
levels. 

Goals at a given level are not necessarily equivalent to 
one another in importance, however. In a hierarchical system 
there are at least two ways in which importance accrues to a 
goal. The more directly an action contributes to attainment 
of some highly valued goal at a more abstract level, the more 


important is that action. Second, an act that contributes to 
the attainment of several goals at once is thereby more im- 
portant than an act that contributes to the attainment of only 
one goal. 

Relative importance of goals returns us again to the concept 
of self. In contemporary theory the self-concept has several 
aspects. One is the structure of knowledge about your personal 
history; another is knowledge about who you are now. Another 
is the self-guides or images of potential selves that are used to 
guide movement from the present into the future. As stated 
earlier, a broad implication of this view is that the self— 
indeed, personality—consists partly of a person’s goals. 


FEEDBACK LOOPS AND CREATION OF AFFECT 


We turn now to another aspect of human self-regulation: 
emotion. Here we add a layer of complexity that differs 
greatly from the complexity represented by hierarchicality. 
Again, the organizing principle is feedback control. But now 
the control is over a different quality. 

What are feelings, and what makes them exist? Many have 
analyzed the information that feelings provide and situations 
in which affect arises (see, e.g., Frijda, 1986; Lazarus, 1991; 
Ortony, Clore, & Collins, 1988; Roseman, 1984; Scherer & 
Ekman, 1984). The question we address here is slightly dif- 
ferent: What is the internal mechanism by which feelings 
arise? 


Velocity Control 


We have suggested that feelings arise within the functioning 
of another feedback process (Carver & Scheier, 1990). This 
process operates simultaneously with the behavior-guiding 
process and in parallel to it. One way to describe this second 
function is to say that it is checking on how well the behavior 
loop is doing at reducing its discrepancies. Thus, the input for 
this second loop is a representation of the rate of discrepancy 
reduction in the action system over time. We focus first on 
discrepancy-reducing loops and turn later to enlarging loops. 

We find an analogy useful here. Because action implies 
change between states, think of behavior as analogous to dis- 
tance. If the action loop deals with distance, and if the affect- 
relevant loop assesses the progress of the action loop, then 
the affect loop is dealing with the psychological equivalent of 
velocity, the first derivative of distance over time. To the ex- 
tent that the analogy is meaningful, the perceptual input to 
this loop should be the first derivative over time of the input 
used by the action loop. 


This input does not in itself create affect because a given 
rate of progress has different affective consequences under 
different circumstances. As in any feedback system, this 
input is compared against a reference value (cf. Frijda, 1986, 
1988). In this case, the reference is an acceptable or desired 
rate of behavioral discrepancy reduction. As in other feed- 
back loops, the comparison checks for a deviation from the 
standard. If there is one, the output function changes. 

We suggest that the result of the comparison process in 
this loop (the error signal generated by its comparator) ap- 
pears phenomenologically in two forms. One is a nonverbal 
sense of confidence or doubt (to which we turn later). The 
other is affect, feeling, a sense of positivity or negativity. 


Research Evidence 


Because this idea is relatively novel, we should devote some 
attention to whether any evidence supports it. Initial support 
came from Hsee and Abelson (1991), who arrived indepen- 
dently at the velocity hypothesis. They conducted two studies 
of velocity and satisfaction. In one, participants read descrip- 
tions of paired hypothetical scenarios and indicated which 
they would find more satisfying. For example, they chose 
whether they would be more satisfied if their class standing 
had gone from the 30th percentile to the 70th over the past 
6 weeks, or if it had done so over the past 3 weeks. Given 
positive outcomes, they preferred improving to a high out- 
come over a constant high outcome; they preferred a fast 
velocity over a slow one; and they preferred fast small 
changes to slower larger changes. When the change was neg- 
ative (e.g., salaries decreased), they preferred a constant low 
salary to a salary that started high and fell to the same low 
level; they preferred slow falls to fast falls; and they preferred 
large slow falls to small fast falls. 

We have since conducted a study that conceptually repli- 
cates aspects of these findings but with an event that was 
personally experienced rather than hypothetical (Lawrence, 
Carver, & Scheier, in press). We manipulated success feed- 
back on an ambiguous task over an extended period. The 
patterns of feedback converged such that block 6 was iden- 
tical for all subjects at 50% correct. Subjects in a neutral 
condition had 50% on the first and last block, and 50% av- 
erage across all blocks. Others had positive change in per- 
formance, starting poorly and gradually improving. Others 
had negative change, starting well and gradually worsening. 
All rated their mood before starting and again after block 6 
(which they did not know ended the session). Those whose 
performances were improving reported mood improvement, 
whereas those whose performances were deteriorating 
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reported mood deterioration, compared to those with a con- 
stant performance. 

Another study that appears to bear on this view of affect 
was reported by Brunstein (1993). It examined subjective 
well-being among college students over the course of an 
academic term, as a function of several perceptions, includ- 
ing perception of progress toward goals. Of particular interest 
at present, perceived progress at each measurement point was 
strongly correlated with concurrent well-being. 


Cruise Control Model 


Although the theory may sound complex, the system we have 
proposed functions much the same as another device that is 
well known to many people: the cruise control on a car. If you 
are moving too slowly toward a goal, negative affect arises. 
You respond to this condition by putting more effort into your 
action, trying to speed up. If you are going faster than you 
need to, positive affect arises, and you pull back effort and 
coast. A car’s cruise control is very similar. You come to a 
hill, which slows you down. The cruise control responds by 
feeding the engine more gas to bring the speed back up. If 
you pass the crest of a hill and roll downhill too fast, the sys- 
tem pulls back on the gas, which eventually drags the speed 
back down. 

This analogy is intriguing because it concerns regulation 
of the very quality that we believe the affect system is regu- 
lating: velocity. It is also intriguing that the analogy incor- 
porates a similar asymmetry in the consequences of 
deviating from the set point. That is, both in a car’s cruise 
control and in human behavior, going too slow calls for in- 
vesting greater effort and resources. Going too fast does not. 
It calls only for pulling back on resources. That is, the cruise 
control does not apply the brakes; it just cuts back on the 
gasoline. In this way it permits the car to coast gradually 
back to its velocity set point. In the same fashion, people do 
not respond to positive affect by trying to make it go away, 
but just by easing off. 

Does positive affect actually lead people to withdraw ef- 
fort? We are not aware of data that bear unambiguously on 
the question. To do so, a study must assess coasting with re- 
spect to the same goal as lies behind the affect. Many stud- 
ies that might otherwise be seen as relevant to the question 
created positive affect in one context and assessed its impact 
on another task (see, e.g., Isen, 2000). The question thus 
seems to remain open, and to represent an important area for 
future work (for broader discussion of relevant issues see 
Carver, in press). 
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Affect from Discrepancy-Enlarging Loops 


Thus far we have restricted ourselves to issues that arise in the 
context of approach. Now we turn to attempts to avoid a point 
of comparison, attempts to not-be or not-do: discrepancy- 
enlarging loops. 

Our earlier discussion should have made it clear that be- 
havior regarding avoidance goals is just as intelligible as 
behavior regarding approach goals. We think the same is true 
of the affective accompaniments to behavior. Our model 
rests on the idea that positive affect comes when a behav- 
ioral system is doing well at what it is organized to do. Thus 
far we have considered only systems organized to close 
discrepancies. There seems no obvious reason, however, 
why the principle should not apply just as well to systems 
organized to enlarge discrepancies. If the system is doing 
well at what it is organized to do, positive affect should 
arise. If it is doing poorly at what it is organized to do, neg- 
ative affect should arise. 

That much would seem to be fully comparable across the 
two types of systems. But doing well at moving toward an in- 
centive is not exactly the same experience as doing well at 
moving away from a threat. Both have the potential to induce 
positive feelings, by doing well. Both also have the potential 
to induce negative feelings, by doing poorly. Yet the two pos- 
itives may not be quite the same as each other, nor the nega- 
tives quite the same as each other. 

Our view of this difference derives partly from the insights 
of Higgins and his colleagues (Higgins, 1987, 1996). Follow- 
ing their lead, we suggest that the affect dimension relating to 
discrepancy reduction is (in its purest form) the dimension 
that runs from depression to elation (Figure 8.4). The affect 
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Figure 8.4 Two affect-creating systems and the affective dimensions we 
believe arise from the functioning of each. Discrepancy-reducing systems 
are presumed to yield affective qualities of sadness or depression when 
progress is well below standard and happiness or elation when progress is 
above standard. Discrepancy-enlarging systems are presumed to yield anxi- 
ety when progress is below standard and relief or contentment when progress 
is above standard. Source: From C. S. Carver and M. F. Scheier, On the 
Self-Regulation of Behavior, copyright 1998, Cambridge University Press. 
Reprinted with permission. 


that relates to discrepancy enlargement is (in its purest form) 
the dimension from anxiety to relief or contentment. As 
Higgins and his colleagues have noted, dejection-related 
and agitation-related affect may take several forms, but these 
two dimensions capture the core qualities behind those two 
classes of affect. Similarly, Roseman (1984) has argued that 
joy and sadness are related to appetitive (moving-toward) 
motives, whereas relief and distress are related to aversive 
(moving-away-from) motives. 


Merging Affect and Action 


Theories about emotion typically emphasize the idea that 
emotion is related to action. How do affect and action relate 
in this model? We see the regulation provided by these sys- 
tems as forming a two-layered array, with both simultane- 
ously at work (Carver & Scheier, 1998, 1999a, 1999b). The 
two layers are analogous to position and velocity controls in 
a two-layered engineering control system (e.g., Clark, 1996). 
Such a two-layered system in engineering has the quality of 
responding both quickly and accurately (without undue oscil- 
lation). There is reason to believe that the simultaneous func- 
tioning of the two layers has the same broad consequence for 
human behavior. 

Another way of addressing the relation between affect and 
action is to ask about the nature of the output of the affect 
loop. Earlier we described affect as reflecting the error signal 
of a loop that has as input a perception of rate of progress. 
The resulting output thus must be an adjustment in rate of 
progress. This output therefore has a direct link to behavior 
because it means changing its pace. 

What does it mean to adjust the rate of progress? In some 
cases it means literally changing velocity. If you are behind, 
go faster. Some adjustments are less straightforward. The 
rates of many behaviors in which personality-social psy- 
chologists are interested are not defined in terms of literal 
pace of motion. Rather, they are defined in terms of choices 
among actions, even potential programs of action. For ex- 
ample, increasing your rate of progress on a reading assign- 
ment may mean choosing to spend a weekend working 
rather than playing. Increasing your rate of manifestation of 
kindness means choosing to perform an action that reflects 
that value. Thus, adjustment in rate must often be translated 
into other terms, such as concentration or reallocation of 
time and effort. 

Despite this complexity in implementing changes in rate, 
it should be apparent from this description that the action sys- 
tem and the velocity system are presumed to work in concert 
with one another. Both are involved in the flow of action. 
They influence different aspects of the action, but both are 


always involved. Thus, this view incorporates clear links be- 
tween behavior and affect. 


Comparison with Biological Models of Bases of Affect 


It is useful to compare this model with the group of biolog- 
ically focused theories mentioned earlier in the chapter. As 
indicated earlier, those theories assume that two separate 
systems regulate approach and avoidance behavior. Many as- 
sume further that the two systems also underlie affect. Given 
cues of impending reward, the activity of the approach sys- 
tem creates positive feelings. Given cues of impending pun- 
ishment, the avoidance system creates feelings of anxiety. 

Data from a variety of sources fit this picture. Of particular 
relevance is work by Davidson and collaborators involving 
electroencephalography (EEG) recordings assessing changes 
in cortical activation in response to affective inducing stimuli. 
Among the findings are these: Subjects exposed to films in- 
ducing fear and disgust (Davidson, Ekman, Saron, Senulis, & 
Friesen, 1990) and confronted with possible punishment 
(Sobotka, Davidson, & Senulis, 1992) show elevations in 
right frontal activation. In contrast, subjects with a chance to 
obtain reward (Sobota et al., 1992), subjects presented with 
positive emotional adjectives (Cacioppo & Petty, 1980), 
and smiling 10-month olds viewing their approaching moth- 
ers (Fox & Davidson, 1988) show elevations in left frontal 
activation. From findings such as these, Davidson (1992a, 
1992b) concluded that neural substrates for approach and 
withdrawal systems (and thus positive and negative affect) 
are located in the left and right frontal areas of the cortex, 
respectively. 

Thus far the logic of the biological models resembles the 
logic of our model. At this point, however, there is a diver- 
gence. The key question is what regulatory processes are 
involved in—and what affects result from—failure to attain 
reward and failure to receive punishment. Gray (1987b, 
1990) holds that the avoidance system is engaged by cues of 
punishment and cues of frustrative nonreward. It thus is re- 
sponsible for negative feelings in response to either of these 
types of cues. Similarly, Gray holds that the approach system 
is engaged by cues of reward or cues of escape from (or 
avoidance of) punishment. It thus is responsible for positive 
feelings in response to either of these types of cues. In his 
view, then, each system creates affect of one hedonic tone 
(positive in one case, negative in the other), regardless of its 
source. This view is consistent with a picture of two unipolar 
affective dimensions, each linked to a distinct behavioral sys- 
tem. Others have taken a similar position (see Cacioppo, 
Gardner, & Berntson, 1999; Lang, 1995; Lang, Bradley, & 
Cuthbert, 1990; Watson, Wiese, Vaidya, & Tellegen, 1999). 
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Our position is different. We argue that both approach 
and avoidance systems can create affects of both hedonic 
tones because affect is a product of doing well or doing 
poorly. We think that the frustration and eventual depression 
that result from failure to attain desired goals involve the ap- 
proach system (for similar predictions see Clark, Watson, & 
Mineka, 1994, p. 107; Cloninger, 1988, p. 103; Henriques & 
Davidson, 1991). A parallel line of reasoning suggests that 
relief, contentment, tranquility, and serenity relate to the 
avoidance system rather than to the approach system (see 
Carver, 2001). 

Less information exists about the bases of these affects 
than about anxiety and happiness. Consider first relief- 
tranquility. We know of two sources of evidence, both some- 
what indirect. The first is a study in which people worked at 
a laboratory task and experienced either goal attainment or 
lack of attainment (Higgins, Shah, & Friedman, 1997, Study 
4). Participants first were given either an approach orienta- 
tion to the task (to try to attain success) or an avoidance ori- 
entation (to try to avoid failing). After the task outcome 
(which was manipulated), several feeling qualities were as- 
sessed. Among persons given an avoidance orientation, suc- 
cess caused an elevation in calmness, and failure caused an 
elevation in anxiety. These effects on calmness and anxiety 
did not occur, however, among those who had an approach 
orientation. This pattern suggests that calmness is linked to 
doing well at avoidance, rather than doing well at approach. 

Another source is data reported many years ago by Watson 
and Tellegen (1985). In their analysis of multiple samples 
of mood data, they reported “calm” to be one of the 10 best 
markers (inversely) of negative affect (which was defined 
mostly by anxiety) in the majority of the data sets they exam- 
ined. In contrast, “calm” never emerged as one of the top 
markers of positive affect in those data sets. This suggests 
that these feelings are linked to the functioning of a system of 
avoidance. 

The same sources also provide information on the mo- 
mentary experience of sadness. In the study by Higgins et al. 
(1997), failure elevated sadness and success elevated cheer- 
fulness among persons with an approach orientation. These 
effects did not occur, however, among participants who had 
an avoidance orientation. The pattern suggests that sadness is 
linked to doing poorly at approach, rather than doing poorly 
at avoidance. Similarly, Watson and Tellegen (1985) reported 
“sad” to be one of the 10 best markers (inversely) of the fac- 
tor that they called positive affect in the majority of the data 
sets they examined. In contrast, “sad” never emerged as one 
of the top markers of negative affect in those data sets. This 
pattern suggests that sad feelings are linked to the functioning 
of a system of approach. 
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This issue clearly represents an important difference 
among theoretical viewpoints (Carver, 2001). Just as clearly, 
it is not yet resolved. It seems likely that it will receive more 
attention in the near future. 


RESPONDING TO ADVERSITY: PERSISTENCE 
AND GIVING UP 


In describing the genesis of affect, we suggested that one 
process yields two subjective experiences as readouts: affect 
and a sense of confidence versus doubt. We turn now to con- 
fidence and doubt—expectancies for the immediate future. 
We focus here on the behavioral and cognitive manifestations 
of the sense of confidence or doubt. 

One likely consequence of momentary doubt is a search 
for more information. We have often suggested that when 
people experience adversity in trying to move toward goals, 
they periodically interrupt efforts in order to assess in a more 
deliberative way the likelihood of a successful outcome (e.g., 
Carver & Scheier, 1981, 1990, 1998). In effect, people sus- 
pend the behavioral stream, step outside it, and evaluate in a 
more deliberated way. This may happen once or often. It may 
be brief, or it may take a long time. In this assessment people 
presumably depend heavily on memories of prior outcomes 
in similar situations. They may also consider such things as 
additional resources they might bring to bear, alternative ap- 
proaches that might be taken, and social comparison infor- 
mation (Wills, 1981; Wood, 1989). 

These thoughts sometimes influence the expectancies that 
people hold. When people retrieve “chronic” expectancies 
from memory, the information already is expectancies— 
summaries of the products of previous behavior. In some 
cases, however, the process is more complex. People bring to 
mind possibilities for changing the situation and evaluate 
their consequences. This is often done by briefly playing the 
possibility through mentally as a behavioral scenario (cf. 
Taylor & Pham, 1996). Doing so can lead to conclusions that 
influence expectancies (“If I try doing it this way instead of 
that way, it should work better” or “This is the only thing I 
can see to do, and it will just make the situation worse”). 

It seems reasonable that this mental simulation engages 
the same mechanism as handles the affect-creation process 
during actual overt behavior. When your progress is tem- 
porarily stalled, playing through a confident and optimistic 
scenario yields a higher rate of progress than is currently 
being experienced. The affect loop thus yields a more opti- 
mistic outcome assessment than is being derived from 
current action. If the scenario is negative and hopeless, it 
indicates a further reduction in progress, and the loop yields 
further doubt. 


Behavioral Manifestations 


Whether stemming from the immediate flow of experience 
or from a more thorough introspection, people’s expectan- 
cies are reflected in their behavior. If people expect a suc- 
cessful outcome, they continue exerting effort toward the 
goal. If doubts are strong enough, the result is an impetus 
to disengage from effort, and potentially from the goal it- 
self (Carver & Scheier, 1981, 1990, 1998, 1999a; see also 
Klinger, 1975; Kukla, 1972; Wortman & Brehm, 1975). This 
theme—divergence in behavioral response as a function of 
expectancies—is an important one, applying to a surpris- 
ingly broad range of literatures (see Carver & Scheier, 1998, 
chap. 11). 

Sometimes the disengagement that follows from doubt is 
overt, but sometimes disengagement takes the form of mental 
disengagement—off-task thinking, daydreaming, and so on. 
Although this can sometimes be useful (self-distraction from 
a feared stimulus may allow anxiety to abate), it can also cre- 
ate problems. Under time pressure, mental disengagement 
can impair performance, as time is spent on task-irrelevant 
thoughts. Consistent with this, interactions between self- 
focus and expectancies have been shown for measures of 
performance (Carver, Peterson, Follansbee, & Scheier, 1983; 
Carver & Scheier, 1982). 

Often, mental disengagement cannot be sustained, as situ- 
ational cues force the person to reconfront the problematic 
goal. In such cases, the result is a phenomenology of repeti- 
tive negative rumination, which often focuses on self-doubt 
and perceptions of inadequacy. This cycle is both unpleasant 
and performance-impairing. 


Is Disengagement Good or Bad? 


Is the disengagement tendency good or bad? Both and nei- 
ther. On the one hand, disengagement (at some level, at least) 
is an absolute necessity. Disengagement is a natural and in- 
dispensable part of self-regulation (cf. Klinger, 1975). If peo- 
ple are ever to turn away from unattainable goals, to back out 
of blind alleys, they must be able to disengage, to give up and 
start over somewhere else. 

The importance of disengagement is particularly obvious 
with regard to concrete, low-level goals: People must be able to 
remove themselves from literal blind alleys and wrong streets, 
give up plans that have become disrupted by unexpected 
events, even spend the night in the wrong city if they miss the 
last plane home. Disengagement is also important, however, 
with regard to more abstract and higher level goals. A vast lit- 
erature attests to the importance of disengaging and moving 
on with life after the loss of close relationships (e.g., Orbuch, 
1992; Stroebe, Stroebe, & Hansson, 1993; Weiss, 1988). 


People sometimes must be willing to give up even values that 
are deeply embedded in the self if those values create too much 
conflict and distress in their lives. 

However, the choice between continued effort and giving 
up presents opportunities for things to go awry. It is possible 
to stop trying too soon, thereby creating potentially serious 
problems for oneself (Carver & Scheier, 1998). It is also pos- 
sible to hold on to goals too long, thereby preventing oneself 
from taking adaptive steps toward new goals. But both con- 
tinued effort and giving up are necessary parts of the experi- 
ence of adaptive self-regulation. Each plays an important role 
in the flow of behavior. 


Hierarchicality and Importance 
Can Impede Disengagement 


Disengagement is sometimes precluded by situational con- 
straints. However, a broader aspect of this problem stems 
from the idea that behavior is hierarchically organized, with 
goals increasingly important higher in the hierarchy, and thus 
harder to disengage from. 

Presumably, disengaging from concrete values is often 
easy. Lower order goals vary, however, in how closely they 
link to values at a higher level, and thus in how important 
they are. To disengage from low-level goals that are tightly 
linked to higher level goals causes discrepancy enlarge- 
ment at the higher level. These higher order qualities are 
important, even central to one’s life. One cannot disengage 
from them, disregard them, or tolerate large discrepancies 
between them and current reality without reorganizing one’s 
value system (Greenwald, 1980; Kelly, 1955; McIntosh & 
Martin, 1992; Millar, Tesser, & Millar, 1988). In such a case, 
disengagement from even very concrete behavioral goals can 
be quite difficult. 

Now recall again the affective consequences of being in 
this situation. The desire to disengage was prompted by unfa- 
vorable expectancies. These expectancies are paralleled by 
negative affect. In this situation, then, the person experiences 
negative feelings (because of an inability to make progress 
toward the goal) and is unable to do anything about the feel- 
ings (because of an inability to give up). The person simply 
stews in the feelings that arise from irreconcilable discrepan- 
cies. This kind of situation—commitment to unattainable 
goals—seems a sure prescription for distress. 


Watersheds, Disjunctions, and Bifurcations 
Among Responses 


An issue that bears some further mention is the divergence in 
the model of the behavioral and cognitive responses to favor- 
able versus unfavorable expectancies. We have long argued 
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for a psychological watershed among responses to adversity 
(Carver & Scheier, 1981). One set of responses consists of 
continued comparisons between present state and goal, and 
continued efforts. The other set consists of disengagement 
from comparisons and quitting. Just as rainwater falling on a 
mountain ridge ultimately flows to one side of the ridge or the 
other, so do behaviors ultimately flow to one of these sets or 
the other. 

Our initial reason for taking this position stemmed largely 
from several demonstrations that self-focused attention cre- 
ates diverging effects on information seeking and behavior as 
a function of expectancies of success. We are not the only ones 
to have emphasized a disjunction among responses, however. 
A number of others have done so, for reasons of their own. 

Kukla (1972) proposed an early model that emphasized 
the idea of a disjunction in behavior. Another such model 
is the reactance-helplessness integration of Wortman and 
Brehm (1975): the argument that threats to control produce 
attempts to regain control and that perceptions of loss of 
control produce helplessness. Brehm and his collaborators 
(Brehm & Self, 1989; Wright & Brehm, 1989) developed an 
approach to task engagement that resembles that of Kukla 
(1972), but their way of approaching the description of the 
problem is somewhat different. Not all theories about persis- 
tence and giving up yield this dichotomy among responses. 
The fact that some do, however, is interesting. It becomes 
more so a bit later on. 


SCALING BACK ASPIRATIONS AND 
RECALIBRATION OF THE AFFECT SYSTEM 


The preceding sections dealt with the creation of affect and 
confidence and the concomitant effects on behavior. By im- 
plication, the time frames under discussion were quite narrow. 
In this section we broaden our view somewhat and indicate an 
important way in which reference values change across longer 
periods of time. These particular changes are changes in the 
stringency of the goals being sought after. We consider this 
issue both with respect to the reference values underlying the 
creation of affect and with respect to the goals of behavior. 


Shifts in Velocity Standards 


Reference values used by the affect system presumably can 
shift through time and experience. That is, as people accumu- 
late experience in a given domain, adjustments can occur in 
the pacing that they expect and demand of themselves. There 
is a recentering of the system around the past experience, 
which occurs via shifts in the reference value (Carver & 
Scheier, 2000). 
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Consider first upward adjustments. As an example, a per- 
son who gains work-related skills often undertakes greater 
challenges, requiring quicker handling of action units. Up- 
ward adjustment of the rate standard means that the person 
now will be satisfied only with faster performance. Such a 
shift has the side effect of decreasing the potential for posi- 
tive affect and increasing the potential for negative affect be- 
cause there now is more room to fail to reach the rate 
standard and less room to exceed it. Recall, however, that the 
shift was induced by a gain in skills. The change in skill tends 
to counter the shift in regions of potential success and failure. 
Thus, the likelihood of negative affect (vs. positive affect or 
no affect) remains fairly constant. 

Now consider a downward adjustment. For example, a 
person whose health is failing may find that it takes longer to 
get things done than it used to. This person will gradually 
come to use less stringent rate standards. A lower pace will 
then begin to be more satisfying. One consequence of this 
downward shift of standard is to increase the potential for ex- 
periencing positive affect and to decrease the potential for 
negative affect because there now is less room for failing to 
reach the rate standard and more room for exceeding it. The 
failing health, however, tends to counter the shift in regions 
of potential success and failure. Again, then, the net result is 
that the likelihood of negative affect (vs. positive and neutral) 
remains fairly constant. 


Mechanism of Shift 


Such changes in comparison value do not happen quickly or 
abruptly. Shifting the reference value downward is not peo- 
ple’s first response when they have trouble maintaining a de- 
manding pace. First, they try harder to keep up. Only more 
gradually, if they continue to lag behind, does the rate-related 
standard shift to accommodate. Similarly, the immediate re- 
sponse when people’s pace exceeds the standard is not an up- 
ward shift in reference value. The more typical response is to 
coast for a while. Only when the overshoot is frequent does 
the standard shift upward. 

We believe that adjustments in these standards occur 
automatically and involuntarily, but slowly. Such adjust- 
ments themselves appear to reflect a self-corrective feedback 
process (Figure 8.5). This feedback process is slower than the 
ones focused on thus far, involving a very gradually accumu- 
lating shift. It resembles what Solomon (1980; Solomon & 
Corbit, 1974) described as the long-term consequences of an 
opponent process system (see also Helson, 1964, regarding 
the concept of adaptation level). 

As an illustration, assume for the moment that a signal to 
adjust the standard occurred every time there was a signal to 
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Figure 8.5 A feedback loop (in this case, the postulated velocity loop) acts 
to create change in the input function, to shift it toward the reference value. 
Sometimes an additional process is in place as well (gray lines), which ad- 
justs the reference value in the direction of the input. This additional process 
is presumed to be weaker or slower; thus, the reference value is stable rela- 
tive to the input value. Source: From C. S. Carver and M. F. Scheier, On 
the Self-Regulation of Behavior, copyright 1998, Cambridge University 
Press. Reprinted with permission. 


change output, but that the former was much weaker than the 
latter—say, 5% of the latter. If so, it would take a fairly long 
time for the standard to change. Indeed, as long as the person 
deviated from the standard in both directions (under and 
over) with comparable frequency, the standard would never 
change noticeably, even over an extended period. Only with 
repeated deviation in the same direction could there be an ap- 
preciable effect on the standard. 

This view has an interesting implication for affective ex- 
perience across an extended period. Such shifts in reference 
value (and the resultant effects on affect) would imply a 
mechanism within the organism that prevents both the too- 
frequent occurrence of positive feeling and the too-frequent 
occurrence of negative feeling. That is, the (bidirectional) 
shifting of the rate criterion over time would tend to control 
pacing such that affect continues to vary in both directions 
around neutral, roughly as before. The person thus would 
experience more or less the same range of variation in affec- 
tive experience over long times and changing circumstances 
(see Myers & Diener, 1995, for evidence of this). The organi- 
zation would function as a gyroscope serving to keep people 
floating along within the framework of the affective reality 
with which they are familiar. It would provide for a continu- 
ous recalibration of the feeling system across changes in sit- 
uation. It would repeatedly shift the balance point of a 
psychic teeter-totter so that rocking both up and down re- 
mains possible. 


Scaling Back on Behavioral Goals 


The principle of gradual adjustment of a standard also oper- 
ates at the level of behavioral goals (Carver & Scheier, 1981, 
1998). Sometimes progress is going poorly, expectancies of 
success are dim, and the person wants to quit. Rather than quit 
altogether, the person trades this goal for a less demanding 
one. This is a kind of limited disengagement in the sense that 
the person is giving up the first goal while adopting the lesser 
one. However, this limited disengagement keeps the person 
engaged in activity in the domain he or she had wanted to 
quit. By scaling back the goal—giving up in a small way— 
the person keeps trying to move ahead—thus not giving up, in 
a larger way. 

Small-scale disengagement occurs often in the context of 
moving forward in broader ways. A particularly poignant ex- 
ample comes from research on couples in which one partner 
is becoming ill and dying from AIDS (Moskowitz, Folkman, 
Collette, & Vittinghoff, 1996). Some healthy participants ini- 
tially had the goal of overcoming their partner’s illness and 
continuing active lives together. As the illness progressed and 
it became apparent that that goal would not be met, it was not 
uncommon for the healthy partners to scale back their aspira- 
tions. Now the goal was, for example, to do more limited ac- 
tivities during the course of a day. Choosing a more limited 
and manageable goal ensures that it will be possible to move 
toward it successfully. The result was that even in those diffi- 
cult circumstances the person experienced more success than 
would otherwise have been the case and remained engaged 
behaviorally with efforts to move forward. 

How does the scaling back of goals within a domain 
occur? We believe that the answer is the same as in the case 
of affect: If the loop’s output function is inadequate at mov- 
ing the input toward the standard, a second (slower-acting) 
process moves the standard toward the input. The scaling 
back of behavioral goals thus would involve the same struc- 
tural elements as are involved in the recalibration of the af- 
fect system. 


CONFLICT AND RESTRAINT 


In thinking about the self-regulation of behavior, another set 
of issues to be considered concerns the existence of conflict. 
Conflict arises whenever two incompatible goals are held si- 
multaneously and both are salient (see also Carver & Scheier, 
1998, 1999b). It sometimes is possible to move toward two 
goals simultaneously, but sometimes moving toward one 
interferes with one’s ability to move toward the other. For 
example, the woman who wants to develop her career and 
also spend time with her family faces a conflict imposed by 
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Ideal self 


Work extra | | Have family 

hours | picnic | 

Figure 8.6 Conflict arises when two desired goals are incompatible for 
some reason. For example, by working extra hours in order to further a ca- 
reer aspiration, this woman may at the same time be having an adverse in- 
fluence (indicated by the opposite-direction arrow) on another goal that is 
also related to her ideal self—spending time maintaining a sense of related- 
ness with her family. 


the limited number of hours in the day and days in the week 
(Figure 8.6). The effort to attain one (e.g., further the career 
by working extra hours) can interfere with efforts to attain the 
other (by removing the time available for family activities). 

Given this structure, the experience of conflict naturally 
produces negative feelings, as movement toward one of the 
goals is impeded. If movement toward the active goal is rapid 
(relative to the reference velocity) as movement toward the 
other goal is stifled, the person may have mixed feelings, 
feelings relating to each of the two goal values. It is no sur- 
prise that people typically try to balance their conflicting de- 
sires so that both goals are partly attained. It is also no 
surprise that this strategy often feels unsatisfying, as the per- 
son “almost” keeps up with goals in both domains but keeps 
up fully with neither of them. 

Often there is no structural basis for viewing one goal as 
intrinsically more valuable than the other (as in Figure 8.6). 
Sometimes, however, one goal has a kind of primacy because 
it is reflected in an explicitly formulated intention to override 
efforts to move toward the other goal. Sometimes the tenden- 
cies involved are mental; sometimes they are behavioral. 
Often, the attempt to override works for a while (sometimes 
a long while), but sometimes it fails. 


Ironic Processes in Mental Control 


One literature bearing on this theme was developed by 
Wegner (e.g., 1994) and his colleagues. The study that began 
this work was simple. Some people were told not to think of 
a white bear for 5 minutes. Then they were told to think about 
the bear. When the thought was permitted, it came more fre- 
quently than it did for people who had not had to suppress the 
thought first. Something about trying not to think of the bear 
seemed to create pressure to think of it. 

This study was followed by others. Most of this research 
looked not at rebounds, but at what goes on during people’s 
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attempts to control their thoughts. The data consistently indi- 
cate that an instruction to exert mental control yields better 
control if the person has no other demands. If something else 
is going on, however (e.g., if the person is trying to remem- 
ber a 9-digit number), the instruction backfires, and people 
tend to do the opposite of what they are trying to do. 

Wegner (1994) interprets this as follows: Trying to sup- 
press a thought engages two processes. An intentional process 
tries to suppress. An ironic monitoring process looks for 
the occurrence of whatever is being suppressed. If it finds it, it 
increases the effort of the first mechanism. The ironic monitor 
is sensitive, but it is automatic and does not require much in 
the way of mental resources. The intentional process requires 
more resources. Thus, any reduction in mental resources (e.g., 
being distracted by a second thought or task) disrupts the 
intentional process more than it disrupts the ironic monitor. 
The monitor, searching for lapses, in effect invites those 
lapses to occur. 

This theory also applies to the opposite pattern—attempts 
to concentrate. In this case, the intentional process con- 
centrates, and the ironic process looks for the occurrence 
of distractions. As in the first case, if the person’s mental 
resources are stretched thin, the ironic process seems to invite 
the undesired thought into consciousness. In this case, the 
thought is a distraction. 

This research indicates that trying hard to do something (or 
suppress something) gets much harder when your mental re- 
sources are stretched thin. Not only does it get harder, but you 
may even begin to do the opposite of what you are trying to do. 


Lapses in Self-Control 


Another important literature bearing on this set of issues con- 
cerns what Baumeister and Heatherton (1996) termed self- 
regulatory failure, which we will term lapse in self-control. 
The potential for this kind of event arises when someone has 
both the desire to do something (e.g., overindulge in food or 
drink) and also the desire to restrain that impulse. Self-control 
of this sort is often especially hard, and sometimes the re- 
strained impulse breaks free. 

Consider binge eating as an example. The binge eater wants 
to eat but also wants to restrain that desire. If self-control 
lapses, the person stops trying to restrain the desire to eat, lets 
himself or herself go, and binges. 

In characterizing the decision to quit trying to restrain, 
Baumeister and Heatherton noted that restraint is hard work 
and that mental fatigue plays a role; however, giving up the 
restraint attempt rarely requires that the person reach a state 
of total exhaustion. Rather, there is a point where the person 


has had enough and stops trying to control the impulse. We 
have suggested that confidence about resisting the impulse 
plays a role in whether the person stops trying (Carver & 
Scheier, 1998). The confident person continues the struggle 
to restrain. The person whose confidence has sagged is more 
likely to give up. 

Muraven, Tice, and Baumeister (1998) have extended this 
line of thought to argue that self-control is a resource that not 
only is limited but also can become depleted by extended 
self-control efforts. When the resource is depleted, the person 
becomes vulnerable to a failure of self-control. This view 
also suggests that there is a shared pool of self-control 
resources, so that exhausting the resource with one kind of 
self-control (e.g., concentrating very hard for many hours on 
a writing assignment) can leave the person vulnerable to a 
lapse in a different domain (e.g., eating restraint). 

It seems worthwhile to compare the cases considered in 
this section (lapses in self-control) with those described just 
earlier (mental control). Both sections dealt with efforts at 
self-control. In many ways the situations are structurally quite 
similar. Each is an attempt to override one process by another, 
which falters when mental resources are depleted. There even 
is a resemblance between the “overdoing” quality in the pre- 
viously restrained behavior in Baumeister and Heatherton’s 
cases and the rebound quality in Wegner’s research. 

One difference is that the cases emphasized by Baumeis- 
ter and Heatherton explicitly involve desires that direct the 
person in opposing directions. In most cases studied by 
Wegner, there is no obvious reason why the suppressed 
thought (or the distractor) would be desirable. This difference 
between cases seems far from trivial. Yet the similarities in 
the findings in the two literatures are striking enough to war- 
rant further thought about how the literatures are related. 


DYNAMIC SYSTEMS AND SELF-REGULATION 


Recent years have seen the emergence in the psychological lit- 
erature of new (or at least newly prominent) ideas about how 
to conceptualize natural systems. Several labels attach to these 
ideas: chaos, dynamic systems theory, complexity, catastro- 
phe theory. A number of introductions to this body of thought 
have been written, some of which include applications to 
psychology (e.g., Brown, 1995; Gleick, 1987; Thelen & 
Smith, 1994; Vallacher & Nowak, 1994, 1997; Waldrop, 
1992). These themes are of growing interest in several areas of 
psychology, including personality—social psychology. In this 
section we sketch some of the themes that are central to this 
way of thinking. 


Nonlinearity 


Dynamic systems theory holds that the behavior of a system 
reflects all the forces operating on (and within) it. It also 
emphasizes that the behavior of a complex system over any 
period but a brief one is very hard to predict. One reason for 
this is that the system’s behavior may be influenced by these 
forces in nonlinear ways. Thus, the behavior of the system— 
even though highly determined—can appear random. 

Many people are used to thinking of relationships be- 
tween variables as linear. But some relationships clearly are 
not. Familiar examples of nonlinear relationships are step 
functions (ice turning to water and water turning to steam as 
temperature increases), threshold functions, and floor and 
ceiling effects. Other examples of nonlinearity are interac- 
tions. In an interaction the effect of one predictor on the out- 
come differs as a function of the level of a second predictor. 
Thus the effect of the first predictor on the outcome is not 
linear. 

Many personality psychologists think in terms of interac- 
tions much of the time. Threshold effects and interactions are 
nonlinearities that most of us take for granted, though per- 
haps not labeling them as such. Looking intentionally for 
nonlinearities, however, reveals others. For example, many 
psychologists now think that many developmental changes 
are dynamic rather than linear (Goldin-Meadow & Alibali, 
1995; Ruble, 1994; Siegler & Jenkins, 1989; Thelen, 1992, 
1995; van der Maas & Molenaar, 1992). 


Sensitive Dependence on Initial Conditions 


Nonlinearity is one reason for the difficulty in predicting 
complex systems. Two more reasons why prediction over any 
but the short term is difficult is that you never know all the in- 
fluences on a system, and the ones you do know you never 
know with total precision. What you think is going on may 
not be quite what’s going on. That difference, even if it is 
small, can be very important. 

This theme is identified with the phrase sensitive depen- 
dence on initial conditions. This means that a very small dif- 
ference between two states of affairs can lead to divergence 
and ultimately to an absence of relation between the paths 
that are taken later on. The idea is (partly) that a small initial 
difference between systems causes a difference in what they 
encounter next, which produces slightly different outcomes 
(Lorenz, 1963). Through repeated iterations, the systems di- 
verge, eventually moving on very different pathways. After a 
surprisingly brief period they no longer have any noticeable 
relation to one another. 
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How does the notion of sensitive dependence on initial 
conditions relate to human behavior? Most generally, it sug- 
gests that a person’s behavior will be hard to predict over a 
long period except in general terms. For example, although 
you might be confident that Mel usually eats lunch, you will 
not be able to predict as well what time, where, or what he 
will eat on the second Friday of next month. This does not 
mean Mel’s behavior is truly random or unlawful (cf. Epstein, 
1979). It just means that small differences between the influ- 
ences you think are affecting him and the influences that ac- 
tually exist will ruin the predictability of moment-to-moment 
behavior. 

This principle also holds for prediction of your own 
behavior. People apparently do not plan very far into the future 
most of the time (Anderson, 1990, pp. 203-205), even experts 
(Gobet & Simon, 1996). People seem to have goals in which 
the general form is sketched out but only a few steps toward it 
have been planned. Even attempts at relatively thorough plan- 
ning appear to be recursive and “opportunistic,” changing— 
sometimes drastically—when new information becomes 
known (Hayes-Roth & Hayes-Roth, 1979). 

The notion of sensitive dependence on initial conditions 
fits these tendencies. It is pointless (and maybe even counter- 
productive) to plan too far ahead too fully (cf. Kirschenbaum, 
1985), because chaotic forces in play (forces that are hard to 
predict because of nonlinearities and sensitive dependence) 
can render much of the planning irrelevant. Thus, it makes 
sense to plan in general terms, chart a few steps, get there, re- 
assess, and plan the next bits. This seems a perfect illustration 
of how people implicitly take chaos into account in their own 
lives. 


Phase Space, Attractors, and Repellers 


Another set of concepts important to dynamic-systems think- 
ing are variations on the terms phase space and attractor 
(Brown, 1995; Vallacher & Nowak, 1997). A phase diagram 
is a depiction of the behavior of a system over time. Its states 
are plotted along two (sometimes three) axes, with time dis- 
played as the progression of the line of the plot, rather than on 
an axis of its own. A phase space is the array of states that the 
system occupies across a period of time. As the system 
changes states from one moment to the next, it traces a tra- 
jectory within its phase space—a path of the successive states 
it occupies across that period. 

Phase spaces often contain regions called attractors. 
Attractors are areas that the system approaches, occupies, or 
tends toward more frequently than other areas. Attractors 
exert a metaphorical gravitational pull on the system, bringing 
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Figure 8.7 The Lorenz attractor, an example of what is known as a chaotic 
attractor or strange attractor. Source: From C. S. Carver and M. F. Scheier, 
On the Self-Regulation of Behavior, copyright 1998, Cambridge University 
Press. Reprinted with permission. 


the system into proximity to them. Each attractor has a basin, 
which is the attractor’s region of attraction. Trajectories that 
enter the basin tend to move toward that attractor (Brown, 
1995). 

There are several kinds of attractors, some very simple, 
others more complex. In a point attractor, all trajectories con- 
verge onto some point in phase space, no matter where they 
begin (e.g., body temperature). Of greater interest are chaotic 
attractors. The pattern to which this term refers is an irregular 
and unpredictable movement around two or more attraction 
points. An example is the Lorenz attractor (Figure 8.7), 
named for the man who first plotted it (Lorenz, 1963). It has 
two attraction zones. Plotting the behavior of this system 
over time yields a tendency to loop around both attractors, 
but to do so unpredictably. Shifts from one basin to the other 
seem random. 

The behavior of this system displays sensitivity to initial 
conditions. A small change in starting point changes the spe- 
cific path of motion entirely. The general tendencies remain 
the same—that is, the revolving around both attractors. But 
details such as the number of revolutions around one before 
deflection to the other form an entirely different pattern. The 
trajectory over many iterations shows this same sensitivity to 
small differences. As the system continues, it often nearly re- 
peats itself but never quite does, and what seem nearly iden- 
tical paths sometimes diverge abruptly, with one path leading 
to one attractor and the adjacent path leading to the other. 

A phase space also contains regions called repellers, re- 
gions that are hardly ever occupied. Indeed, these regions 
seem to be actively avoided. That is, wandering into the basin 


of a repeller leads to a rapid escape from that region of phase 
space. 


Another Way of Picturing Attractors 


The phase-space diagram gives a vivid visual sense of what 
an attractor looks and acts like. Another common depiction 
of attractors is shown in Figure 8.8. In this view, attractor 
basins are basins or valleys in a surface (more technically 
called local minima). Repellers are ridges. This view assumes 
a metaphoric “gravitational” drift downward in the diagram, 
but other forces are presumed to be operative in all directions. 
For simplicity, this portrayal usually is done in two dimen- 
sions (sometimes 3), but keep in mind that the diagram often 
assumes the merging of a large number of dimensions into 
the horizontal axis. 

The behavior of the system at a given moment is repre- 
sented as a ball on the surface. If the ball is in a valley (points 
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Figure 8.8 Another way to portray attractors. Panel A: Attractor basins as 
valleys in a surface (local minima). Behavior of the system is represented as 
a ball. If the ball is in a valley (point 1 or 2), it is in an attractor basin and will 
tend to stay there unless disturbed. If the ball is on a ridge (between 1 and 2), 
it will tend to escape its current location and move to an attractor. Panel B: A 
wider basin (1) attracts more trajectories than a narrower basin (2). A steeply 
sloping basin (2) attracts more abruptly any trajectory that enters the basin 
than does a more gradually sloping basin (1). Panel C: A system in which 
attractor 1 is very stable, and the others are less stable. It will take more 
energy to free the ball from attractor 1 than from the others. Panel D: The 
system’s behavior is energized, much as the shaking of a metaphoric tam- 
bourine surface, keeping the system’s behavior in flux and less than com- 
pletely captured by any particular attractor. Still, more shaking will be 
required to escape from attractor | than attractor 2. Source: From C. S. 
Carver and M. F. Scheier, On the Self-Regulation of Behavior, copyright 
1998, Cambridge University Press. Reprinted with permission. 


1 and 2 in panel A of Figure 8.8), it is in an attractor basin and 
will tend to stay there unless disturbed. If it is on a hill (be- 
tween | and 2), any slight movement in either direction will 
cause it to escape its current location and move to an adjacent 
attractor. 

One strength of this portrayal is that it does a good job of 
creating a sense of how attractors vary in robustness. The 
breadth of a basin indicates the diversity of trajectories in 
phase space that are drawn into it. The broader is the basin 
(B-1 in Figure 8.8), the more trajectories are drawn in. The 
narrower the basin (B-2), the closer the ball has to come to its 
focal point to be drawn to it. The steepness of the valley indi- 
cates how abruptly a trajectory is drawn into it. The steeper 
the slope of the wall (B-2), the more sudden is the entry of a 
system that encounters that basin. 

The depth of the valley indicates how firmly entrenched 
the system is, once drawn into the attractor. Figure 8.8, panel 
C, represents a system of attractors with fairly low stability 
(the valleys are shallow). One attractor represents a stable 
situation (valley 1), whereas the others are less so. It will take 
a lot more “energy” to free the ball from valley 1 than from 
the others. 

There is a sense in which both breadth and depth suggest 
that a goal is important. Breadth does so because the system 
is drawn to the attractor from widely divergent trajectories. 
Depth does so because the system that has been drawn into 
the basin tends to stay there. 

A weakness of this picture, compared to a phase-space 
portrait, is that it is not as good at giving a sense of the erratic 
motion from one attractor to another in a multiple-attractor 
system. You can regain some of that sense of erratic shifting, 
however, if you think of the surface in Figure 8.8 as a tam- 
bourine being continuously shaken (Figure 8.8, panel D). 
Even a little shaking causes the ball to bounce around in its 
well and may jostle it from one well to another, particularly 
if the attractors are not highly stable. An alternative would be 
to think of the ball as a jumping bean. These two characteri- 
zations would be analogous to jostling from situational influ- 
ences and jostling from internal dynamics, respectively. 


Goals as Attractors 


The themes of dynamic systems thinking outlined here 
have had several applications in personality-social and 
even clinical psychology (Hayes & Strauss, 1998; Mahoney, 
1991; Nowak & Vallacher, 1998; Vallacher & Nowak, 1997). 
Perhaps the easiest application of the attractor concept to 
self-regulatory models is to link it with the goal concept. In- 
deed, alert readers will have noticed that we used the same 
metaphor—gravity and antigravity—in describing both the 
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goal construct at the beginning of the chapter and in describ- 
ing the attractor concept just earlier. 

As we said at the beginning of the chapter, goals are points 
around which behavior is regulated. People spend much of 
their time doing things that keep their behavior in close prox- 
imity to their goals. It seems reasonable to suggest, then, that 
a goal represents a kind of attractor. Further, if a goal is an at- 
tractor, it seems reasonable that an antigoal would represent a 
repeller. 

This functional similarity between the goal construct and 
the attractor basin is very interesting. However, the similarity 
exists only with respect to the end product—that is, main- 
taining proximity to a value (or remaining distant from a 
value). The two views make radically different assumptions 
about the presence or absence of structure underlying the 
functions. The feedback model assumes a structure underly- 
ing and supporting the process, whereas the dynamic systems 
model does not necessarily incorporate such an assumption. 


CONNECTIONISM 


A related set of questions about the role of central control 
processes is raised by the literature of connectionism. Con- 
nectionist models simulate thought processes in networks of 
artificial units in which “processing” consists of passing acti- 
vation among the units. As in neurons, the signal can be exci- 
tatory or inhibitory. Energy passes in only one direction 
(though some networks have feedback links). Processing 
proceeds entirely by the spread of activation—there is no 
higher order executive to direct traffic. In a distributed con- 
nectionist network, knowledge is not represented centrally, as 
nodes of information. Rather, knowledge is represented in 
terms of the pattern of activation of the network as a whole 
(Smith, 1996). 

In networks with feedback relations, once the system re- 
ceives input, the pattern of weights and activations is updated 
repeatedly across many cycles. Thus, modifications or up- 
dates are made iteratively throughout the network, both with 
respect to activation in each node and the weighting func- 
tions. Gradually, the various values asymptote, and the sys- 
tem “settles” into a configuration. The settling reflects the 
least amount of overall error the system has been able to cre- 
ate, given its starting inputs and weights. 


Multiple Constraint Satisfaction 


A useful way to think about this process is that the system si- 
multaneously satisfies multiple constraints that the elements 
create on each other (Thagard, 1989; see also Kelso, 1995). 
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For example, two mutually inhibitory nodes cannot both be 
highly active at the same time. Thus they constrain one an- 
other. Constraints among multiple nodes are settled out dur- 
ing the repeated updating of activation levels. 

This idea of multiple constraint satisfaction is now having 
a substantial impact on how people in social psychology think 
about a variety of topics (Kunda & Thagard, 1996; Read, 
Vanman, & Miller, 1997; Schultz & Lepper, 1996). It is an 
idea that has a great deal of intuitive appeal. It captures well 
the introspective sense that people come to conclusions and 
decisions not by weighing the evidence, exactly, but rather by 
letting the evidence sort itself until it reaches a degree of in- 
ternal consistency. The conclusion then pops into mind. 

Another term that goes along with this picture is self- 
organization (e.g., Prigogine & Stengers, 1984). The idea be- 
hind this label is that multiple causal forces which have no 
intrinsic relation to each other can cause the spontaneous 
emergence of some property of the system as a whole that 
does not otherwise exist. The term is used to describe emer- 
gent qualities in a variety of scientific disciplines. A number 
of people have begun to invoke it as a basis for emergent 
properties in dynamic systems (Nowak & Vallacher, 1998; 
Prigogine & Stengers, 1984). 


Self-Organization and Self-Regulation 


Some would argue that models of self-organization in dy- 
namic systems represent a serious challenge to the viability of 
the type of self-regulatory model with which we began. That 
is, it might be asserted that behavior only seems to be self- 
regulated—that behavior instead self-organizes from among 
surrounding forces, like foam appearing on roiling surf. 

Do feedback processes actually reflect self-organization— 
a haphazard falling together of disparate forces? Or are there 
structures in the nervous system (and elsewhere) in living sys- 
tems that carry out true feedback functions? In considering the 
relation between the two sets of ideas, it is of interest that 
MacKay (1956) anticipated the principle of self-organization 
many years ago when he described a system of feedback 
processes that could evolve its own goals (see also Beer, 1995; 
Maes & Brooks, 1990). Thus, MacKay found the principle of 
self-organization to be useful, but he found it useful explicitly 
within the framework of a self-regulatory model. 

Our view is, similarly, that the concepts of attractors and 
trajectories within phase space complement the idea that be- 
havior is guided by feedback processes but do not replace it 
(Carver & Scheier, in press). There do appear to be times and 
circumstances in which forces converge—unplanned—and 
induce acts to occur that were not intended beforehand. How- 
ever, there also seem to be clear instances of intentionality in 
behavior and its management. 


It is of interest in this regard that contemporary cogni- 
tive psychologists often assume the existence of both bot- 
tom-up organizational tendencies and top-down directive 
tendencies (see, e.g., Holyoak & Spellman, 1993; Shastri & 
Ajjanagadde, 1993; Sloman, 1996; Smolensky, 1988). That 
view would seem to fit a picture in which self-organization 
of action can occur, but where actions can also be planned 
and executed systematically, from the top down. Similar 
two-mode models of regulation have also appeared in sev- 
eral literatures in personality-social psychology (Chaiken & 
Trope, 1999). In short, there seems to be some degree of con- 
sensus that human experience is part self-organization and 
part self-regulation. 

Even when the focus is on planful behavior, the two kinds 
of models seem to complement each other in a different way. 
The feedback model provides a mechanism through which 
goal-directed action is managed, which the phase-space 
model lacks. The phase-space model suggests ways of think- 
ing about how multiple goals exist and how people shift 
among those multiple goals over time, an issue that is not 
dealt with as easily in terms of feedback processes. 

That is, think of the landscape of chaotic attractors, but with 
many different basins rather than just two or three. This seems 
to capture rather well the sense of human behavior. No basin in 
this system ever becomes a point attractor. Behavior tends to- 
ward one goal and then another, never being completely cap- 
tured by any goal. The person does one thing for a while, then 
something else. The goals are all predictable—in the sense 
that they all influence the person—and the influence is highly 
predictable when aggregated across time. But the shifts from 
one to another occur unpredictably (thus being chaotic). 


CATASTROPHE THEORY 


Another set of ideas that has been around for some time but 
may be reemerging in influence is catastrophe theory, a math- 
ematical model that bears on the creation of discontinuities, 
bifurcations, or splittings (Brown, 1995; Saunders, 1980; 
Stewart & Peregoy, 1983; van der Maas & Molenaar, 1992; 
Woodcock & Davis, 1978; Zeeman, 1977). A catastrophe oc- 
curs when a small change in one variable produces an abrupt 
(and usually large) change in another variable. 

An abrupt change implies nonlinearity. This focus on non- 
linearity is one of several themes that catastrophe theory 
shares with dynamic systems theory, though the two bodies of 
thought have different origins (and are seen by some as quite 
different from each other—see Kelso, 1995, chap. 2). The sim- 
ilarity is nicely expressed in the statement that the discontinu- 
ity in catastrophe theory reflects “the sudden disappearance of 
one attractor and its basin, combined with the dominant 
emergence of another attractor” (Brown, 1995, p. 51). 


Figure 8.9 Three-dimensional depiction of a cusp catastrophe. Variables x 
and z are predictors, and y is the system’s “behavior,” the dependent variable. 
The catastrophe shows sensitive dependence on initial conditions. Where z is 
low, points 1 and 2 are nearly the same on x. If these points are projected for- 
ward on the surface (with increases in z), they move in parallel until the cusp 
begins to emerge. The lines are then separated by the formation of the cusp 
and project to completely different regions of the surface. Source: From 
C. S. Carver and M. F. Scheier, On the Self-Regulation of Behavior, copy- 
right 1998, Cambridge University Press. Reprinted with permission. 


Though several types of catastrophe exist (Brown, 1995; 
Saunders, 1980; Woodcock & Davis, 1978), the one receiv- 
ing most attention regarding behavior is the cusp catastrophe, 
in which two variables influence an outcome. Figure 8.9 por- 
trays its three-dimensional surface. X and z are predictors, 
and y is the outcome. At low values of z, the surface of the fig- 
ure shows a roughly linear relationship between x and y. As x 
increases, so does y. As z increases, the relationship between 
x and y becomes less linear. It first shifts toward something 
like a step function. With further increase in z, the x-y rela- 
tionship becomes even more clearly discontinuous—the out- 
come is either on the top surface or on the bottom. Thus, 
changes in z cause a change in the way x relates to y. 

Another theme that links catastrophe theory to dynamic 
systems is the idea of sensitive dependence on initial condi- 
tions. The cusp catastrophe displays this characteristic nicely. 
Consider the portion of Figure 8.9 where z has low values and 
x has a continuous relation to y (the system’s behavior). 
Points 1 and 2 on x are nearly identical, but not quite. Now 
track these points across the surface as z increases. For a 
while the two paths track each other closely, until suddenly 
they begin to be separated by the fold in the catastrophe. At 
higher levels of z, one track ultimately projects to the upper 
region of the surface, the other to the lower region. Thus, a 
very slight initial difference results in a substantial difference 
farther along. 


Hysteresis 


The preceding description also hinted at an interesting and 
important feature of a catastrophe known as hysteresis. A sim- 
ple characterization of what this term means is that at some 
levels of z, there is a kind of fold-over in the middle of the x-y 
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Figure 8.10 A cusp catastrophe exhibits a region of hysteresis (between 
values a and b on the x axis), in which x has two stable values of y (the solid 
lines) and one unstable value (the dotted line that cuts backward in the mid- 
dle of the figure). The region represented by the dotted line repels trajecto- 
ries, whereas the stable regions (those surrounding values c and d on the 
x-axis) attract trajectories. Traversing the zone of hysteresis from the left 
of this figure results in an abrupt shift (at value b on the x-axis) from the 
lower to the upper portion of the surface (right arrow). Traversing the zone 
of hysteresis from the right of this figure results in an abrupt shift (at value a 
on the x-axis) from the upper to the lower portion of the surface (left arrow). 
Thus, the disjunction between portions of the surface occurs at two different 
values of x, depending on the starting point. Source: From C. S. Carver and 
M. F. Scheier, On the Self-Regulation of Behavior, copyright 1998, 
Cambridge University Press. Reprinted with permission. 


relationship. A region of x exists in which more than one value 
of y exists. Another way to characterize hysteresis is that two 
regions of this surface are attractors and one is a repeller 
(Brown, 1995). This unstable area is illustrated in Figure 8.10. 
The dashed-line portion of Figure 8.10 that lies between val- 
ues a and b on the x-axis—the region where the fold is going 
backward—tepels trajectories (Brown, 1995), whereas the 
areas near values c and d attract trajectories. To put it more 
simply, you cannot be on the dashed part of this surface. 

Yet another way of characterizing hysteresis is captured 
by the statement that the system’s behavior depends on the 
system’s recent history (Brown, 1995; Nowak & Lewenstein, 
1994). That is, as you move into the zone of variable x that 
lies between points a and b in Figure 8.10, it matters which 
side of the figure you are coming from. If the system is mov- 
ing from point c into the zone of hysteresis, it stays on the 
bottom surface until it reaches point b, where it jumps to the 
top surface. If the system is moving from d into the zone of 
hysteresis, it stays on the top surface until it reaches point a, 
where it jumps to the bottom surface. 


An Application of Catastrophe Theory 


How does catastrophe theory apply to the human behaviors 
of most interest to personality and social psychologists? Sev- 
eral applications of these ideas have been made in the past 
decade or so, and others seem obvious candidates for future 
study (for broader discussion see Carver & Scheier, 1998, 
chap. 16). 

One interesting example concerns what we believe is a 
bifurcation between engagement in effort and giving up. 
Earlier we pointed to a set of theories that assume such a 
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Figure 8.11 A catastrophe model of effort versus disengagement. Source: 


From C. S. Carver and M. F. Scheier, On the Self-Regulation of Behavior, 
copyright 1998, Cambridge University Press. Reprinted with permission. 


disjunction (Brehm & Self, 1989; Kukla, 1972; Wortman & 
Brehm, 1975). In all those models (as in ours), there is a point 
at which effort seems fruitless and the person stops trying. 
Earlier, we simply emphasized that the models all assumed a 
discontinuity. Now we look at the discontinuity more closely 
and suggest that the phenomena addressed by these theories 
may embody a catastrophe. 

Figure 8.11 shows a slightly relabeled cross section of a 
cusp catastrophe similar to that in Figure 8.10. This figure 
displays a region of hysteresis in the engagement versus 
disengagement function. In that region, where task demands 
are close to people’s perceived limits to perform, there should 
be greater variability in effort or engagement, as some people 
are on the top surface of the catastrophe and others are on the 
bottom surface. Some people would be continuing to exert 
efforts at the same point where others would be exhibiting a 
giving-up response. 

Recall that the catastrophe figure also conveys the sense 
that the history of the behavior matters. A person who enters 
the region of hysteresis from the direction of high confidence 
(who starts out confident but confronts many contradictory 
cues) will continue to display engagement and effort, even as 
the situational cues imply less and less basis for confidence. 
A person who enters that region from the direction of low 
confidence (who starts doubtful but confronts contradictory 
cues) will continue to display little effort, even as the cues 
imply a greater basis for confidence. 

This model helps indicate why it can be so difficult to get 
someone with strong and chronic doubts about success in 
some domain of behavior to exert real effort and engagement 
in that domain. It also suggests why a confident person is so 
rarely put off by encountering difficulties in the domain 
where the confidence lies. To put it in terms of broader views 
about life in general, it helps show why optimists tend to stay 
optimistic and pessimists tend to stay pessimistic, even when 
the current circumstances of the two sorts of people are iden- 
tical Oe, in the region of hysteresis). 

It is important to keep in mind that the catastrophe cross 
section (Figure 8.11) is the picture that emerges under 


catastrophe theory only once a clear region of hysteresis has 
begun to develop. Farther back, the model is more of a step 
function. An implication is that to see the fold-over it is im- 
portant to engage the variable that is responsible for bringing 
out the bifurcation in the surface (i.e., axis z in Figure 8.9). 

What is the variable that induces the bifurcation? We think 
that in the motivational models under discussion—and per- 
haps more broadly—the control parameter is importance. Im- 
portance arises from several sources, but there is a common 
thread among events seen as important. They demand mental 
resources. We suspect that almost any strong pressure that 
demands resources (time pressure, self-imposed pressure) 
will induce bifurcating effects. 


CONCLUDING COMMENT 


In this chapter we sketched a set of ideas that we think are im- 
portant in conceptualizing human self-regulation. We believe 
that behavior is goal directed and feedback controlled and 
that the goals underlying behavior form a hierarchy of ab- 
stractness. We believe that experiences of affect (and of con- 
fidence vs. doubt) also arise from a process of feedback 
control, but a feedback process that takes into account tem- 
poral constraints. We believe that confidence and doubt yield 
patterns of persistence versus giving up and that these two re- 
sponses to adversity form a dichotomy in behavior. These 
ideas have been embedded in our self-regulatory viewpoint 
for some time. 

We have also recently begun to consider some newer 
ideas, addressed in the latter parts of the chapter. In those sec- 
tions we described ideas from dynamic systems theory, con- 
nectionism, and catastrophe theory. We suggest that they 
represent useful tools for the analysis and construal of behav- 
ior. Our view is that they supplement rather than replace the 
tools now in use (though not everyone will agree on this 
point). We see many ways in which those ideas mesh with the 
ideas presented earlier, though space constraints limited us to 
discussing that integration only briefly. 

In thinking about the structure of behavior, we have tried 
to draw on ideas from disparate sources while continuing to 
follow the thread of the logical model from which we started. 
The result is an aggregation of principles that we think have 
a good deal to say about how behavioral self-regulation takes 
place. In so doing, they also say something about personality 
and how it is manifested in people’s actions. 

The conceptual model presented here is surely not com- 
plete, and many avenues exist for further discussion and in- 
deed further conceptual development. For example, this 
chapter included little attention to the issue of how new goals 


are added to people’s hierarchies or of how to think about 
growth and change over time (but see Carver & Scheier, 
1998, 1999a, 1999b). Similarly, the concepts addressed here 
bear in several ways on problems in behavior and behavior 
change, though space constraints prevent us from describing 
them in detail. For example, we suspect that many problems 
in people’s lives are, at their core, problems of disengage- 
ment versus engagement and the failure to disengage adap- 
tively (Carver & Scheier, 1998). As another example, it may 
be useful to conceptualize problems as less-than-optimal 
adaptations in a multidimensional phase space, which require 
some jostling to bounce the person to a new attractor (Hayes 
& Strauss, 1998). These are all areas in which more work re- 
mains to be done. 

These are just some of the ways in which we think the fam- 
ily of ideas described here will likely be explored in the near 
future. Further analyses of the self-regulation of behavior are 
likely to produce insights that transform the models from 
which the insights grew. As the models change, so will our un- 
derstanding of motivational processes and of how human be- 
ings function as coherent, autonomous units. This we take to 
be one of the core pursuits of personality psychology. 
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INTERPERSONAL FOUNDATIONS FOR AN 
INTEGRATIVE THEORY OF PERSONALITY 


The origins of the interpersonal theory of personality we dis- 
cuss in the present chapter are found in Sullivan’s (1953a, 
1953b, 1954, 1956, 1962, 1964) interpersonal theory of 
psychiatry. Extensions, elaborations, and modifications have 
consistently appeared over the last 50 years, with landmark 
works appearing in each successive decade (see Table 9.1). 
Given this clear line of theoretical development, it might 
seem puzzling that in a discussion of the scope of interper- 
sonal theory held at a recent meeting of the Society for Inter- 
personal Theory and Research (SITAR), it was pointed out 
that psychology’s expanding focus on interpersonal function- 
ing has rendered study of interpersonal processes so funda- 
mental that interpersonal theory risks an identity crisis 
(Gurtman, personal communication, June 20, 2000). In our 
opinion, both promising and perplexing aspects of this iden- 
tity crisis are respectively reflected in two growing bodies of 
literature. The former body recognizes the integrative and 
synthetic potential of interpersonal theory to complement and 
enhance many other theoretical approaches to the study of 
personality (e.g., Benjamin, 1996c; Kiesler, 1992), whereas 
the latter body focuses on interpersonal functioning without 
any recognition of interpersonal theory. 

Explicit efforts have been made toward integration of 
interpersonal theory and cognitive theory (e.g., Benjamin, 
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1986; Benjamin & Friedrich, 1991; Carson, 1969, 1982; 
Safran, 1990a, 1990b; Tunis, Fridhandler, & Horowitz, 
1990), attachment theory (e.g., Bartholomew & L. Horowitz, 
1991; Benjamin, 1993; Birtchnell, 1997; Florsheim, Henry, 
& Benjamin, 1996; Pincus, Dickinson, Schut, Castonguay, & 
Bedics, 1999; Stuart & Noyes, 1999), contemporary psy- 
chodynamic theory (e.g., Benjamin, 1995; Benjamin & 
Friedrich, 1991; Heck & Pincus, 2001; Lionells, Fiscalini, 
Mann, & Stern, 1995; Pincus, 1997; Roemer, 1986), and 
evolutionary theory (e.g., Hoyenga, Hoyenga, Walters, & 
Schmidt, 1998; Zuroff, Moskowitz, & Cote, 1999). Although 
it might be argued that such efforts could lead to identity dif- 
fusion of interpersonal theory, we believe this points to the 
fundamental integrative potential of an interpersonal theory 
of personality. In contrast, efforts at integrating interpersonal 
theory with social psychological theories of human interac- 
tion and social cognition appear to be lagging despite the 
initial works of Carson (1969) and Wiggins (1980). We note 
continued expansion of a significant social psychological lit- 
erature on interpersonal behavior, such as self-verification 
and self-confirmation theories (e.g., Hardin & Higgins, 1996; 
Swann & Read, 1981) and interpersonal expectancies (e.g., 
Neuberg, 1996), that does not incorporate interpersonal 
theory as reviewed here. Remarkably, recent reviews of 
interpersonal functioning (Reis, Collins, & Berscheid, 2000; 
Snyder & Stukas, 1999) did not cite any of the literature 
reviewed for the present chapter on interpersonal theory, 
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TABLE 9.1 Landmark Publications in Interpersonal Theory 


1950s 1960s 1970s 1980s 1990s 

Sullivan (1953a) Schaefer (1961) Benjamin (1974) Wiggins (1980) Benjamin (1996b) 
Sullivan (1953b) Sullivan (1962) McLemore & Benjamin (1979) Anchin & Kiesler (1982) Wiggins & Trapnell (1996) 
Sullivan (1954) Lorr & McNair (1963) Wiggins (1979) Wiggins (1982) Kiesler (1996) 

Sullivan (1956) Sullivan (1964) Kiesler (1983) Wiggins & Trobst (1999) 
Leary (1957) Lorr & McNair (1965) Benjamin (1984) 

Schaefer (1959) Carson (1969) Horowitz & Vitkus (1986) 


nor do interpersonal theorists regularly recognize the social 
psychological literature on interpersonal interaction in their 
work (cf. Kiesler, 1996). 

Thus, the current state of affairs compels interpersonal 
theorists to take the next step in defining the interpersonal 
foundations for an integrative theory of personality. The ini- 
tial integrative efforts provide a platform to refine the scope 
of interpersonal theory, and the areas in which integration is 
lacking indicate that further development is necessary. The 
goal of this chapter is to begin to forge a new identity for in- 
terpersonal theory that recognizes both its unique aspects and 
integrative potential; in this chapter, we also suggest impor- 
tant areas in need of further theoretical development and em- 
pirical research. 


THE INTERPERSONAL SITUATION 


I had come to feel over the years that there was an acute need for 
a discipline that was determined to study not the individual 
organism or the social heritage, but the interpersonal situations 
through which persons manifest mental health or mental disor- 
der. (Sullivan, 1953b, p. 18) 

Personality is the relatively enduring pattern of recurrent in- 
terpersonal situations which characterize a human life. (Sullivan, 
1953b, pp. 110-111) 


These statements are remarkably prescient, as much of psy- 
chology in the new millenium seems devoted in one way or 
another to studying interpersonal aspects of human existence. 
To best understand how this focus has become so fundamen- 
tal to the psychology of personality (and beyond), we must 
clarify what is meant by an interpersonal situation. Perhaps 
the most basic implication of the term is that the expression 
of personality (and hence the investigation of its nature) 
focuses on phenomena involving more than one person—that 
is to say, some form of relating is occuring (Benjamin, 1984; 
Kiesler, 1996; Mullahy, 1952). Sullivan (1953a, 1953b) sug- 
gested that individuals express “integrating tendencies” that 
bring them together in the mutual pursuit of both satisfactions 
(generally a large class of biologically grounded needs) and 
security (i.e., self-esteem and anxiety-free functioning). 


These integrating tendencies develop into increasingly com- 
plex patterns or dynamisms of interpersonal experience. 
From infancy onward through six developmental epochs 
these dynamisms are encoded in memory via age-appropriate 
learning. According to Sullivan, interpersonal learning of 
social behaviors and self-concept is based on an anxiety 
gradient associated with interpersonal situations. All inter- 
personal situations range from rewarding (highly secure) 
through various degrees of anxiety and ending in a class of 
situations associated with such severe anxiety that they are 
dissociated from experience. Individual variation in learning 
occurs when maturational limits affect the developing a 
person’s understanding of cause-and-effect logic and consen- 
sual symbols such as language (i.e., Sullivan’s prototaxic, 
parataxic, and syntaxic modes of experience), understanding 
of qualities of significant others (including their “reflected 
appraisals” of the developing person), as well as their under- 
standing of the ultimate outcomes of interpersonal situations 
characterizing a human life. Thus, Sullivan’s concept of the 
interpersonal situation can be summarized as the experience 
of a pattern of relating self with other associated with varying 
levels of anxiety (or security) in which learning takes place 
that influences the development of self-concept and social 
behavior. This is a very fundamental human experience for 
psychology to investigate, and it is a significant aspect of 
the efforts to integrate interpersonal theory with cognitive, 
attachment, psychodynamic, and evolutionary theories previ- 
ously noted. 

Sullivan (1954) described three potential outcomes of in- 
terpersonal situations. Interpersonal situations are resolved 
when integrated by mutual complementary needs and recip- 
rocal patterns of activity, leading to “felt security” and prob- 
able recurrence. A well-known example is the resolution of 
an infant’s distress by provision of tender care by parents. 
The infant’s tension of needs evokes complementary parental 
needs to provide care (Sullivan, 1953b). Interpersonal situa- 
tions are continued when needs and patterns of activity are 
not initially complementary, such that tensions persist and 
covert processing of possible alternative steps toward resolu- 
tion emerge, leading to possible negotiation of the rela- 
tionship (Kiesler, 1996). Finally, interpersonal situations are 


frustrating when needs and actions are not complementary 
and no resolution can be found, leading to an increase in anx- 
iety and likely disintegration of the situation. 

For Sullivan, the interpersonal situation underlies genesis, 
development, mutability, and maintenance of personality. 
The continuous patterning and repatterning of interpersonal 
experience in relation to the vicissitudes of satisfactions and 
security in interpersonal situations gives rise to lasting 
conceptions of self and other (Sullivan’s “personifications’””) 
as well as to enduring patterns of interpersonal relating. To 
us, the interpersonal situation is at the core of an inter- 
personal theory of personality. The power of interpersonal 
experiences to create, refine, and change personality as Sulli- 
van conceived is the foundation of an interpersonal theory of 
personality that has been elaborated in the last half century by 
a wide range of theoretical, empirical, and clinical efforts. 

A comprehensive theory of personality includes con- 
temporaneous analysis emphasizing present description and 
developmental analysis emphasizing historical origins as well 
as the continuing significance of past experience on cur- 
rent functioning (Millon, 1996). Consistent with these ap- 
proaches, the fundamental aspects of an interpersonal theory 
of personality should include (a) a delineation of what is 
meant by interpersonal, (b) the systematic description of in- 
terpersonal behavior, (c) the systematic description of recip- 
rocal interpersonal patterns, (d) articulation of processes and 
structures that account for enduring patterns of relating, and 
(e) motivational and developmental principles. In our opin- 
ion, interpersonal theorists have reached greater consensus on 
contemporaneous description than on developmental con- 
cepts. This consensus may be due in part to ambiguity in the 
meaning of the term interpersonal. 


THE INTERPERSONAL AND THE INTRAPSYCHIC 


Where are interpersonal situations to be found? Millon’s 
(1996) distinction between contemporaneous and develop- 
mental analysis alludes to the dichotomy of the interpersonal 
and the intrapsychic. Specifically, current description evokes 
a view of the reciprocal behavior patterns of two persons en- 
gaged in resolving, negotiating, or disintegrating their pre- 
sent interpersonal situation. In this sense, we might focus on 
what can be observed to transpire between them. In contrast, 
developmental analysis implies that there is something rela- 
tively stable that a person brings to each new interpersonal 
situation. Such enduring influences might be considered to 
reside within the person—that is, they are intrapsychic. The 
dichotomous conception of the interpersonal and the in- 
trapsychic as two sets of phenomena—one residing between 
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people and one residing within a person—may have at times 
led interpersonal theorists to focus more attention on con- 
temporaneous analysis with perhaps greater hesitancy to 
elaborate on developmental influences. In our opinion, how- 
ever, we must include developmental concepts if we are to be 
comprehensive, and this in turn requires examination of in- 
trapsychic structures and processes. As it turns out, Sullivan 
would not be opposed to such efforts. 

Greenberg and Mitchell (1983) point out that Sullivan’s 
interpersonal theory of psychiatry was largely a response to 
Freud’s strong emphasis on drive-based intrapsychic aspects 
of personality. Because of Sullivan’s opposition to drives as 
the source of personality structuralization, there is a risk of 
simplifying interpretation of interpersonal theory as focusing 
solely on what occurs outside the person, in the world of ob- 
servable interaction. Mitchell (1988) points out that Sullivan 
was quite amenable to incorporating the intrapsychic into 
interpersonal theory because he viewed the most important 
contents of the mind to be the consequence of lived inter- 
personal experience. For example, Sullivan (1964) states, 
“everything that can be found in the human mind has been 
put there by interpersonal relations, excepting only the capa- 
bilities to receive and elaborate the relevant experiences” 
(p. 302; see also Stern, 1985, 1988). 

Mitchell (1988) specifies several concepts associated with 
the dichotomization of interpersonal and intrapsychic, in- 
cluding perception versus fantasy and actuality versus 
psychic reality. Sullivan clearly viewed fantasy as fundamen- 
tal to interpersonal situations. He defined psychiatry as 
the “study of the phenomena that occur in configurations 
made up of two or more people, all but one of whom may be 
more or less completely illusory” (Sullivan, 1964, p. 33). 
These illusory aspects of the interpersonal situation involve 
mental structures—that is, personifications of self and others. 
Sullivan (1953b) was forceful in asserting that personifica- 
tions are elaborated organizations of past interpersonal expe- 
rience, stating “. . . I would like to make it forever clear that 
the relation of the personifications to that which is personified 
is always complex and sometimes multiple; and that personi- 
fications are not adequate descriptions of that which is per- 
sonified” (p. 167). Sullivan also saw subjective meaning (i.e., 
psychic reality) as highly important. For example, Mitchell 
(1988) points out that Sullivan’s conception of parataxic inte- 
gration involves subjective experience of the interpersonal 
situation influenced by intrapsychic structure and process. 
Sullivan (1953a) describes parataxic integrations as occur- 
ring “when, beside the interpersonal situation as defined 
within the awareness of the speaker, there is a concomitant 
interpersonal situation quite different as to its principle inte- 
grating tendencies, of which the speaker is more or less 
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completely unaware” (p. 92). In discussing the data of 
psychiatry, Sullivan (1964) asserted that “human behavior, 
including the verbal report of subjective appearances 
(phenomena), is the actual matter of observation” (p. 34). 

Thus, we can assert that interpersonal theory is not strictly 
an interactional theory emphasizing observable behavior; 
rather, the term interpersonal is meant to convey a sense of 
primacy, a set of fundamental phenomena important for per- 
sonality development, structuralization, function, and pathol- 
ogy. It is not a geographic indicator of locale: It is not meant 
to generate a dichotomy between what is inside the person 
and what is outside the person. From a Sullivanian stand- 
point, the intrapsychic is intrinsically interpersonal, derived 
from the registration and elaboration of interactions occurring 
in the interpersonal field (Mitchell, 1988). As we will see, 
however, descriptions of observable interpersonal behavior 
and patterns of relating have generated far more consensus 
among interpersonal theorists than have elaboration of in- 
trapsychic processes and concepts. 


DESCRIBING INTERPERSONAL BEHAVIOR 


The emphasis on interpersonal functioning in Sullivan’s 
work stimulated efforts to develop orderly and lawful con- 
ceptual and empirical models describing interpersonal be- 
havior. The goal of such work was to obtain a taxonomy of 
interpersonal behavior— “to obtain categories of increasing 
generality that permit description of behaviors according to 
their natural relationships” (Schaefer, 1961, p. 126; see also 
Millon, 1991, for a general discussion of taxonomy in clas- 
sification of personality and psychopathology). In contem- 
porary terms, such systems are referred to as structural 
models, which can be used to conceptually systematize ob- 
servation and covariation of variables of interest. If suffi- 
ciently integrated with rich theory, such models can even be 
considered nomological nets (Benjamin, 1996a; Gurtman, 
1992). 

There have been two distinct but related empirical 
approaches to the development of structural models describ- 
ing interpersonal functioning. We refer to these as the indi- 
vidual differences approach and the dyadic approach (Pincus, 
Gurtman, & Ruiz, 1998). These authors pointed out that 
although each approach has unique aspects, the approaches 
converge in that they assert that the best structural model of 
interpersonal behavior takes the form of a circle or circumplex 
(Gurtman & Pincus, 2000; Pincus et al., 1998; Wiggins & 
Trobst, 1997). The geometric properties of circumplex mod- 
els give rise to unique computational methods for assess- 
ment and research (Gurtman, 1994, 1997, 2001; Gurtman & 


Balakrishnan, 1998; Gurtman & Pincus, in press) that are not 
reviewed here. In the present chapter, circumplex models of 
interpersonal behavior are used to anchor description of theo- 
retical concepts. The development of circumplex models of 
interpersonal behavior has significantly influenced contem- 
porary developments in interpersonal theory, and vice versa 
(Pincus, 1994). 


The Individual Differences Approach 


The individual differences approach focuses on qualities of 
the individual, (e.g., personality traits) that are assumed to 
give rise to behavior that is generally consistent over time and 
across situations (Wiggins, 1997). From a relational stand- 
point, this approach involves behavior which is also generally 
consistent across interpersonal situations, giving rise to the in- 
dividual’s interpersonal style (e.g., Lorr & Youniss, 1986; Pin- 
cus & Gurtman, 1995; Pincus & Wilson, 2001), and in cases of 
psychopathology, an individual’s interpersonal diagnosis 
(Kiesler, 1986; Leary, 1957; McLemore & Benjamin, 1979; 
Wiggins, Phillips, & Trapnell, 1989). 

The individual differences approach led to the empirical 
derivation of a popular structural model of interpersonal 
traits, problems, and behavioral acts often referred to as the 
Leary circle (Freedman, Leary, Ossorio, & Coffey, 1951; 
Leary, 1957) or the Interpersonal Circle (IPC; Kiesler, 1983; 
Pincus, 1994; Wiggins, 1996). Leary and his associates at the 
Kaiser Foundation Psychology Research Group observed in- 
teractions among group psychotherapy patients and asked, 
“What is the subject of the activity, e.g., the individual whose 
behavior is being rated, doing to the object or objects of 
the activity?” (Freedman et al., 1951, p. 149). This context- 
free cataloging of all individuals’ observed interpersonal 
behavior eventually led to an empirically derived circular 
structure based on the two underlying dimensions of 
dominance-submission on the vertical axis and nurturance- 
coldness on the horizontal axis (see Figure 9.1). 

The IPC model is a geometric representation of individ- 
ual differences in a variety of interpersonal domains, includ- 
ing interpersonal traits (Wiggins, 1979, 1995), interpersonal 
problems (Horowitz, Alden, Wiggins, & Pincus, 2000), 
verbal and nonverbal interpersonal acts (Gifford, 1991; 
Kiesler, 1985, 1987), and covert interpersonal impacts 
(Kiesler, Schmidt, & Wagner, 1997; Wagner, Keisler, & 
Schmidt, 1995). Thus, all qualities of individual differences 
within these domains can be described as blends of the cir- 
cle’s two underlying dimensions. Blends of dominance and 
nurturance can be located along the 360° perimeter of the cir- 
cle. Interpersonal qualities close to one another on the perim- 
eter are conceptually and statistically similar, qualities at 90° 
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Figure 9.1 The Interpersonal Circle (IPC). 


are conceptually and statistically independent, and qualities 
180° apart are conceptual and statistical opposites. Although 
the circular model itself is a continuum without beginning or 
end (Carson, 1969, 1996; Gurtman & Pincus, 2000), any seg- 
mentalization of the IPC perimeter to identify lower-order 
taxa is potentially useful within the limits of reliable discrim- 
inability. The IPC has been segmentalized into sixteenths 
(Kiesler, 1983), octants (Wiggins, Trapnell, & Phillips, 
1988), and quadrants (Carson, 1969). 

Although the IPC represents a model of functioning in 
which the individual is presumed to be in many possible in- 
terpersonal situations, the model itself is monadic. The IPC 
structure does not include specific structural or contextual 
references to the interacting other. Most often, it is used 
to describe qualities of the individual interacting with a 
“generalized other” (Mead, 1932; Sullivan, 1953a, 1953b), 
such as the “hostile-dominant patient” interacting with a 
generic “psychotherapist” (e.g., Gurtman, 1996; Horowitz, 
Rosenberg, & Kalehzan, 1992). 


The Dyadic Approach 


In contrast to the individual differences approach, a second 
approach assumes that the basic unit of analysis for the study 
of interpersonal functioning was the dyad. As is the case for 
the IPC, there is a long history of theoretical and empirical 
conceptualizations of dyadic interpersonal functioning. At the 
same time that Leary and his colleagues were investigating in- 
dividual differences in interpersonal behavior, Schaefer 
(1959, 1961) began investigating mother-child dyads in an ef- 
fort to develop a structural model of interpersonal behavior. 
His methods were similar, but he emphasized the specific 
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dyad as the basic unit of observation: “For maternal behavior, 
the universe [of content] is the behavior of the mother directed 
toward an individual child, excluding all other behaviors of 
the mother” (Schaefer, 1961, p. 126). His work showed a re- 
markable convergence with Leary (1957)—both investigators 
found that a two-dimensional circular model best represented 
interpersonal behavior. As with the IPC, the horizontal di- 
mension was love-hostility. However, the vertical dimension 
differed, and was labeled autonomy, ranging from autonomy- 
granting to controlling. Given a dyadic focus, Schaefer (1961) 
also derived a complementary circular model of children’s be- 
havior in reaction to mothers. Although this early model 
failed to parallel his maternal behavior model, the notion that 
parent-like interpersonal behaviors and childlike interper- 
sonal behaviors may be distinguished from each other was an 
important advance that led to the development of a second 
prominent circular model of interpersonal behavior from a 
dyadic point of view. 

Structural Analysis of Social Behavior (SASB; Benjamin, 
1974, 1984, 1996a, 1996b, 2000) is a complex three-plane 
circumplex that operationally defines interpersonal and 
intrapsychic interactions (see Figure 9.2). The dimensions 
underlying SASB include autonomy (i.e., enmeshment- 
differentiation on the vertical axis), affiliation (i.e., love-hate 
on the horizontal axis), and interpersonal focus (i.e., parent- 
like transitive actions towards others represented by the top 
circle, childlike intransitive reactions to others represented 
by the middle circle, and introjected actions directed toward 
the self represented by the bottom circle). Benjamin (1996c) 
described the development of SASB as an effort “to combine 
the prevailing clinical wisdom about attachment with the 
descriptive power of the circumplex as Schaefer had envi- 
sioned it” (p. 1204). The unique multiplane structure of 
SASB also incorporates Sullivan’s concept of introjection— 
that is, the expected impact of interpersonal situations on the 
self-concept—by proposing a third corresponding circle that 
reflects how one relates to self. 

By separating parent-like and childlike behaviors into two 
planes, SASB incorporates both the vertical dimension of 
Schaefer’s model (control vs. emancipate) and that of the IPC 
(dominate vs. submit). The transitive surface represents the 
former, whereas the intransitive surface opposes submission 
with autonomy-taking. Thus, according to circumplex geom- 
etry, controlling and autonomy-granting are opposite inter- 
personal actions, whereas submitting and autonomy-taking 
are opposite interpersonal reactions (Lorr, 1991). Dominance 
and submission are placed at comparable locations on differ- 
ent surfaces to reflect the fact that they are complementary 
positions rather than opposites. Thus, SASB expands inter- 
personal description by including taxa reflecting friendly 
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Figure 9.2 Structural Analysis of Social Behavior (SASB). 


and hostile differentiation (e.g., affirming, ignoring) not 
defined within the IPC structure, as well as describing the 
introjected relationship with self. Although the vertical 
dimensions and complexity of SASB set it apart from the 
IPC, the same geometric assumptions are applicable. Inter- 
personal behaviors located along the perimeters of the SASB 
circles (identified as clusters in SASB terminology) represent 
blends of the basic dimensions with the same geometric 
relations among clusters on each surface. 

To complete the description, we note that attachment 
concepts have been incorporated into the SASB structure 


self-control 
3-5 


(Benjamin, 1993, 1996a, Florsheim et al., 1996; Henry, 
1994). Boxes in Figure 9.2 denote that interpersonal elements 
on the right side of the circles (affirm-disclose, reciprocal 
love, protect-trust) represent the attachment group (AG). 
Interpersonal elements on the left side of the circles (blame- 
sulk, attack-recoil, ignore-wall off) represent the disrupted 
attachment group (DAG). 

Using this expanded taxonomy, SASB describes a 
dyadic interpersonal unit—that is, a real or internalized 
relationship—rather than the qualities of a single interactant. 
For example, psychotherapy research using SASB has 


focused on the therapist-patient dyad as the unit of investiga- 
tion (e.g., Henry, Schacht, & Strupp, 1990). Despite these 
differences, we view the structural models derived from the 
individual differences and dyadic approaches to be highly 
convergent in many respects, and they should be viewed 
as complementary approaches rather than mutually exclusive 
competitors (e.g., Pincus, 1998; Pincus & Wilson, 2001). 


INTERPERSONAL RECIPROCITY 
AND TRANSACTION 


The notion of reciprocity in human relating is reflected in a 
wide variety of psychological concepts including repetition 
compulsion (Freud, 1914, 1920), projective identification 
(Grotstein, 1981), core conflictual relational themes 
(Luborsky & Crits-Cristoph, 1990), self-fulfilling prophe- 
cies (Carson, 1982), vicious circles (Millon, 1996), self- 
verification seeking (Swann, 1983), and object-relational 
enactments (Kernberg, 1976), to name a few. If we assume 
that an interpersonal situation involves two or more people 
relating to each other in ways that bring about social and self- 
related learning, this implies that something is happening that 
is more than mere random activity. Reciprocal relational pat- 
terns create an interpersonal field (Wiggins & Trobst, 1999) 
in which various transactional influences impact both interac- 
tants as they resolve, negotiate, or disintegrate the interper- 
sonal situation. Within this field, interpersonal behaviors tend 
to pull, elicit, invite, or evoke restricted classes of responses 
from the other, and this is a continual, dynamic transactional 
process. Thus, an interpersonal theory of personality empha- 
sizes field-regulatory processes over self-regulatory or affect- 
regulatory processes (Mitchell, 1988). 

Sullivan (1948) initially conceived of reciprocal processes 
in terms of basic conjunctive and disjunctive forces that lead 
either to resolution or to disintegration of the interpersonal 
situation. He further developed this in the “theorem of recip- 
rocal emotions,” which states that “integration in an interper- 
sonal situation is a process in which (1) complementary 
needs are resolved (or aggravated); (2) reciprocal patterns of 
activity are developed (or disintegrated); and (3) foresight 
of satisfaction (or rebuff) of similar needs is facilitated” 
(Sullivan, 1953b, p. 129). Kiesler (1983) pointed out that al- 
though this theorem was a powerful interpersonal assertion, it 
lacked specificity, and “the surviving general notion of com- 
plementarity was that actions of human participants are 
redundantly interrelated (i.e., have patterned regularity) in 
some manner over the sequence of transactions” (p. 198). 

Leary’s (1957) “principle of reciprocal interpersonal 
relations” provided a more systematic declaration of the 
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patterned regularity of interpersonal behavior, stating “inter- 
personal reflexes tend (with a probability greater than chance) 
to initiate or invite reciprocal interpersonal responses from the 
‘other’ person in the interaction that lead to a repetition of the 
original reflex” (p. 123). Learning in interpersonal situations 
takes place in part because social interaction is reinforcing 
(Leary, 1957). Carson (1991) referred to this as an interbehav- 
ioral contingency process whereby “there is a tendency for a 
given individual’s interpersonal behavior to be constrained or 
controlled in more or less predictable ways by the behavior 
received from an interaction partner” (p. 191). 


Describing Reciprocal Interpersonal Patterns 


Structural models of interpersonal behavior such as the IPC 
and SASB have provided conceptual anchoring points and 
lexicons upon which more systematic description of the 
patterned regularity of reciprocal interpersonal processes can 
be articulated (e.g., Benjamin, 1974; Carson, 1969; Kiesler, 
1983). 


The Interpersonal Circle 


Carson (1969) focused on the notion of interpersonal com- 
plementarity as the patterned regularity between two people 
that contributed to “felt security.” This notion is directly re- 
lated to Sullivan’s conception of a resolved interpersonal sit- 
uation as an outcome in which both persons’ needs are met 
via reciprocal patterns of activity leading to its likely recur- 
rence. Anchoring his propositions within the IPC system, 
Carson first proposed that complementarity was based on the 
social exchange of status and love, as reflected in reciprocity 
for the vertical dimension (i.e., dominance pulls for submis- 
sion; submission pulls for dominance) and correspondence 
for the horizontal dimension (friendliness pulls for friendli- 
ness; hostility pulls for hostility). 

Kiesler’s (1983) seminal paper on complementarity sig- 
nificantly expanded these [PC-based conceptions in several 
ways. First, he recognized the continuous nature of the circu- 
lar model’s descriptions of behavior, and he noted that be- 
cause all interpersonal behaviors are blends of dominance 
and nurturance, the principles of reciprocity and correspon- 
dence could be employed to specify complementary points 
along the entire IPC perimeter. Thus, beyond the cardinal 
points of the IPC, it was asserted that (for example) hostile 
dominance pulls for hostile submission, friendly dominance 
pulls for friendly submission, and so forth, which can be fur- 
ther described by the lower-level taxa in these segments of 
the model. Second, Kiesler also incorporated Wiggins’ (1979, 
1980, 1982) conception of the IPC as a formal geometric 
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model into his description of complementarity, whereby the 
distance from the center of the circle represents a dimen- 
sion of intensity. That is, complementarity involves both the 
class of behaviors and their strength. Reciprocity on domi- 
nance, correspondence on nurturance, and equivalent inten- 
sity thus define complementary behaviors. 

In addition, Kiesler (1983, 1996) defined two other broad 
classes of reciprocal interpersonal patterns anchored by the 
IPC model. When reciprocal interpersonal patterns meet one 
of the two rules of complementarity, he referred to this situa- 
tion as an acomplementary pattern. In such a case, interac- 
tants may exhibit correspondence with regard to nurturance 
or reciprocity with regard to dominance, but not both. When 
interactants exhibit neither reciprocity on dominance nor cor- 
respondence on nurturance, he referred to this situation as an 
anticomplementary pattern. In Kiesler’s (1996) discussion of 
these three reciprocal patterns of interpersonal behavior, it is 
clear that they relate rather directly to the types of outcomes 
of interpersonal situations suggested by Sullivan. Comple- 
mentary reciprocal patterns are considered to promote rela- 
tional stability—that is, such interpersonal situations are 
resolved, they are mutually reinforcing, and they are recur- 
ring. Acomplementary patterns are less stable and instigate 
negotiation (e.g., toward or away from greater complemen- 
tarity). Finally, anticomplementary patterns are the most un- 
stable and lead to avoidance, escape, and disintegration of the 
interpersonal situation. 


SASB 


After developing his two circular models of maternal and 
child behavior, Schaefer (1961) suggested that relationships 
between the two surfaces could be the basis for articulating a 
theory of influence of maternal behavior on child behavior, 
stating 


Bowlby (1951) has pointed out that both European and Ameri- 
can investigators agree that the quality of parental care has great 
importance to the development of the child. Less agreement ex- 
ists about how specific patterns of parent behavior are related to 
specific patterns of child behavior. One obstacle to the under- 
standing of such relationships has been a lack of knowledge of 
the interrelations of the concepts within each universe [italics 
added]. For the purpose of discussion, let us accept the concep- 
tual models presented here and attempt to develop hypotheses 
concerning the relationship of the two models [italics added]. 
(pp. 143-144) 


Benjamin (1974, 1984, 1996a, 1996b) has extended 
Schaefer’s proposition by formally articulating a class of 
reciprocal interpersonal patterns defined by intersurface 


relationships within the SASB model, referred to as SASB 
predictive principles. The main predictive principles are 
complementarity, similarity, opposition, antithesis, and intro- 
jection, although others may be logically deduced (Schacht, 
1994). It is important to note that these principles are not mu- 
tually exclusive from those anchored in the IPC model. The 
first four listed can also be articulated using the IPC. Com- 
plementarity implies the very same conditions for an inter- 
personal situation in both models with content (i.e., differing 
taxa) being the point of descriptive distinction. As Kiesler 
(1983) noted, similarity and opposition are specific forms of 
an acomplementary pattern as defined on the IPC. Antithesis 
is a form of anticomplementarity from the IPC perspective, 
again distinctly described using the SASB lexicon. Only 
introjection cannot be at least partially specified within the 
IPC model. 

Complementarity is based on the relations between transi- 
tive and intransitive SASB surfaces; it reflects the typical 
transactional so-called pulls, bids, or invitations that influ- 
ence dyadic interactants. It is defined when both members of 
a dyad are focused on the same person and exhibit compara- 
ble amounts of affiliation and autonomy. These can be identi- 
fied by the numbers indicating the SASB surface (1, 2, or 3) 
and the cluster (1 through 8) as indicated in Figure 9.2. For 
example, a therapist focuses on her patient and empathically 
communicates that she notices an emotional shift (1-2: 
affirm). In response, the patient focuses on himself and 
tells the therapist of the associated perceptions, cognitions, 
wishes, fears, or memories associated with his current affec- 
tive state (2-2: disclose). All possible complementary posi- 
tions are marked by taxa appearing in the same locations 
on surface one and surface two (i.e., attack-recoil, blame- 
sulk, control-submit, protect-trust, active love-reactive love, 
affirm-disclose, emancipate-separate, and ignore-wall off). 
Like the continuous nature of the IPC, the SASB model has 
several versions, differing in their level of segmentalization 
and thus precision in terms of their descriptive taxa and 
predictive principles. 

Similarity is exhibited when an individual imitates or acts 
like someone else—that is, they occupy the same points on 
the same SASB surface. Imitation, modeling, and observa- 
tional learning (Bandura, 1977) are important mechanisms in 
social learning theories that can be described by similarity. 
However, similarity has a different meaning if it is exhibited 
by two interactants in an interpersonal situation. If two 
people rigidly maintain similar positions at the same time, the 
situation will be rather unproductive—negotiation must 
occur for there to be much progress. A familiar example is a 
couple planning their weekend. If both attempt to control 
(demand their way), there is a power struggle. If both submit, 


little is accomplished as the pattern of What do you want to 
do?—I don't know, I'll do whatever you want cycles and 
stalls. In an occupational relationship, both boss and em- 
ployee tend to focus on the employee. The boss controls (in a 
friendly, neutral, or hostile way) and the employee complies 
in kind (i.e., complementarity). In contrast, an employee who 
consistently tries to boss the boss (i.e., similarity) will not be 
an employee for long! 

Points 180° apart describe opposition on each SASB sur- 
face. Opposing transitive actions are attack and active love, 
blame and affirm, control and emancipate, and protect and 
ignore. Opposing intransitive reactions are recoil and reac- 
tive love, sulk and disclose, submit and separate, and trust 
and wall off. Opposing introjected actions are self-attack and 
self-love, self-blame and self-affirm, self-control and self- 
emancipate, and self-protect and self-neglect. 

The complementary point of an opposite is its antithesis. 
Given a particular transitive or intransitive behavior, the an- 
tithesis is identified by first locating the behavior’s opposite 
on the same surface, and then identifying its complement. 
That is, antithetical points differ in interpersonal focus and 
are 180° apart. Due to the impact of complementarity (i.e., a 
bid or invitation), the antithesis is the response that pulls for 
maximal change in an interpersonal relationship. For exam- 
ple, a psychotherapy patient treated by the first author would 
frequently sulk (2-6) when she experienced the therapist as 
not understanding or supporting her (e.g., I don’t know why I 
come here, this isn’t helping me). Rather than complement 
this with blame (1-6; e.g., If you don’ try to tolerate not get- 
ting exactly what you want from me, this won't work), the 
antithetical affirming (1-2) response was enacted, (e.g., I can 
see that something I have done or failed to do has left you 
feeling pretty upset). The complement of affirm (1-2) is dis- 
close (2-2). The patient would often visibly relax and com- 
municate her frustration and disappointment. Thus, the 
antithesis of sulk (2-6) is affirm (1-2). Other antithetical pairs 
are emancipate and submit, active love and recoil, protect and 
wall off, control and separate, blame and disclose, attack 
and reactive love, and ignore and trust. 

Introjection is based on the relations between the transitive 
and introject SASB surfaces and describes the circumstance 
where an individual treats him- or herself as he or she has 
been treated by important others. This reflects Sullivan’s view 
that important aspects of an individual’s self-concept are de- 
rived from reflected appraisals of others. That is, the person 
comes to conceptualize and treat himself in accordance with 
the ways important others have related to him or her. Com- 
mon patterns often seen in psychotherapy include depressed 
patients who recall chronic blame and criticism from parents 
and now chronically self-blame, and patients with borderline 
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personalities who were physically or sexually abused as chil- 
dren (perpetrator attack) and who now chronically self-attack 
via cutting or burning. As with complementarity, all intro- 
jected positions are marked by clusters in the same location 
but reflect the pairing of transitive and introject surface de- 
scriptors. These include attack and self-attack, blame and 
self-blame, control and self-control, protect and self-protect, 
active love and self-love, affirm and self-affirm, emancipate 
and self-emancipate, and ignore and self-neglect. 

It is important to note that reciprocal interpersonal patterns 
anchored in either the IPC or SASB are neither inherently 
good nor inherently bad; they are value-free. In addition, we 
have tried to present them in their simplest form—as descrip- 
tors of behavior patterns that can be observed in interpersonal 
situations. A taxonomy of reciprocal interpersonal patterns is 
fundamental to contemporaneous analysis to account for 
transactional influences occurring in the interpersonal field 
and to developmental analysis to account for the enduring 
patterning of interpersonal situations that characterize a 
human life. 


Contemporaneous Analysis of Human Transaction 


In examining the immediate interpersonal situation, we may 
now use the taxonomies of interpersonal behavior and recip- 
rocal interpersonal patterns to provide a contemporaneous 
analysis of human transaction. The most central pattern dis- 
cussed previously is that of complementarity, and it is this 
reciprocal interpersonal pattern that anchors most theoretical 
discussions of interpersonal interaction. If we are to regard 
interpersonal behavior as influential or field regulatory, there 
must be some basic goals toward which our behaviors are di- 
rected. Sullivan (1953b) viewed the personification of the 
self to be a dynamism that is built up from the positive re- 
flected appraisals of significant others, allowing for relatively 
anxiety-free functioning and high levels of felt security and 
self-esteem. The self-dynamism tends to be self-perpetuating 
due to both our awareness and organization of interpersonal 
experience (input), and the field-regulatory influences of in- 
terpersonal behavior (output). Sullivan proposed that both 
our enacted behaviors and our perceptions of others’ behav- 
iors toward us are strongly affected by our self-concept. 
When we interact with others, we are attempting to define 
and present ourselves and trying to negotiate the kinds of in- 
teractions and relationships we seek from others. Sullivan’s 
(1953b) theorem of reciprocal emotion and Leary’s (1957) 
principle of reciprocal interpersonal relations have led to the 
formal view that what we attempt to regulate in the inter- 
personal field are the responses of the other. “Interpersonal 
behaviors, in a relatively unaware, automatic, and unintended 
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fashion, tend to invite, elicit, pull, draw, or entice from inter- 
actants restricted classes of reactions that are reinforcing of, 
and consistent with, a person’s proffered self-definition” 
(Kiesler, 1983, p. 201; see also Kiesler, 1996). To the extent 
that individuals can mutually satisfy their needs for interac- 
tion that are congruent with their self-definitions Oe, com- 
plementarity), the interpersonal situation remains integrated 
(resolved). To the extent that this fails, negotiation or disinte- 
gration of the interpersonal situation is more probable. 

As noted previously, interpersonal theory includes in- 
trapsychic elements. The contemporaneous description of the 
interpersonal situation utilizing either the IPC or SASB to 
delineate behavior and reciprocal patterns is not limited to the 
observable behaviors occurring between two people. Thus, 
interpersonal complementarity (or any other reciprocal 
pattern) should not be conceived of as some sort of stimulus- 
response process based solely on overt actions and reactions 
(Pincus, 1994). A comprehensive account of the contempora- 
neous interpersonal situation must somehow bridge the gap 
between the interpersonal (or overt) and the intrapsychic (or 
covert). Interpersonalists have indeed proposed many con- 
cepts and processes that clearly imply a rich and meaningful 
intrapsychic life (Kielser, 1996; Pincus, 1994), including per- 
sonifications, selective inattention, and parataxic distortions 
(Sullivan, 1953a, 1953b), covert impacts (Kiesler et al., 
1997), expectancies of contingency (Carson, 1982), fantasies 
and self-statements (Brokaw & McLemore, 1991), and cog- 
nitive interpersonal schemas (Foa & Foa, 1974; Safran, 
1990a, 1990b; Wiggins, 1982). We agree with Safran’s 
(1992) conclusion that the “ongoing attempt to clarify the re- 
lationship between interpersonal and intrapsychic levels is 
what is needed to fully realize the transtheoretical implica- 
tions of interpersonal theory” (p. 105). Much of the field is 
moving in this direction, as the relationship between the in- 
terpersonal and the intrapsychic is a common entry point for 
current integrative efforts (e.g., Benjamin, 1995; Florshiem 
et al., 1996, Tunis et al., 1990). 

Kiesler’s (1986, 1988, 1991, 1996) interpersonal transac- 
tion cycle provides the most articulated discussion of the 
relations among overt and covert interpersonal behavior 
within interpersonal situations. He proposes that the basic 
components of an interpersonal transaction are (a) Person X’s 
covert experience of Person Y, (b) Person X’s overt behavior 
toward Person Y, (c) Person Y’s covert experience in re- 
sponse to Person X’s action, and (d) Person Y’s overt behav- 
ioral response to Person X. These four components are part of 
an ongoing transactional chain of events cycling toward res- 
olution, further negotiation, or disintegration. Within this 
process, overt behavioral output serves the purpose of regu- 
lating the interpersonal field via elicitation of complementary 


overt responses in the other. The structural models of inter- 
personal behavior specify the range of descriptive taxa, 
whereas the motivational conceptions of interpersonal theory 
give rise to the nature of regulation of the interpersonal field. 
For example, dominant or controlling interpersonal behavior 
(e.g., Do it this way!) communicates a bid for status (e.g., 
I am an expert) that impacts the other in ways that elicit either 
complementary (e.g., Can you show me how?) or noncom- 
plementary (e.g., Quit bossing me around!) responses in an 
ongoing cycle of reciprocal causality, mediated by covert and 
subjective experience. 

In our opinion, the conceptions of covert processes medi- 
ating behavioral exchange have been a weak link in the inter- 
personal literature, reflecting much less consensus among 
theorists than do the fundamental dimensions and circular 
nature of structural models. The diverse conceptualizations 
proposed have not been comprehensively related to develop- 
mental analyses, nor have their influences on the observable 
interpersonal field been fully developed. In a significant step 
forward, Kiesler (1996) has synthesized many concepts (i.e., 
emotion, behavior, cognition, and fantasy) in developing the 
construct referred to as the impact message (see also Kiesler 
et al., 1997). Impact messages are fundamental covert aspects 
of the interpersonal situation, encompassing feelings (e.g., 
elicited emotions), action tendencies (pulls to do something; 
i.e., I should calm him down or I should get away), perceived 
evoking messages (i.e., subjective interpretations of the 
other’s intentions, desires, affect states, or perceptions of in- 
terpersonal situation), and fantasies (De, elaborations of the 
interaction beyond the current situation). Kiesler and his col- 
leagues view the link between the covert and overt aspects of 
the interpersonal situation to be emotional experience. Im- 
pact messages are part of a “transactional emotion process 
that is peculiarly essential to interpersonal behavior itself” 
(Kiesler, 1996, p. 71). Impact messages are registered 
covertly by Person X in response to Person Y’s interpersonal 
behavior, imposing complementary demands on the behavior 
of Person X through elicited cognition, emotion, and fantasy. 
Notably, the underlying structure of impact messages paral- 
lels that of the IPC (Kiesler et al., 1997; Wagner et al., 1995), 
allowing for description of covert processes that are on a 
metric common with the description of overt interpersonal 
behavior. 

In summary, contemporaneous analysis of the interper- 
sonal situation accounts for the patterned regularity of inter- 
actions by positing that interpersonal behavior typically 
evokes a class of covert responses (impact messages) that 
mediate cycles of overt behavior—that is, patterned rela- 
tional behavior occurs, in part, due to the field-regulatory in- 
fluences of interpersonal behavior on covert experience and 


the subsequent mediation of overt action by evoked covert 
exprience. In our opinion, this is only part of the story. Covert 
responses are intrapsychic phenomena that give rise to sub- 
jective experience. It is clear that the nature of such covert 
responses—that is, feelings, action tendencies, interpreta- 
tions, and fantasies—are not evoked completely in the mo- 
ment due to interpersonal behavior of another, but rather arise 
in part from enduring organizational tendencies of the indi- 
vidual, as the following example illustrates. 


Parataxic Integration of Interpersonal Situations 


The covert impact messages evoked within a contemporane- 
ous interpersonal transaction cycle are primarily associated 
with the overt behaviors of the interactants. It is assumed that 
interactants are generally aware of such covert experience, as 
the development of the self-report Impact Message Inventory 
(Kiesler & Schmidt, 1993) suggests. However, Sullivan 
(1953a) also suggested that other integrating tendencies— 
beyond those that are encoded within the proximal interper- 
sonal field—often influence the interpersonal situation. Such 
parataxic distortions may play a more or less significant role 
in the covert experience of one or the other person in an 
interpersonal situation. 


Clinical Example 


A psychotherapy patient treated by the first author entered her 
therapy session genuinely distraught and depressed. She 
reported that a person she labeled “an important friend” had 
ignored her during a recent social gathering and failed to 
attend a small celebration of her birthday. This was certainly 
no surprise to me, as this fellow had consistently behaved in 
an unreliable and invalidating manner toward my patient. 
However, it again appeared to be a surprise to her, and her 
disappointment was profound. The immediate interpersonal 
situation with this patient was quite familiar, and I decided 
our alliance was now sufficiently established to allow for an 
empathic effort to confront her continued unrealistic expecta- 
tions of this fellow and to further examine how her attach- 
ment to him seemed to leave her vulnerable to ongoing 
disappointments. 

I responded by saying, “I can understand that what has 
happened over the weekend has left you hurt, but I wonder 
why it is that despite repeated similar experiences with this 
‘friend,’ you continue to remain attached to him and hope he 
will give you what you want? It seems to leave you very 
vulnerable.” My patient responded with sullen withdrawal, 
curtly remarking, “Now you’re yelling at me just like my 
mother always does!” 
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I am fairly certain that had the session been videotaped, 
there would be no increase in the decibel level of my voice 
during the intervention. And, an internal scan of my reaction 
to her report suggested helpful intent rather than countertrans- 
ferential punitiveness. Nonetheless, my patient’s response 
clearly communicated that I was now berating her and putting 
her down, and that therapy was not supposed to go this way. 
This continued for several months—any effort I made to 
examine my patient’s contributions to her difficulties was 
rebuffed in a similar way. This continued to shape my thera- 
peutic responses, leaving me hesitant to venture in this direc- 
tion when my patient reported interpersonal difficulties. In 
other words, the repertoire of therapeutic behaviors I could 
provide became more and more limited by my patient’s rather 
rigid behaviors in our relationship. In Kiesler’s (1988) 
terminology, I was “hooked.” 

Several things are apparent from this example. First, using 
interpersonal structural models to describe the contempo- 
raneous therapeutic transaction, we would see that the rela- 
tionship was often characterized by noncomplementary 
responses and by a movement away from an integrated ther- 
apeutic relationship. I would try to direct her attention toward 
herself in an empathic way, and in response my patient would 
withdraw and threaten to leave. Second, my patient’s re- 
sponse of sullen withdrawal was, however, quite complemen- 
tary to her subjective experience of me as blaming and 
punishing. And third, the dynamic interaction between the 
overt and the covert aspects of our therapeutic transactions 
continuously exerted field-regulatory influence that allowed 
the therapy to continue. Too much “yelling and blaming” on 
my part would lead to a quick termination. My patient did not 
seem particularly aware of her bids to get me to back off, in- 
stead insisting she wanted my help with her depression and 
interpersonal difficulties. 

In our opinion, this example highlights the challenges 
ahead for fully developing an integrative interpersonal theory 
of personality. In bridging the interpersonal and the intrapsy- 
chic, there are several limitations to contemporaneous analy- 
sis, three of which we discuss further in the next section. 


Some Comments on Interpersonal Complementarity 
The Locus of Influence 


Safran (1992) is correct in pointing out that interpersonal the- 
ory’s bridge between the overt and the covert requires further 
development. It is possible that many interpersonal situations 
generate undistorted, proximal field-regulatory influences— 
that is, covert experience generally is consistent with overt 
experience and impact messages reflect reasonably accurate 
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encoding of the interpersonal bids proffered by the interac- 
tants. Thus, all goes well, the interpersonal situation is re- 
solved, and the relationship is stable. However, this is clearly 
not always the case, as the previous example suggests. When 
covert experience is inconsistent with the field-regulatory 
bids communicated via overt behavior, it is our contention 
that subjective experience takes precedence. That is, the 
locus of complementarity is internal and covert experience is 
influenced to a greater or lesser degree by enduring tenden- 
cies to elaborate incoming data in particular ways. The qual- 
ities of the individual that give rise to such tendencies have 
yet to be well articulated within interpersonal theory. Inter- 
personal theory can easily accommodate the notion that 
individuals exhibit tendencies to organize their experience 
in certain ways De, they have particular interpersonal 
schemas, expectancies, fantasies, etc.), but there has been 
relatively little consensus on how these tendencies develop 
and how they impact the contemporaneous interpersonal 
situation. 


The Problem of Complementarocentricity 


Complementarocentricity can be defined as the tendency to 
place complementarity at the center of interpersonal theory 
and research. In our opinion, this overemphasis has limited 
the growth of theory. Three examples of complementarocen- 
tricity are as follows: 


1. What does failure to find empirical support for interper- 
sonal complementarity mean? When empirical studies do 
not confirm the existence of complementarity, investiga- 
tors often label it “a failure to statistically support comple- 
mentarity” (e.g., Orford, 1986). Even when empirical 
investigations do find significant results (e.g., Gurtman, 
2001), they are not indicative of 100% lawfulness. In our 
opinion, the answer to our question is that other reciprocal 
interpersonal patterns are also occurring in the interper- 
sonal situation(s) under investigation. 


2. In perhaps the most influential articulation of complemen- 
tarity, Kiesler (1983) defined all reciprocal patterns in 
relation to complementarity. That is, other forms of recip- 
rocal interpersonal patterns are said to take either acom- 
plementary or anticomplementary forms. We wonder if 
this has inadvertently promoted complementarity as a 
more fundamental reciprocal interpersonal pattern than it 
actually should be. 

3. In his encyclopedic review of complementarity theory and 
research, Kiesler (1996) presented 11 propositions to 
define and clarify the nature of, scope, and generizability 


of complementarity, and nine counterpoints to Orford’s 
(1986) famous critique of complementarity. In this work, 
Kiesler summarized important contributions by many in- 
terpersonalists emphasizing situational, personological, 
and intrapsychic moderators of complementarity (e.g., see 
Tracey, 1999), and suggested that significant attention be 
directed toward articulating when and under what condi- 
tions complementarity should and should not be expected 
to occur. Although this is exceptionally important, it 
continues to reflect complementarocentric thinking in that 
what is not recognized is that Kiesler’s (1996) 11 proposi- 
tions, nine counterpoints, and continuing investigation of 
moderators serve to decentralize complementarity as the 
fundamental reciprocal interpersonal pattern by suggest- 
ing that its occurrence is more limited and contextualized. 
For example, consider Proposition 11 regarding “appro- 
priate situational parameters” from Kiesler (1996): “The 
condition of complementarity is likely to obtain and be 
maintained in a dyadic relationship only if the following 
conditions are operative: a) the two participants are peers, 
b) are of the same gender, c) the setting is unstructured, 
and d) the situation is reactive (the possibility of recipro- 
cal influence exists)” (p. 104). Considered alone, comple- 
mentarity is thus suggested to be most applicable to 
understanding the unstructured interactions of same-sex 
peers. This is certainly important, but is perhaps not the 
core phenomenon of interest for a comprehensive theory 
of personality. 


The Problem of Motivation 


The two core theoretical assertions associated with interper- 
sonal complementarity are Sullivan’s theorem of reciprocal 
emotion and Leary’s principle of reciprocal interpersonal re- 
lations. With regard to the former, we suggest that interper- 
sonal theorists have overemphasized Sullivan’s first point 
(i.e., complementary needs are resolved or aggravated) and 
underemphasized his second point (i.e., reciprocal patterns of 
activity are developed or disintegrated). It is important to 
note that the needs involved are left undefined, and that the 
nature of satisfaction in the Sullivanian system involves a 
global sense of felt security marked by the absence of 
anxiety. Leary’s principle provided an important extension in 
its emphasis on interpersonal influence and reinforcement 
that shapes the nature of ongoing interpersonal situations. 
But to what end? What is behavior’s purpose? Traditionally, 
the cornerstone of complementarity has been the assertion 
that behavior is enacted to invite self-confirming reciprocal 
responses from others. We believe this has also been overem- 
phasized in the interpersonal literature. 


We agree that reciprocal interpersonal influence, reinforce- 
ment, and gratification are central to understanding human 
personality. This is reflected in the large number of psy- 
chological concepts that in some way reflect the notion of 
reciprocity. That is, individuals develop some consistently 
sought-after relational patterns and some strategies for 
achieving them. However, we do not believe that a single 
superordinate motive such as self-confirmation will succeed 
in comprehensively explaining how personality develops and 
is expressed. 


Summary 


Our discussion of interpersonal reciprocity and transaction 
has highlighted many of the unique strengths of interper- 
sonal theory, as well as areas in which significant develop- 
ment and synthesis are necessary. In our view, interpersonal 
theory emphasizes relational functioning in understanding 
personality; this emphasis has led to the development of 
well-validated structural models that provide anchors to sys- 
tematically describe interpersonal behavior and the patterned 
regularity of human transaction. Interpersonal theory has 
also emphasized field-regulatory aspects of personality in 
addition to the more traditional drive, self, and affect- 
regulatory foci of most theories of personality. The combi- 
nation of descriptive structural models and clear focus on the 
interpersonal situation provides a rich nomological net that 
has had a significant impact in psychology, particularly with 
regard to the classification of personological and psycho- 
pathological taxa and the contemporaneous analysis of 
human transactions and relationships. However, we also feel 
that the future of interpersonal theory will require continuing 
efforts to address (a) the intrapsychic or covert structures 
and processes involved in human transaction, (b) the 
overemphasis on complementarity as the fundamental recip- 
rocal interpersonal pattern in human relationship, (c) the 
overemphasis on self-confirmation as the fundamental mo- 
tive of interpersonal behavior, and (d) the lack of a compre- 
hensive developmental theory to complement its strength in 
contemporaneous analysis. 


THE FUTURE OF INTERPERSONAL THEORY 


We believe the future of interpersonal theory is bright. 
Addressing the four major issues previously noted will re- 
quire interpersonal theorists to continue efforts at integrating 
interpersonal theory’s nomological net with the wisdom con- 
tained in the cognitive, psychodynamic, and attachment liter- 
ature. Fortunately, this is already beginning to take place. 
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Benjamin (1993, 1995, 1996a, 1996b) has initiated this with 
her interpersonal “gift of love” theory that integrates the 
descriptive precision of the SASB model with intrapsychic, 
motivational, and developmental concepts informed by at- 
tachment, cognitive, and object-relations theories. 


Interpersonal Theory and Mental Representation 


We have previously asked the question Where are interper- 
sonal situations to be found? Our answer is that they are 
found both in the proximal relating of two persons and also in 
the minds of individuals. There are now converging lit- 
eratures that suggest mental representations of self and other 
are central structures of personality that significantly af- 
fect perception, emotion, cognition, and behavior (Blatt, 
Auerbach, & Levy, 1997). Attachment theory refers to these 
as internal working models (Bowlby, 1969; Main, Kaplan, & 
Cassidy, 1985), object-relations theory refers to these as 
internal object relations (Kernberg, 1976), and cognitive the- 
ory refers to these as interpersonal schemas (Safran, 1990a). 
Notably, theorists from each persuasion have observed the 
convergence in these concepts (Blatt & Maroudas, 1992; 
Bretherton & Munholland, 1999; Collins & Read, 1994; 
Diamond & Blatt, 1994; Fonagy, 1999; Safran & Segal, 
1990; Westen, 1992). Benjamin (1993, 1996a, 1996b) has 
also proposed that mental representations of self and other 
are central to the intrapsychic interpersonal situation. She 
refers to these as important people or their internalized rep- 
resentations, or IPIRs. Thus, whether referred to as internal 
working models, internal object relations, interpersonal 
schemas, or IPIRs, psychological theory has converged in 
identifying mental representations of self and other as basic 
structures of personality. 

In our opinion, the fundamental advantage of integrating 
conceptions of dyadic mental representation into interper- 
sonal theory is the ability to import the interpersonal field 
(Wiggins & Trobst, 1999) into the intrapsychic world of the 
interactants (Heck & Pincus, 2001). What we are suggesting 
is that an interpersonal situation can be composed of a proxi- 
mal interpersonal field in which overt behavior serves impor- 
tant communicative and regulatory functions, as well as an 
internal interpersonal field that gives rise to enduring individ- 
ual differences in covert experience through the elaboration 
of interpersonal input. 

In addition, Benjamin’s conception of IPIRs retains inter- 
personal theory’s advantage of descriptive precision based on 
the SASB model (Pincus et al., 1999). Benjamin (1993, 
1996a, 1996b) proposes that the same reciprocal patterns that 
describe the interactions of actual dyads may be used to 
describe internalized relationships (mental representations 
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of self and other) on the common metric articulated by the 
SASB model (see also Henry, 1997). In our view, this adds 
explanatory power for interpersonal theory to account for 
individuals’ enduring tendencies to organize interpersonal 
information in particular ways. Although the concept of 
the impact message is extremely useful in identifying the 
classes of covert cognitive, affective, and behavioral experi- 
ences of individuals, it does not necessarily account for the 
nature of individual differences in covert experiences. Ben- 
jamin’s IPIRs provide a way to account for the unique and 
enduring organizational tendencies that people bring to inter- 
personal situations—experiences that may underlie their 
covert feelings, impulses, interpretations, and fantasies in re- 
lation to others. Interpersonal theory proposes that overt be- 
havior is mediated by covert processes. Psychodynamic, 
attachment, and cognitive theories converge with this asser- 
tion, and they suggest that dyadic mental representations are 
key influences on the subjective elaboration of interpersonal 
input. In our opinion, Benjamin has advanced interpersonal 
theory by incorporating mental representations explicitly into 
the conception of the interpersonal situation. 

Returning briefly to our clinical example, recall that the 
patient consistently came into therapy reporting disappoint- 
ments in her interpersonal relations. In telling her sad stories, 
she communicated her need to be consoled and nurtured. 
When she was asked to reflect on her own contributions to 
her disappointments, she became sullen and withdrawn. This 
reaction was a bid at negotiation, communicating a threat to 
leave in an effort to reestablish a reciprocal pattern of satisfy- 
ing responses from her therapist. Why was this happening, 
given that the therapist attempted to provide recognition and 
consolation of her hurt feelings? Despite good therapeutic in- 
tentions, efforts to focus her attention on her own patterns 
seemed unhelpful. There was a clue in her report of her sub- 
jective covert experience. When the therapist turned the 
focus toward the patient’s contributions to her relational dif- 
ficulties, he was experienced as similar to her mother. The 
proximal interpersonal field was no longer the primary 
source of her experience. There was now a second, parataxic 
integration of the situation that led to a covert experience that 
was driven by previous lived interpersonal experiences that 
now influenced the patient’s subjective experience; this be- 
came the primary mediating influence on her overt behavior. 
Despite her requests for help and consistent attendance in 
therapy, the patient was having difficulty organizing her 
experience of the therapist independently of her maternal 
IPIR. In our view, this example demonstrates that noncom- 
plementary reciprocal interpersonal responses in the proxi- 
mal interpersonal field may indicate significantly divergent 
experiences within the internal interpersonal field that can 


best be described by integrating interpersonal theory’s struc- 
tural models with concepts of mental representation. 


Development and Motivation 


Adding conceptions of dyadic mental representation is not 
sufficient for a comprehensive interpersonal theory of per- 
sonality. Sullivan (1964), Stern (1988), and others have sug- 
gested that the contents of the mind are in some way the 
elaborated products of lived interpersonal experience. A 
comprehensive interpersonal theory must account for how 
lived interpersonal experience is associated with the develop- 
ment of mental representation. In our opinion, Benjamin has 
provided the only comprehensive developmental approach to 
evolve from interpersonal theory. 

Using SASB as the descriptive anchor (Figure 9.2), 
Benjamin (1993, 1996a, 1996b) has proposed three develop- 
mental copy processes that describe the ways in which early 
interpersonal experiences are internalized. The first is identi- 
fication, which is defined as treating others as one has been 
treated; this is associated with the transitive SASB surface. 
To the extent that individuals strongly identify with early 
caretakers (typically parents), there will be a tendency to act 
toward others in ways that copy how important others have 
acted toward the developing person. The second copy 
process is recapitulation, which is defined as maintaining a 
position complementary to an IPIR; this is associated with 
the intransitive SASB surface and can be described as react- 
ing as if the IPIR were still there. The third copy process is 
introjection, which is defined as treating the self as one has 
been treated. This is associated with the introject SASB sur- 
face and is related to Sullivan’s conceptions of “reflected 
appraisals” as a source of self-personification. 

Identification, recapitulation, and introjection are not in- 
compatible with Kiesler’s conception of covert impact mes- 
sages. In fact, we suggest that the proposed copy processes 
can help account for individual differences in covert experi- 
ence by providing developmental hypotheses regarding the 
origins of a person’s enduring tendencies to experience par- 
ticular feelings, impulses, cognitions, and fantasies in inter- 
personal situations. For the patient described earlier, it seems 
that her experience of the therapist as yelling and blaming re- 
flects (in part) recapitulation of her relationship with her 
mother. This in turn leads to a parataxic distortion of the 
proximal interpersonal field in therapy and noncomplemen- 
tary overt behavior. 

Although the copy processes help to describe possible 
pathways in which past interpersonal experience is internal- 
ized into mental structures (IPIRs), it is still insufficient to ex- 
plain why early IPIRs remain so influential. The answer to 


this question requires a discussion of motivation. Whereas 
Sullivan’s legacy has led many interpersonal theorists to 
posit self-confirmation as the core motive underlying human 
transaction, Benjamin (1993) proposed a fundamental shift 
toward the establishment of attachment as the fundamental 
interpersonal motivation. In doing so, she has provided one 
mechanism to account for the enduring influence of early ex- 
perience on mental representation and interpersonal behav- 
ior. Although a complete description of attachment theory is 
beyond the scope of the present chapter, we agree that attach- 
ment to proximal caregivers in the early years of life is both 
an evolutionary imperative (e.g., Belsky, 1999; Bowlby, 
1969; Simpson, 1999) and a primary organizing influence on 
early mental representation (Beebe & Lachmann, 1988a, 
1988b; Bowlby, 1980; Stern, 1985). 

Infants and toddlers must form attachments to caregivers 
in order to survive. Benjamin has suggested that the nature of 
the early interpersonal environment will dictate what must 
be done to establish attachments. These early attachment 
relationships can be described using the SASB model’s de- 
scriptive taxa, predictive principles, and copy processes. The 
primacy of relationships to IPIRs is thus associated with the 
need to maintain attachment to them even when not immedi- 
ately present. Benjamin (1993) refers to this as maintaining 
“psychic proximity” to IPIRs. The need to maintain psychic 
proximity is organized around wishes for love and connect- 
edness (secure attachment or AG on the SASB model), as 
well as fears of rejection and loss of love (disrupted attach- 
ment or DAG on the SASB model). The primacy of early 
attachment patterns and mental representations influencing 
current experience is consistent with psychodynamic and at- 
tachment theories. Bowlby (1980) suggested that internal 
working models act conservatively; thus, assimilation of new 
experience into established schemas is typical (see also Stern, 
1988). Benjamin (1996a) suggested that “psychic proximity 
fulfills the organizing wish to receive love from the IPIR ... 
acting like the IPIR, acting like the IPIR were present, or 
treating the self as would the IPIR can bring about psychic 
proximity” (p. 189). 

Returning again to the patient described earlier, it was 
clear that she was ambivalently but strongly attached to her 
mother. She consistently experienced blame any time she 
attempted to convey interpersonal disappointments or bad 
feelings. Anything that disrupted her mother’s sense of con- 
trol over the world was met with the accusation that the pa- 
tient was being selfish and immature—and that it was the 
patient’s fault, so her feelings were not valid. In addition, she 
was told that if she didn’t stop causing so much trouble, her 
parents might divorce. It became clear that the patient had 
internalized a critical maternal IPIR. Whenever the patient 
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was asked about her experience of self, she would inevitably 
begin her response with “My mother says that am...” or 
“My mother says it’s bad for me to feel this way.” When the 
therapist would try to explore the patient’s contributions to 
her interpersonal difficulties, it evoked recapitulation. De- 
spite affirming and affliative efforts on the part of the thera- 
pist, the patient had a difficult time accommodating the new 
interpersonal input; instead she covertly experienced psychic 
proximity to the critical maternal IPIR and responded in kind. 
She experienced the therapeutic interpersonal situation as if 
the maternal IPIR were present, and she needed to back down 
rather than own her disappointments. To do otherwise would 
risk her attachment to her mother, painful as it was. 


Concluding Propositions 


Benjamin’s developmental and motivational extensions of 
interpersonal theory provide some of the richest advances to 
date. We see her work, along with Kiesler’s recent integration 
of emotion theory into the interpersonal transaction cycle, as 
solid evidence that interpersonal theory as originally con- 
ceived of by Sullivan has a vital and promising future as a 
fundamental and integrative approach to personality. In this 
vein we would like to close this chapter with a further exten- 
sion of these contemporary works. 

Interpersonal theorists are interested in understanding 
why certain reciprocal interpersonal patterns become promi- 
nent for an individual. Benjamin has made an important start 
by suggesting that a basic human motivation is attachment 
and that the interpersonal behaviors and reciprocal interper- 
sonal patterns (described by interpersonal theory’s unique 
structural models) that help achieve attachment become fun- 
damental to personality through internalization of relation- 
ships (characterized by the copy processes). She posits that 
the wish for attachment and the fear of its loss are universal, 
and that positive early environments lead to secure attach- 
ments and normal behavior (i.e., AG). If the developing 
person is faced with achieving attachment in a toxic early en- 
vironment, behavior will be abnormal (DAG), but will de- 
velop in the service of attachment needs and be maintained 
via internalization. 

We would like to extend this further in an effort to gener- 
ate an interpersonal theory of personality that more broadly 
addresses issues of basic human motivation. It is our con- 
tention that the maturational trajectory of human life allows 
us to conceptualize many developmentally salient motives 
that may function to mediate and moderate current interper- 
sonal experience. That is, reciprocal interpersonal patterns 
develop in concert with emerging motives that take develop- 
mental priority, thus expanding the goals that underlie their 
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formation and maintenance. We can posit core issues likely to 
elicit the activation of central reciprocal patterns and their 
associated IPIRs, potential developmental deficits associated 
with early experiences, and unresolved conflicts that continue 
to influence the subjective experience of self and others. The 
output of such intrapsychic structures and processes for indi- 
viduals are those consistently sought-after relational patterns 
and their typical strategies for achieving them (i.e., proximal 
and internal field regulation). These become the basis for 
the recurrent interpersonal situations that characterize a 
human life. 

It is our view that what catalyzes and reinforces identifica- 
tion, recapitulation, and introjection is the organizing power 
of developmental achievements and traumatic stressors. Al- 
though interpersonalists have discussed differential “evoking 
power” of behavior due to situational constraints and the 
quality of interactions (i.e., moderators of complementarity), 
we believe such evoking power is limited in comparison to 
the catalyzing effects of major personality developments and 
their underlying motivational influences. At different points 
in personality development, certain motives become a prior- 
ity. Perhaps initially the formation of attachment bonds and 
security are primary motivations; but later, separation-indi- 
viduation, self-esteem, mastery of unresolved conflicts, and 
identity formation may become priorities (see Table 9.2). If 
we are to understand the reciprocity seeking, field-regulatory 
strategies individuals employ, we must learn what interper- 
sonal behaviors and patterns were required to achieve partic- 
ular developmental milestones. In this way, we see that what 
satisfies a need or achieves an important goal for a given in- 
dividual is strongly influenced by his or her developmental 
history. In addition to developmental achievements, trau- 
matic learning may also catalyze the internalization of pat- 
terns associated with coping responses to early loss of an 
attachment figure, severe physical illness in childhood, sex- 
ual or physical abuse, and so on. 

Integrating the developmental and traumatic catalysts 
for internalization of reciprocal interpersonal patterns allows 


TABLE 9.2 Some Possible Catalysts of Internalization 


Developmental Achievements Traumatic Learning 


Attachment 

Security 
Separation-individuation 
Positive affects 

Gender identity 

Resolution of Oedipal issues 
Self-esteem 

Self-confirmation 

Mastery of unresolved conflicts 
Identity formation 


Early loss of attachment figure 
Childhood illness or injury 
Physical abuse 

Sexual abuse 


for greater understanding of current behavior. If individuals 
have the goal of individuating the self in the context of a cur- 
rent relationship in which they feel too enmeshed, they are 
likely to employ strategies that have been successful in the 
past. Some individuals have internalized hostile forms of dif- 
ferentiation such as walling off, whereas others have inter- 
nalized friendly forms of differentiation such as asserting 
their opinions in an affiliative manner. The overt behavior of 
the other is most influential as it activates a person’s ex- 
pectancies, wishes, and fears associated with current goals, 
needs, and motives; this will significantly influence their 
covert experience of impact messages. In our opinion, the 
most important goals, needs, and motives of individuals are 
those that are central to personality development. 

A brief example highlights this point and provides some 
clues as to why individuals may repeat maladaptive interper- 
sonal behaviors over and over. Another psychotherapy pa- 
tient treated by the first author was severely sexually and 
emotionally abused by multiple family members while she 
was growing up. The predictive principle of opposition to 
what she experienced as a child characterized her transitive 
actions towards others in the present. In all dealings with 
others she was hyper-loving and hyper-protective, even when 
clearly to her detriment. She compulsively exhibited such 
behaviors, even when treated badly by others. In therapy, 
it became clear that she counteridentified with her perpetra- 
tors and chronically exhibited the opposite pattern in order to 
maintain a conscious sense of individuation. It was as if she 
were saying, “If I allow myself to become even the slightest 
bit angry or blaming, it will escalate and UU be just like those 
who hurt me in the past.” Unfortunately, although she could 
shed tears for the victims of the holocaust and the victims of 
the recent epidemic of school shootings, she could not do so 
for herself. She had also introjected her early treatment 
within the family and continued to self-injure and ignore her 
own needs and basic human rights. Thus, although she con- 
sciously behaved in ways that individuated her from her 
abusers, she also abused and neglected herself in ways that 
unconsciously maintained attachment to her abusive IPIRs 
(see Table 9.3). 

We end this chapter with a bit of speculation. A broader tax- 
onomy of reciprocal interpersonal patterns such as SASB pre- 
dictive principles and copy processes, combined with a theory 
of personality development and motivation, can be the basis 
for understanding both personality and its pathology. Obvi- 
ously this approach could take many forms. From the contem- 
porary interpersonal perspective developed in this chapter, a 
basic approach would be an open system with consideration 
of IPIR-Goal linkages associated with fundamental develop- 
mental achievements and traumatic learning. We could also 
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TABLE 9.3 Interpersonal Analysis of Ms. Wis Behavior 


Motive, Copy Predictive Overt 
IPIRs Goal Process Principle Behavior 
Brothers & Individuation (Counter-) Opposition Hyper-loving & 
mother identification protective 
Brothers & Attachment, Introjection Introjection Self-attacking, 
mother psychic proximity self-neglecting 


consider individual differences in the influence of certain 
copy processes, such that personalities are classified as highly 
recapitulating, highly introjective, and so on. Similarly, we 
could consider individual differences in the tendency to enact 
certain reciprocal interpersonal patterns, such that personali- 
ties are differentiated by their tendencies to exhibit opposi- 
tional, complementary, antithetical, similar, or introjected 
behaviors. Although these final thoughts are purely specula- 
tive, we wish to emphasize our hope that the ideas presented 
throughout this chapter provide the interpersonal foundations 
for an integrative theory of personality. 
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Operations reshape concepts. Over the past decades, the 
very concept of personality has been subject to implicit re- 
definition through a set of operations labeled the Big Five 
taxonomy or the five-factor model of personality. In a re- 
stricted sense, the number five refers to the finding that most 
of the replicable variance of trait-descriptive adjectives in 
some Western languages is caught by five principal compo- 
nents whose varimax rotations are named extraversion, 
agreeableness, conscientiousness, emotional stability, and 
intellect (or openness to experience, autonomy, imagination, 
and so on, depending on operational variations). In a wider 
sense, however, the five-dimensional (5-D) approach has 
come to represent no less than a paradigm—in particular, a 
revival of the individual-differences or trait conception of 
personality. For an evaluation of its status and future per- 
spectives, a systematic analysis of its operational credentials 
is in order. 


The author is greatly indebted to Lewis R. Goldberg, Gerard 
Saucier, and Jos M. F. Ten Berge for their incisive comments on a 
draft of this chapter. 
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A first module of the set of operations that constitute the 
5-D paradigm consists of the questionnaire construction of 
personality, whereby someone’s personality is defined through 
his or her own answers, or more exceptionally through the 
answers given by third persons, to standardized questions. 
The questionnaire approach is not confined to the 5-D tradi- 
tion, but it has to a significant extent been taken over by that 
paradigm (the megamerger impressing some as monopolis- 
tic). Is there a viable alternative to the questionnaire method, 
and if so, would it change our view of personality? 

A second, more specific, operational module contains 
ways of choosing personality descriptors. The general guid- 
ing principle in this module is the lexical approach that 
consists of selecting items from a corpus of language, par- 
ticularly a dictionary of that language. The distinguishing 
characteristic of the lexical approach is its purposely induc- 
tive nature, in contrast to approaches in which the descriptor 
base is deduced from particular trait constructs, for example, 
neuroticism. Again, the leading question is about the impact 
of these operations on our conception of personality. 

A third operational characteristic consists of reliance on 
the linear model, particularly, principal component analysis 
(PCA) of Likert item scales. This is probably the most 
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constitutive operation of the paradigm, if only because the 
number of five dimensions is intimately connected with it. 
The merits or demerits of PCA as such (if that problem makes 
sense at all) are not in order here. Clearly, however, other 
methods—notably, methods advocated under the label of 
person-oriented approach— yield concepts of personality that 
differ from the 5-D trait paradigm. 

A fourth set of operations contains models for structur- 
ing, interpreting, and communicating trait information. The 
major rivals are the hierarchical and the circumplex models 
of personality structure. Their common point of departure 
is simple structure. On the one hand, simple structure is a 
primitive case of the circumplex in that trait variables are 
assigned to the factor on which they load highest, thus, to 
circle segments that are 90 deg wide with the factor poles 
as bisectrices. On the other, simple structure may be 
viewed as a primitive case of hierarchical structure con- 
taining two levels: factors at the top and trait variables at 
the bottom. But from there on, ways separate. I judge 
structure models by their capacity to produce clear and 
communicable trait concepts; their underlying mechanics, 
however, should be allowed to be intricate and may stretch 
the mind. 

After discussing the structure models that have been pro- 
posed or implied in the 5-D context, I conclude with sketch- 
ing a family of models that may serve as a base for capturing 
personality structure. It consists of a hierarchy of generalized 
semicircumplexes, with one general p component of person- 
ality at the top, and including two-dimensional circumplex, 
giant three, 5-D, and other dimensional structures. The joint 
structure responds to the greatest challenge in personality as- 
sessment, which is to deal with its dominating evaluative 
component in a realistic manner. 


CONSTRUCTING PERSONALITY 
THROUGH QUESTIONNAIRES 


Under the 5-D paradigm, what does it mean to say that a per- 
son is extraverted? In the typical case, it means that this in- 
dividual has given answers to a number of standard 
questions regarding himself or herself and that these answers 
have been summarized into a score under the hopefully 
adequate label of extraversion—rather than, for example, 
surgency or sociability, which are related but not the same. 
This is not to suggest that aready alternative to the question- 
naire approach is available; rather, it functions as a tacit pre- 
supposition in trait psychology taken generally. However, 
there is an obvious alternative to the individual himself 


or herself as a responder, namely, others who know the per- 
son well. 


The Hegemony of Questionnaires 


The association between personality and questionnaires is 
not merely a matter of fashion or a historical coincidence. To 
assess someone’s personality, we have to ask questions about 
it—to the person himself or herself, to third parties who know 
the person well, to expert observers. Between the investigator 
or practitioner on the one hand and the person on the other, 
there is an indispensable assessor. So-called behavior obser- 
vations, for example, are not objective in the way they would 
be if behavior recordings were translated into a score without 
the intervention of an observer; they represent answers to 
questions put to a human assessor. Moving from asking ques- 
tions to applying a questionnaire is a small step: A systematic 
approach to personality requires standard questions, and thus 
a questionnaire. Using an unstructured interview, for exam- 
ple, means obtaining answers to an imperfectly standardized 
set of questions. 

One seeming exception is self-report, in which person and 
assessor coincide. Failure to distinguish between the two 
roles, however, would amount to denying that the assessor 
could be someone else, thereby abandoning personality as an 
intersubjective phenomenon. Another more interesting ap- 
parent exception to questionnaire use is expert clinical diag- 
nosis, in which practitioner and assessor coincide. In the first 
place, however, that process may be reconstructed in part as 
giving answers to more or less standardized questions about 
the person that the diagnostician has learned to ask to himself 
or herself. Second and more fundamental, the diagnostician 
could have been another individual. By virtue of that 
exchangeability, a case can be made for maximizing the 
intersubjective character of diagnoses. Actually using a stan- 
dardized set of questions (e.g., a personality questionnaire 
phrased in the third person singular) to guide and articulate 
one’s diagnostic impressions would contribute to that end. 
This is not to deny the heuristic element in clinical diagnosis, 
or in any other applied setting, but to document the central 
place of asking questions to third persons in the systematic 
study of personality. 

The reason for the primacy of questionnaires may of 
course be sought in a tendency of students of personality to 
take things easy: There is nothing more convenient than giv- 
ing a self-report questionnaire to a client or applicant. But 
more valid reasons may be brought forward. There is a tension 
between the concepts of “test” and “personality.” Surely, we 
may decide to assess a person’s typical intelligent behavior by 


means of a questionnaire (e.g., Goff & Ackerman, 1992), or 
test the maximal introversion of which he or she is capable 
(see Riemann, 1997), but neither of these crossovers has ap- 
peared to be adequate or promising. Ability and tests of max- 
imal performance, and personality and assessments of typical 
behavior, are associated in a nonarbitrary manner (Hofstee, 
2001). 


Are Questionnaires There to Stay? 


The prime product of the 5-D paradigm consists of question- 
naires, including most notably the Neopersonality incen- 
tives-PI-R and NEO-FFI (Costa & McCrae, 1992), and 
includes many other questionnaires and trait adjective lists; 
the model has thus given a significant boost to the question- 
naire construction of personality. I have argued in brief that 
the relation between personality traits and the questionnaire 
operationalization is intimate. Should one be happy with 
the prospect of such an essentially monomethod definition of 
personality, and if not, can alternatives be foreseen? 

Asking questions to third persons in order to assess per- 
sonality implies a social definition of it. Surely, the field has 
moved beyond the stage at which personality was deemed to 
be merely in the eye of the beholder; cumulative behavior- 
genetic research (see, e.g., chapter by Livesley, Jang, & 
Vernon in this volume; Loehlin, 1992) has put an end to that 
subjective conception of traits. But the dominant conception 
of personality remains social in the sense of intersubjective 
rather than objective. Buss (1996) made a virtue of this need 
by explaining the Big Five as elementary social mechanisms; 
for example, Factor III represents the need of the perceiver to 
know whether the other person can be depended upon. Most 
students of personality, however, would have hesitations with 
this subordination of personality to social psychology, espe- 
cially if that bondage is a side effect of a dominant opera- 
tional approach. 

The scientific emancipation of a subjective or intersubjec- 
tive concept appears to hinge upon the discovery of objective 
indicators that cover the concept well. If we wish to establish 
how much of a fever we run, we do not use a Likert scale but 
measure it with a thermometer. If we want to gauge an appli- 
cant’s intelligence, we apply a test rather than asking ques- 
tions to the applicant or even to a number of third persons. If 
the latter example is more problematic than the first, that is be- 
cause there may be doubt regarding the coverage of the con- 
cept of intelligence by an IQ score. In the same vein, one may 
have doubts about the thermometer scale as a measure of out- 
door temperature and prefer a formula that includes sunshine, 
humidity, and wind force. But once a certain level of coverage 
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is secured, a return to sheer subjectivity would count as re- 
gressive. Are adequate objective indicators of personality 
traits in sight? 

Probably the most promising indicators of personality are 
genes. According to estimates based on behavior-genetic 
research, genetic patterns will be capable of covering some 
40% of the trait variance. That degree of coverage is not 
enough; we would not accept a thermometer that is only 40% 
valid. But before discarding the prospect, one should realize 
that the figure of .4 is heavily attenuated. An indicator need 
not and should not predict the error components in subjective 
assessments of temperature or extraversion. Heredity coeffi- 
cients in the order of .4 should thus be divided by an estimate 
of the proportion of valid variance in questionnaire scores. 

The first source of error in the self-reports that have almost 
invariably been used in behavior-genetic studies of personal- 
ity is lack of agreement between assessors. The highest 
agreement coefficients between self and other in assessing 
personality (Hendriks, 1997; McCrae & Costa, 1987) are in 
the order of .7. Unless it is assumed that self is a systemati- 
cally better assessor than other or vice versa, that figure may 
be taken as an estimate of the rater reliability of a single re- 
spondent, and some 30% of the questionnaire variance is 
rater error. Second, some 20% of the variance results from 
lack of internal consistency of the questionnaire scale, as- 
suming alpha reliabilities in the order of .8; and third, a com- 
parable error component results from temporal instability. 
Taking all these independent sources of error into account, 
one is left wondering how the heredity coefficients can reach 
4 at all (Hofstee, 1994a). 

The ironic conclusion from this crude analysis of error 
components in questionnaire variance is that the perspective 
of molecular-genetic diagnosis of personality traits cannot at 
all be discarded: It may well appear that whatever valid vari- 
ance remains in questionnaire data can be accounted for to a 
satisfactory extent by genetic configurations. However, the 
analysis also points to the conditions for such a development. 
To establish links between genes and phenotypic personality 
traits, the assessment of the latter will have to be much more 
valid than it has been up to now (see also Bouchard, 1993). 
The central element of that program is discussed in the next 
paragraphs. Another aspect—optimizing the internal consis- 
tency of questionnaire data—is treated in the section on the 
linear approach to personality. 


Definitions of Personality by Self and Others 


Self-report fosters a conception of personality whereby the 
individual knows best how he or she is. With self-report 
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questionnaires, the situation is more complicated. Standard- 
ized questions aim at comparing personalities rather than 
capturing unique and emergent characteristics. McAdams’s 
(1992) criticism of the Big Five approach as a psychology of 
the stranger is correct in that sense (although other phrasings 
might be preferred if the value of scientific objectivity is 
stressed); it would be even more correct if the emphasis in 
Big Five research were on other-report rather than self- 
report. Self-report questionnaires embody a discordant blend 
of subjective and intersubjective accents. 

In preparing an earlier (Hofstee, 1994a) paper on the 
topic, I met with unexpectedly ardent arguments in favor of 
self-report from prominent American Big Five researchers, 
the essence of which is documented in that paper. One argu- 
ment pertained to personal secrets, whose content, however, 
would not be central to personality in most definitions. (A 
person might be said to be secretive, but that trait hardly even 
makes sense from the person’s own point of view.) Another 
argument was that a person might sit in a corner over a large 
number of consecutive parties but still consider himself or 
herself to be extraverted, which would be all that counts. In 
practice, however, most witnesses would start worrying 
whether that person were still in contact with reality (which is 
again different from the question about introversion or extra- 
version). In the abstract, actors are at liberty to entertain a 
subjective definition of personality, but in real life it does not 
carry them very far. The intersubjective viewpoint is not 
merely a matter of scientific style; it is in touch with what 
people think of personality. 

If the intersubjective viewpoint is accepted as a proper per- 
spective on personality and if idiosyncrasies in self-report are 
seen as a source of error among other sources of error, the con- 
sequence for personality research and practice is as straight- 
forward as it is revolutionary: Multiple assessors are needed to 
achieve acceptable reliability and validity; self-reports, being 
single by definition, are inevitably deficient. Self is of course 
acceptable as an assessor among others; self-ratings might 
even contribute more to the common variance than others’ rat- 
ings do. But in any case, the road toward an eventual objective, 
genetic diagnosis of personality, will have to be paved with 
multiple assessors; good intentions will not be enough. 


THE FUTURE OF THE 
FIVE-DIMENSIONAL MODEL 


Will genetic fingerprinting in due time describe personality in 
terms of extraversion, agreeableness, conscientiousness, 
emotional stability, and some version of Factor V? In other 
words, will the 5-D model survive the developments that 


most readers may expect to witness in their lifetimes 
(whether they like it or not)? At the moment of writing this, 
the answer can hardly be unequivocal; even the question may 
appear to need rephrasing. 

In an extensive reanalysis of several data sets, Saucier 
(2002a) found a three-dimensional structure containing 
agreeableness, conscientiousness, and extraversion to be 
more replicable across samples than a 5-D structure, espe- 
cially in peer ratings, which in the present reasoning are more 
germane than self-ratings. So we might end up with a subset. 
Using a comparable three-dimensional solution, Krueger 
(2000) showed that the additive-genetic structure underlying 
the Multidimensional Personality Questionnaire (Tellegen, 
1982) corresponded closely to the phenotypic structure. 
On the other hand, Jang, McCrae, Angleitner, Riemann, & 
Livesly (1998) demonstrated that specific factors beyond the 
first five have nonzero heritability coefficients. 

Even supposing reliable and valid assessments of pheno- 
typic personality traits, a routine search for indicators of, for 
example, conscientiousness would require enormous samples 
just for tracing additive polygenetic effects; for interactions, 
the required sizes would rise exponentially (for a discussion 
of strategies of molecular-genetic research on personality, see 
Plomin & Caspi, 1998). At the turn of the century, attempts to 
trace genetic polymorphisms that explain personality showed 
the familiar picture of high initial expectations followed by 
failing replications (e.g., Herbst, Zonderman, McCrae, & 
Costa, 2000). According to a possibly more feasible scenario, 
large principal components of personality traits may be ex- 
pected to reappear as an aggregate result of studies searching 
for single genes to explain specific patterns of deviant behav- 
ior (see, e.g., Brunner, Nelen, Breakfield, Roppers, & Van 
Oost, 1993). Assuming continuity between the range of nor- 
mal behavior and deviant extremes, the aggregate structure of 
a large number of such specific patterns would resemble the 
5-D structure. In the process, such taxonomies of phenotypic 
traits would receive a status comparable to mineralogical 
classifications; the chemistry of individual differences would 
be located at the DNA level. 

Decades ago, Carlson (1971) found that personality was 
spelled in either of two ways: social or clinical. The 
questionnaire conception of personality is arguably social- 
psychometric by its methodological nature. If the genetic 
approach becomes dominant, a clinical reconstruction will 
regain momentum; individual differences within the normal 
range will be seen as mitigations and moderations of person- 
ality defects constituting the chemical elements. Meanwhile, 
an enormous amount of work has to be done, and 5-D ques- 
tionnaires filled out by several third persons and self are 
instrumental in that labor. 


THE LEXICAL BASE OF THE 
FIVE-DIMENSIONAL MODEL 


A basic motive of researchers involved in the 5-D paradigm 
is to give a systematic and comprehensive, or at least repre- 
sentative, account of personality traits. An accompanying 
notion is that the field is characterized by a proliferation 
(John, 1990) of concepts and instruments, which frustrates 
the progress of the science of personality. The signature of 
the 5-D paradigm is empiricist and, in a sense, antitheoreti- 
cal: If theorists, in this context, are individuals bent on dis- 
seminating their idiosyncratic concepts of personality, then 
their collective but uncoordinated action is responsible for a 
chaotic state of affairs in which thousands of unrelated con- 
cepts and their operationalizations form a market rather than 
a science. The 5-D conception is thus a taxonomy intended 
to end all idiosyncratic taxonomies. 

To lift personality out of its chaotic state, an Archimedean 
point was needed. The most obvious candidate for a point of 
departure at the descriptive or phenotypic (Goldberg, 1993b) 
level is the lexicon. Like genetics, it provides a finite set of 
elements on the base of which a taxonomy may be built and 
proliferation may be counteracted. This section contains a 
discussion of the lexical point of departure, its variations, 
and its consequences. An analysis of the different shapes of 
the Factor V and their operational antecedents serves as an 
illustration. 


The Lexical Axiom 


What is usually referred to as the lexical hypothesis is more 
like an axiom. It states that people wish to talk about what- 
ever is important and that the terms in which they talk may be 
found in the lexicon. The first and central part of that state- 
ment is not a hypothesis that is subject to empirical confirma- 
tion or disconfirmation; it introduces a heuristic that may or 
may not appear to be fertile. The second part is definitely 
false as no dictionary is ever complete; however, it is un- 
problematic because most dictionaries contain far more 
words than most people care to use or even understand, and 
hardly if ever omit common terms. 

An objection that is seldom voiced although it is obvious 
enough is that the reverse of the lexical axiom does not nec- 
essarily hold true: People may well be talking about unim- 
portant things most of the time. There is something to be said 
for the idea that the language of normal personality does 
not serve much of a purpose. However, PCA (see the 
next section) capitalizes on redundancies among variables. 
That method thus retroactively introduces a corollary of the 
lexical axiom, namely, that redundancy is indicative of real 
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importance. For playful purposes, we may seek rare and 
sophisticated terms or combinations of terms; at the level 
of common components, however, we mean business. Of 
course, this corollary, in its turn, may or may not be judged 
credible. 

A reverse objection is that common language is not subtle 
enough for scientific purposes. One may philosophize at 
length about this proposition, which is as metaphysical as the 
lexical axiom itself. The historic rebuttal, however, was 
delivered by Digman (1990; Digman & Inouye, 1986), who 
recovered the Big Five structure in questionnaires, that is, in 
instruments designed by experts. In a similar vein, I (Hofstee, 
1999) asked 40 clinicians to score a prototypical personality 
disorder with which they were familiar on the items of 
the Five-Factor Personality Inventory (FFPI; Hendriks, 
Hofstee, & De Raad, 1999). These items do not contain 
any technical terms or pathological content. Nonetheless, 
very distinct and extreme profiles in 5-D terms resulted, 
again indicating that expert categories may be well repre- 
sented by ordinary language. 

In principle, the lexical approach both reflects and fosters 
a lay definition of personality; in practice, however, the 
effect seems to be slight. Thus, at low conceptual costs 5-D 
research has succeeded in bringing a considerable measure of 
order to the anarchy of phenotypic traits. Any serious investi- 
gator proposing a new trait concept would now be well ad- 
vised to investigate whether it has incremental validity over 
an optimal linear combination of the five factors; existing 
concepts are better understood in that framework. An exam- 
ple is typical intellectual engagement (Goff & Ackerman, 
1992), which appears to be a label for a mixture of Factors V 
and III; another is the familiar concept of sociability, blend- 
ing Factors I and II. As I argue later, there is nothing against 
using dedicated labels for blends if they are distinguished 
from variables that do carry considerable specific variance. 
But even if taken liberally, the five factors represent a taxo- 
nomic breakthrough, part of which may be credited to the 
lexical approach. 


Operationalizations of the Lexical Approach 


There is no unique and cogent operationalization of the lexi- 
cal approach. It pertains to single personality-relevant words, 
under the tacit supposition that words do not interact, so that 
the meaning of any trait combination can be represented by a 
linear function of them. That supposition is patently false in 
the case of oxymora like “amiably inimical” or “quietly exu- 
berant,” joinings of opposite terms whose meaning cannot be 
accounted for in a linear fashion; however, there are reasons 
to be wary of such seductions of literary language. In any 
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case, the search for single words is a defining characteristic of 
the lexical approach. But the question of how to select the 
single words has no straightforward answer; a number of 
decisions must be made. 

A first decision concerns grammatical categories. Most 
investigators, from Galton (1884) on, have concentrated on 
adjectives (for an overview, see De Raad, 2000; Saucier, 
Hampson, & Goldberg, 2000). Goldberg (1982) and De 
Raad (1992) have studied type nouns, alphabetically running 
from ace to zombie in American English, but there is a con- 
sensus that this category does not add much (cf. extraverted 
vs. an extravert) or consists of invectives that have uses 
other than describing personality. A more interesting addition 
to adjectives are personality-descriptive verbs, which run 
from abandon to yield (not counting zap, zip, and zigzag) in 
English, denoting acts that would be more characteristic of 
one person than another. De Raad’s (1992) analyses of per- 
sonality verbs and nouns, however, do not result in novel 
content over the factors found in adjectives. The focus on 
adjectives does not recoil significantly on the implicit defini- 
tion of personality. 

A second set of operations consists of exclusion cate- 
gories, for example, moods (e.g., sad), body characteristics 
(e.g., fat), social relations (e.g., subordinate), attitudes (e.g., 
progressive), and effects (e.g., famous). These exclusions are 
unproblematic because the categories are outside the domain 
of personality traits. Two other categories, however, deserve 
special consideration. One is called mere evaluations (e.g., 
good). In the language of personality, content and evaluation 
are intimately connected: On the one hand, neutral content is 
hard to find; on the other, mere evaluation is equally scarce. 
Tellegen (1993), in particular, has argued against excluding 
this category and has shown that it contains variance over and 
above the five factors (Almagor, Tellegen, & Waller, 1995). 
Thus, the 5-D model entertains a conception of personality 
that is somewhat sterilized with respect to evaluation. 

The other problematic category is one that is invariably 
included, containing adjectives denoting intelligence, capa- 
bilities, talents, erudition, and the like—thus, the kind of 
maximum-performance traits that have traditionally been dis- 
tinguished from typical-behavior traits. This inclusion is not 
an automatic consequence of the lexical approach; Ostendorf 
(1990), for example, sharply distinguished between tempera- 
ment and character on the one hand, and skills and talents on 
the other, before joining the two sets of traits under the head- 
ing of dispositions. One could simply state that the 5-D 
approach has opted for the broader of the two definitions of 
personality, including not only temperamental or stylistic as- 
pects (most notably Factors I, extraversion-introversion, and 
IV, emotional stability vs. neuroticism) and character (most 


notably Factors I, agreeableness, and II, conscientiousness), 
but also intellect, erudition, and the like (Factor V; see 
Hofstee, 1994b). However, I voice some reservations regard- 
ing that inclusive choice when discussing Factor V later. 

A final operation consists of the exclusion of technical, 
highly metaphorical, and otherwise difficult terms. As I 
argued earlier, that procedure is probably not very conse- 
quential with respect to the scientific concept of personality, 
even though the literary loss is considerable. In constructing 
the FFPI, however, Hendriks (1997) went one step further 
and retained only items that were found perfectly compre- 
hensible by students of lower professional education. Of the 
1,045 brief expressions (e.g., Wants to be left alone) that 
made up the pool from which the items were chosen, 34% 
met this criterion. In a set of 195 trait-descriptive adjectives 
carefully selected to cover the factors of the 5-D model, only 
14% did. It is a sobering thought that the founding studies of 
the 5-D model could not have been meaningfully carried out 
with these respondents. Furthermore, this sharpening of the 
comprehensibility criterion does appear to have conse- 
quences for the content of Factor V, as is shown next. 


The Credentials of the Fifth Factor 


The most spectacular vindication of the 5-D model has been 
brought forward by Ostendorf (1990). In the introduction to 
his study, Ostendorf related that he viewed the model with 
great skepticism at first, as the available American studies 
were based on very small samples of trait variables that had 
been composed using very subjective criteria (Ostendorf, 
1990, p. 9). Not only this initial skepticism, but also the fact 
that the replication was completely independent, started from 
scratch, and was carried out in another language, added to the 
credibility of the 5-D model. Ostendorf, however, expressly 
included ability adjectives; consequently, his Factor V is a 
clear intellect factor defined by such terms. 

In our Dutch lexical project, subjects were asked whether 
an adjective would fit in the framing sentence “he/she is 
[adjective] by nature” (cf. Brokken, 1978) in order to deter- 
mine an adjective’s prototypicality as a trait descriptor. Adjec- 
tives like dull, gifted, capable, brilliant, one-sided, idiotic, 
sharp, and ingenious received very low prototypicality ratings 
(along with other categories of terms, most notably social- 
effect adjectives like horrible, commonplace, and captivating). 
In a selection of terms used by De Raad (1992) to establish the 
replicability of the 5-D model in the Dutch language, terms 
with low prototypicality were excluded; consequently, no 
clear fifth factor appeared. In a Dutch-German-American 
comparison (Hofstee, Kiers, De Raad, Goldberg, & Ostendorf, 
1997), the correspondence between the American and German 


structures was higher than the match of either with the Dutch 
structure, especially with regard to Factor V. 

Plagued with feelings of intellectual inferiority, we took 
drastic steps to better our lives. In constructing the item pool 
for the FFPI (Hendriks et al., 1999), we expressly added 266 
intellect items over the 1,045 constructed to cover the five 
Dutch factors (see Hendriks, 1997, p. 19f). However, only 
two of these 266 were judged to be perfectly comprehensible 
by our students of lower professional education, who did not 
connect to words like reflect, analyze, and contemplate. In a 
PCA of the whole item pool, based on responses of more so- 
phisticated subjects, typical intellect items like Thinks ahead, 
Uses his/her brains, Sees through problems, Learns quickly, 
Is well-informed, and their counterparts had sizable sec- 
ondary or even primary loadings on Factor III, conscientious- 
ness; pure markers of V(—) were items like Follows the 
crowd, Copies others, and Does what others do. Conse- 
quently, Factor V(+) was interpreted as autonomy. We were 
thus unsuccessful in our attempts to arrive at an intellect fac- 
tor. The autonomy interpretation of Factor V reappears in 
Italian data (see De Raad, Perugini, & Szirmäk, 1997). 

A powerful competitor—if only by virtue of the wide- 
spread use of the NEO-PI-R (Costa & McCrae, 1992)—to the 
intellect conception of Factor V is its interpretation as open- 
ness to experience. That construct does not come out of the 
lexical approach; in fact, McCrae (1990, 1994) has used it re- 
peatedly to argue the deficiency of that approach. The conse- 
quent problem with such constructs, however, is that they do 
not share the taxonomic status that is awarded by the lexical 
paradigm. Furthermore, many of the NEO items in general, 
and of the openness to experience scales in particular, would 
not pass the comprehensibility test that was outlined earlier. 
Brand (1994) predicted that both intellect and openness to 
experience would correlate substantially with measured intel- 
ligence (g) over the whole intellectual range of the popula- 
tion. A special reason may be that subjects of modest IQ 
would reject such items because they do not understand 
them, and thus receive low scores. 


Distinguishing Personality from Ability 


The 5-D model seems to have contributed to a shifting em- 
phasis from a narrow to a broad conception of personality. 
That shift can hardly be objected to as such. Not only are both 
intelligence and other personality traits stable and psycholog- 
ically relevant, but they also combine with each other. An in- 
telligent extravert may be found eloquent; a dull one may be 
judged to be loudmouthed. In the study on 5-D profiles of 
prototypical personality disorders (Hofstee, 1999) referred to 
earlier, the narcissistic and antisocial profiles were relatively 
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close together, but that must be because the FFPI’s Factor V 
has little to do with intellect: Sizable differences between the 
two would be expected on measured intelligence (Millon, 
personal communication, September 29, 1999). For a proper 
assessment of personality, the inclusion of intelligence is 
indispensable. 

There is no good reason, however, to contaminate typical 
behavior and maximum performance. On the contrary, there 
are good reasons to separate the operations. One is that 
objective measurement of intelligence is more scientific than 
its assessment, however intersubjective that assessment may 
be. Another is that methods are not neutral: Abilities and tests 
of maximum performance are as closely associated as are 
stylistic traits and assessments of typical behavior. To include 
ability items in questionnaires can only obscure the view on 
intelligence. 

With respect to concepts of temperament and character, 
state-of-the-art assessment would include a 5-D question- 
naire as a baseline instrument, and novel concepts would 
have to prove their added value against that background. 
According to the same principle of parsimony, however, 5-D 
factors have to prove their added value over measured intelli- 
gence. Precisely because personality and intelligence belong 
together, objective measures of intelligence should be in- 
cluded in investigating the structure of personality. In view of 
the scientific primacy of intelligence, its variance should be 
partialled out of the questionnaire scores. While in the 
process, attitudinal factors, which are out of bounds in most 
definitions of personality, should be removed in the same 
manner. They, too, are empirically correlated with certain 
versions of Factor V, particularly with openness to experi- 
ence (Saucier, 2002a). With these corrections, it is entirely 
conceivable that little would remain of Factor V. 


THE LINEAR APPROACH TO THE CONCEPT 
OF PERSONALITY 


The “Magical Number Five,” in the words of Goldberg 
(1992b), is intricately connected with applying PCA to large 
numbers of trait variables. Forerunners have been pinpointed, 
most notably Tupes and Christal’s (1961/1992) analyses. 
However, Tupes and Christal’s denomination of the fifth fac- 
tor in terms of culture is now obsolete. On the other hand, if 
the magical number had been found to be six, one could have 
referred to another Cattellian’s (Pawlik, 1968) set consisting 
of I Extraversion, II Cooperativeness, III Deliberate Control, 
IV Emotionality, V Independence of Opinion, and VI 
Gefiihlsbetontheit (which is difficult to translate; the order in 
which the factors appear has been adjusted to the present 
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context). These examples of imperfect historical fit could eas- 
ily be expanded upon. The five factors owe their consolidation 
and impact to analyses of large data matrices that did not be- 
come possible until the last decades of the twentieth century. 
This section starts with setting out the strongest possible 
case for PCA by presenting a classical (see Horst, 1965) 
rationale for it. Next, it examines the grounds for the magical 
number five. It then considers the so-named person-centered 
approach as an alternative to PCA in certain contexts. 


The Case for Principal Component Analysis 


Applying PCA to a scores matrix is the logical consequence 
of performing item analysis. In the general case, the aim of 
item analysis is to maximize the internal consistency of one 
or more scales based on the items; the exception whereby 
items are weighted by their predictive validity is outside the 
present scope. The basic idea of item analysis may be 
expressed as follows: The investigator is aware that each 
single item, carefully chosen as it may be, is an imperfect 
operationalization of whatever construct it represents. But 
the investigator has no better criterion against which to 
gauge the validity of the item than the total score on the set 
of equivalent items. Item analysis is thus a bootstrapping 
operation. 

Carrying this basic idea to its logical consequence pro- 
ceeds as follows: At the first step, items are weighted accord- 
ing to their association with the total score. Discarding items 
on that basis would amount to arbitrarily assigning a zero 
weight. That may be defensible in extreme cases where it is 
evident for substantive reasons—albeit post hoc—that the 
item does not belong in the set. In the general case, however, 
all items would be retained. 

By virtue of assigning weights to the items, however, the 
total score has been replaced by a weighted sum. The implicit 
rationale is that this weighted sum is a better approximation 
of the underlying construct than was the unweighted sum. So 
the logical second step would be to assign item weights ac- 
cording to their association with the weighted sum. Thus an 
iteration procedure has been started, the endpoint of which is 
reached when convergence of weights and of weighted sums 
occurs. At that point, the weighted total score is the first prin- 
cipal component of the item scores (Horst, 1965). If the item 
set is multidimensional, more than one principal component 
is obtained, but the reasoning is essentially the same. 

Thus a particularly strong argument in favor of PCA is 
that it is logically inevitable. Also, since the days of com- 
puter scoring, any practical objections against calculating 
weighted sums have disappeared: Sooner than applying 10 
hand-scoring keys to a 5-D questionnaire (five keys for 


positive items and five for negative items), one would put the 
item scores on electronic file anyway. 


Raw-Scores PCA 


The present argument does not prejudice in favor of PCA as 
it is usually conceived, namely, PCA of z scores or correla- 
tion matrices. Rather, it refers to raw-scores PCA, with devi- 
ation scores and their covariance matrices, or standardized 
scores and their correlations, as special cases. Raw-scores 
PCA should be performed on bipolar scores; for example, 
scores on a five-point scale should be coded as —2, —1, 0, 
+1, and +2: We (Hofstee, 1990; Hofstee & Hendriks, 1998; 
Hofstee, Ten Berge, & Hendriks, 1998) have argued that a 
bipolar representation of personality variables is appropriate, 
as they tend to come in pairs of opposites. Thinking in terms 
of all-positive numbers is a habit imported from the abilities 
and achievement domain, where it does not make sense to 
assign a negative score. 

Raw-scores PCA implies an absolute-scale interpretation 
of the Likert scale, rather than the conventional interval-scale 
interpretation. These alternative interpretations have subtle 
consequences for our conception of personality. The first of 
these concerns the reference point. With relative, interval- 
scale scoring, the population mean is the reference point. For 
desirable traits, that reference point is at the positive side of 
the scale midpoint (0), and vice versa. Thus a person with a 
score of +.8 on a socialness scale with a population mean of 
+1.1 (most people being found social), would be said to be 
somewhat asocial, albeit in a relative sense, which however is 
the only available interpretation when using interval scaling. 
The unthinking adoption of interval scales from the domain 
of intelligence and achievement may lead to a bleak view of 
humankind, whereby a sizable proportion of the population is 
judged more or less deviant. A poor comfort is that the pro- 
portion is a bit less than 50% because the raw-score distribu- 
tion is not symmetric. Taking the scale midpoint seriously 
solves the problem; it prevents a positive judgment from 
being translated into something unfavorable and vice versa, 
based on an inappropriate convention. 

The second way in which absolute and interval scale con- 
ceptions differ concerns spread. Using a five-point scale, 
most items have standard deviations close to 1, as the preva- 
lent responses are —1 and +1; thus the difference between 
absolute and interval scaling is not dramatic in this respect. 
But extremely favorable and unfavorable items obtain 
smaller standard deviations. The effect of standard PCA and 
interval scoring procedures is to increase their impact on the 
total score. It would seem that this is also an unintended con- 
sequence rather than a deliberate effect. 


In sum, item scoring through weights obtained by raw- 
scores PCA deserves more consideration than it has received 
so far. The standard objection to treating scores on a Likert 
scale as absolute is that strong assumptions would be im- 
posed on the data. I am unable to see the validity of that 
argument. So-called weak models may in fact be very strong: 
To assume that the midpoint of a personality scale has no 
meaning and, consequently, that respondents’ evaluations can 
be reversed, is about as strong as hypothesizing, for example, 
that a large proportion of the population cannot be trusted. At 
the very least, the absolute conception of Likert-scale scores 
is no more indefensible than the interval conception. 


A Review of the Grounds for the Number Five 


A way to obtain many principal components is to analyze ma- 
trices with large numbers of variables, in this case single trait 
descriptors. Earlier, limitations on computing capacity virtu- 
ally prevented the number of trait variables from being much 
larger than the 35 employed by Tupes and Christal (1961/ 
1992). With the expanding power of computers, however, it 
became feasible to analyze the very large numbers of vari- 
ables that were needed to justify claims of representativeness 
if not exhaustiveness. However, the sorcerer’s apprentice 
problem then becomes keeping the number of factors from 
getting out of hand. With hundreds of variables, it will take 
many factors to get down to the time-honored “eigenvalue 1” 
threshold; for example, the 20th factor in Ostendorf’s (1990) 
PCA of 430 traits still has an eigenvalue of 3. 

Hofstee et al. (1998) proposed a more stringent criterion 
based on the alpha reliability of principal components, which 
is approximately 1— 1/E with large numbers of variables, E 
being the eigenvalue of that principal component. Setting the 
minimum alpha at .75, an “eigenvalue 4” threshold results 
(E = 1 gives a = 0). Using this criterion, Ostendorf (1990) 
should still have set the dimensionality of the personality 
sphere at about 14 rather than 5; with even larger numbers of 
traits, the dimensionality would only increase. There can be 
no doubt that the 5-D model discards linear composites of 
traits that are of sufficient internal consistency, and would 
add to the number of dimensions. It is of interest to note that 
the most prominent 5-D questionnaire, Costa and McCrae’s 
(1992) NEO-PI-R, in fact postulates 30 dimensions rather 
than 5, as each of the 30 subscales is deemed to have specific 
variance in their hierarchical model. (The five second-order 
factors do not add to the dimensionality, as they are linear 
combinations of six subscales at a time.) 

An entirely valid pragmatic reason to restrict the number 
of factors is parsimony. The first principal component is 
the linear combination of traits that explains a maximum of 
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variance; the second maximizes the explained variance in the 
residual, and so on. Consecutive factors thus follow the law 
of diminishing returns. Next, the scree test acts on the amount 
of drop in eigenvalue between consecutive factors; it thus 
signals points of increasingly diminishing returns. Using the 
scree test, Brokken (1978) retained 6 principal components in 
a set of 1,203 trait adjectives; Ostendorf (1990) retained 5. 
However, the scree test does not offer a unique solution; 
Ostendorf, for example, could have opted for an 8-factor so- 
lution on that basis. Neither PCA nor the scree test dictates 
the number of five. 

Does replicability of factors provide a cogent criterion 
for the dimensionality of the space? That depends on how 
the term is understood. If one and the same large trait list were 
administered to large samples from the same population, the 
number of replicable factors would in all likelihood exceed 
five. At the other extreme, when independent, “emic” replica- 
tions of the lexical approach in different languages are under- 
taken, the number tends to be in the order of three (De Raad 
et al., 1997; Saucier et al., 2000) rather than five, Ostendorf’s 
replication being an exception. Saucier, Hampson, and 
Goldberg list 18 points on which such studies might diverge 
and recommend methodological standardization. A familiar 
objection is that standardization leads to premature closure of 
the issue: Not only would the outcome depend on arbitrary 
choices, but moreover one could not tell anymore what makes 
a difference and what does not. It would be preferable to use 
these points for studies on whether and how the number varies 
in function of differences in approach. 

In sum, the number five takes on the character of a point 
estimate in a Bayesian credibility function on an abscissa that 
runs from 0 to some fairly large number, with the bulk of the 
density stacked up between 3 and 7. As with other empirical 
constants, the uncertainty does not so much result from ran- 
dom error as from the interplay of diverging arguments and 
specifications. In any case, the number should be taken with 
a grain of salt. 


The Person-Centered or Typological Approach 


A familiar critique of trait psychology is that it loses the 
individual from sight (see, e.g., Block, 1995; Magnusson, 
1992). A set of alternative operations is available under labels 
such as type or person-centered approach; it comprises 
Q-sorts in preference to Likert scales, longitudinal designs to 
assess the dynamics of personality, and cluster analysis of 
persons rather than PCA of variables. Recent empirical stud- 
ies (e.g., Asendorpf & Van Aken, 1999; Robins, John, Caspi, 
Moffit, & Stouthamer-Loeber, 1996) concentrate on the three 
“Block” types: resilient, overcontrolled, and undercontrolled. 
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I document the relativeness of the opposition between the 
person-centered and variable-centered paradigms (see also 
Millon, 1990) but try to do justice to a real difference in their 
ranges of application. 


Persons in Principal Component Analysis 


Unlike factor analysis proper, in which factor scores are 
hardly more than an afterthought, PCA offers a fairly sym- 
metric treatment of individuals and variables. One could ro- 
tate a matrix of scores on principal components to simple 
structure and characterize individuals by the person factor on 
which they had their highest score. In an even closer approx- 
imation to the person-centered approach, factors and load- 
ings may be rescaled such that individuals receive loadings 
and variables receive factor scores. That is not precisely the 
same thing as performing Q-factor analysis, as the scores still 
become standardized per variable instead of per individual as 
in Q-analysis; but the two operations would be mathemati- 
cally identical in the case of raw-score PCA. If the argument 
in favor of raw-score PCA is accepted, the difference 
between variable-centered and person-centered analysis re- 
duces to a set of scaling constants and rotation criteria being 
applied to one matrix rather than another, which is hard to get 
excited about. 


Variables and Types 


An orthodox typological solution may be viewed as a binary 
matrix of persons by types with one | per row, representing 
the type to which that person is assigned, and 0 scores in the 
remaining cells. At the other extreme, each person could be 
given a score for each type on a continuous scale, represent- 
ing the extent to which that person corresponds with that 
type, thereby treating the types as continuous variables. The 
orthodox solution could be reconstructed from that matrix by 
selecting the highest score per row and dichotomizing ac- 
cordingly. Intermediate, liberalized typological solutions 
(Millon, 1990) could also be derived, most notably through 
matrix-wise dichotomization of the continuous scores. In the 
liberalized solution, some persons would appear to be as- 
signed to more than one type, whereas some others would fail 
to meet the threshold for any type at all. The types would no 
longer be orthogonal in the way they are forced to be accord- 
ing to the orthodox solution; so one could correlate the types, 
factor analyze them, and the like. 

To those who would find this methodological play with 
types improper, there is a perfectly serious answer. In the ideal 
case, a diagnosis is performed by an infinite number of inde- 
pendent experts. Experts do not agree perfectly in all cases. 


Thus the sum or average of even their orthodox typological 
solutions would give precisely the kind of matrix of continu- 
ous scores introduced in the earlier argument. In a scientific 
(in the sense of intersubjective) conception of types, the con- 
tinuous matrix is the primitive case, not the binary matrix. The 
primitive case arises not because types (or even personality 
variables in general) are necessarily continuous as such, but 
because of the tacit third dimension of the matrix. 


Q-Sorts and Likert Scales 


Investigators working in the person-centered paradigm prefer 
ipsative scores, as they would represent intra-individual 
rather than interindividual comparisons. Varieties of ipsative 
scoring are row standardization, which fixes the means and 
standard deviations, and forced distribution, whereby all 
moments are fixed. Q-sorts automatically result in forced- 
distribution scores (unless the number of items in the “most 
applicable” to “least applicable” categories is not fixed, in 
which case, however, the method is indistinguishable from 
using a Likert scale). 

Like orthodox typologies, ipsative scores may be con- 
structed from continuous “interactive” scores, in this case by 
standardizing over variables or by forcing a distribution on 
them. One might object that Q-sorts are different in principle 
from Likert-scale scores, but that remains to be seen. In the 
first place, judges need not respond the way we instruct them 
to. If I am asked, by way of intra-individual comparison, 
whether I am (or John is) more reliable than friendly, I may 
well respond against the background of people in general; it 
could even be argued that the question is meaningless with- 
out that background. Conversely, when confronted with a 
standard personality questionnaire, intra-individual consider- 
ations might well enter into my response process. It is thus ar- 
guable that all responding is interactive. In the second place, 
Q-sorts are used to compare people, therefore, interindividu- 
ally: If John is said to be of Type A whereas Mary is not, the 
intra-individual level is automatically surpassed. 

The effect of ipsatization is to remove interindividual dif- 
ferences in elevation and spread (and skewness, kurtosis, and 
so on) of the responses. The operation thus implies a view 
of personality in which such individual differences have 
no place. Surprisingly, that view appears to be shared by 
some unadulterated trait researchers, most notably Goldberg 
(1992a) and Saucier (1992; see, however, Saucier, 2002a). 
Their rationale, however, has nothing to do with an emphasis 
on intra-individual differences. Rather, they use ipsatization 
of Likert-scale data to remove differences in scale usage, in 
other words, response sets. Whatever the rationale is, the 
implication needs to be examined in detail. 


Removing differences in elevation and spread prevents one 
person from having more traits than another, as well as from 
being more extreme. Correcting for elevation is quite defensi- 
ble in the special case where the variable set is completely bal- 
anced (i.e., consists of opposites like reliable and unreliable). 
Except in a fairly poetic manner, it hardly makes sense for a 
person to be both more X and un-X than another; it is more 
parsimonious to attribute such a response pattern to excentric 
scale use, traditionally denoted as the acquiescent response 
set. Hofstee et al. (1998; see also Ten Berge, 1999) presented 
ways to correct for excentric responding. However, if the vari- 
ables set is not balanced, correcting for elevation removes 
content and social desirability variance. In the most elemen- 
tary case, John is prevented from being both more friendly and 
reliable than Mary. That consequence is infelicitous. 

The person-centered approach is thus subject to an irony 
of fate: An intention (a proper approach to personality) mate- 
rializes into an operation (ipsative scoring) that appears to 
cradle aversive implications (for the very concept of person- 
ality). Ipsatization would do the job in a strictly idiographic 
approach, but that condition is not fulfilled: By virtue of the 
fact that one and the same method and vocabulary is applied 
to more than one person, interindividual comparison auto- 
matically creeps in. It may make sense to separate ipsative 
and normative components of a scores matrix by representing 
the latter as a vector containing the person means. Discarding 
that vector, however, has the effect of flattening the concept 
of personality. Essentially the same argument applies to indi- 
vidual differences in spread (and other moments of the score 
distribution). 


Dynamics 


Analytically, a dynamic approach to personality, as advo- 
cated by person-centered investigators, may mean either of 
two things: taking the time or growth dimension into account, 
and interpreting traits as an intra-individual pattern, there- 
fore, in a nonlinear fashion. The dynamic approach thus 
stands in opposition to an orthodox trait approach, which is 
static and linear. 

However, dynamics are easily accommodated in the 
individual-differences paradigm. A chronological series of 
assessments pertaining to an individual may be conceived as 
an extension of the scores vector. In a multiple prediction of 
some criterion, the question then becomes whether, for ex- 
ample, last year’s emotional stability has incremental validity 
over today’s. Alternatively, a (fitted) growth curve may be 
represented by its first derivative representing growth speed, 
its second derivative representing growth acceleration, and 
so on, in addition to the overall score of that individual. 
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Again, the derivatives function as extra traits. Similarly, 
pattern interpretation may be represented by introducing 
extra predictors, in this case, moderator or interaction terms 
formed by multiplication of predictors. Thoroughbred trait 
psychologists would argue that growth and pattern scores 
cannot be expected to have incremental validity, but that 
is not an objection of principle. What this brief analysis 
shows is that the two paradigms are not ideologically incom- 
patible but appear to consist of different generalized expecta- 
tions regarding the relevance of growth and moderator terms. 

A final wording of the moderator issue is whether single 
predictors may receive different weights according to the indi- 
vidual in question; thus, whether Mary’s emotional stability 
may be less relevant in predicting her performance as a pursuit 
plane pilot than is John’s. Again, there is no a priori reason 
why the weights should be uniform. A technical problem is that 
the Pearson correlation is undefined in the single case; how- 
ever, raw-score association coefficients like Gower’s (1971) 
and Zegers and Ten Berge’s (1985) can do the job. Their appli- 
cation to the single case also gives a precise expression to 
the otherwise elusive idea of intra-individual trait structure. 
The Gower coefficient for the general case is the mean of the 
single-case coefficients; it thus writes interindividual structure 
as the mean of intra-individual structures, thereby joining two 
paradigms of personality that are usually brought in opposition 
to each other. This integration is still another reason for taking 
raw scores seriously. An empirical problem, however, is that 
individual weights may be extremely unstable. However, the 
same holds for intra-individual structure. 


Ranges of Application 


After digesting a number of red herrings, what remains is a 
matter of conventional preference. The trait psychologist rep- 
resents the person as a vector of scores on a continuous scale, 
whereas the typologist would prefer a single qualification on a 
binary (applicable vs. not applicable) scale. Taking a sophisti- 
cated trait model incorporating growth and moderator effects, 
the person-centered approach is a special or degenerate case 
of it, and can therefore not be psychometrically superior in 
any respect. To justify the type approach, a different perspec- 
tive should be adopted. To that end, I distinguish between a 
context of prediction and a context of communication. 

Given the same basic materials, there can be no reasonable 
doubt that the trait approach is superior in a predictive con- 
text. On the one hand, typing consists of discarding informa- 
tion that is potentially valid. On the other, it introduces 
dynamic predictor terms whose empirical status is highly 
dubious; therefore, even an orthodox trait approach may be 
expected to do better upon cross validation. 
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Ironically, the 5-D approach meets with ambivalence from 
the side of its very proponents in predictive respects. McCrae 
and Costa (1992) and Jang and others (1998) have empha- 
sized the incremental validity of the 30 subscales of the 
NEO-PI-R (Costa & McCrae, 1992) over its five factor 
scales, thereby implicitly questioning the 5-D model as an 
adequate representation of personality. The psychometric 
value of such arguments, however, is quite limited. Principle 
component analysis capitalizes on the common variance in 
the predictor set; successive residuals follow the law of di- 
minishing returns. So does validity, unless in some magical 
and unintended way specific variance would be more valid 
than common variance. 

The value of the type approach is to be found at a differ- 
ent, pragmatic level, at which personality is a subject of com- 
munication between a diagnostician and a therapist (in the 
wide sense of someone who is going to work with the indi- 
vidual, possibly the individual him- or herself). Human dis- 
course and cognition being what they are, it makes little sense 
in that context to exchange vectors of continuous scores. Pro- 
fessional communication is better served by an attempt to 
capture the essence of the individual’s personality in a vivid 
and suggestive picture. To insist on using a trait paradigm in 
this context is to ignore the human element at the receiving 
end of a communication. 

In the end, the two sets of operations appear to refer to dif- 
ferent conceptions of personality-in-context rather than 
personality-in-vitro. The trait approach is geared toward au- 
tomated predictive procedures in which substantive consider- 
ations do not even surface. The type approach caters to 
human receivers of personality information. Which of the 
two scripts is appropriate in a particular case is difficult to say 
in abstract terms. A personnel selection situation, for exam- 
ple, may be conceived in predictive as well as in communica- 
tive terms; the same goes for a clinical intake situation. The 
emphasis here is on distinguishing the scripts: Predicting on 
the basis of types and communicating in terms of traits are 
both arguably deficient. 


HIERARCHICAL AND 
CIRCUMPLEX STRUCTURES 


In a hierarchical model, trait concepts are seen as specifica- 
tions of broader traits, which in turn may be grouped under 
the heading of supertraits. In a circumplex model, trait vari- 
ables appear as combinations of each other; they form a 
network in which all concepts define each other in a recursive 
manner, without subordination or superordination. In mixed 
models, all variables and factors are equal, but some are more 


equal than others because they explain more variance or are 
assigned privileged status for conventional reasons. 

This section contains an evaluation of trait taxonomies 
that have been proposed or implied, and it works its way to- 
ward a family model that may be acceptable by way of inte- 
gration. However, it should be kept in mind that taxonomies 
are subject to contradictory demands, namely, conceptual and 
communicative simplicity on the one hand, and adequate 
coverage of empirical reality on the other. 


The Principal Component Analysis Plus 
Varimax Taxonomic Model 


In its elementary form, the Big Five structure consists of a 
varimax rotation of the first five principal components taken 
from a large heterogeneous set of trait adjectives (see, e.g., 
Ostendorf, 1990). Whether this result is intended as a model 
in any proper sense is irrelevant, as it evidently functions like 
one: People receive scores on the Big Five, and these scores 
are interpreted as their personality structure—specifically, an 
orthogonal structure according to which these factors vary 
independently over persons. 

Goldberg (1993a) articulated that the model in question 
may be viewed as hierarchical: Items specify scales, and 
scales specify factors. This argument presupposes simple 
structure, but that condition is not fulfilled. A concomitant 
and very widespread notion is that the Big Five are “broad” 
(in the sense of fuzzy) factors of personality. 


The Implicit Assumption of Simple Structure 


Simple structure, in which each variable loads on only one 
factor and factors exhaust the common variance would be 
hierarchical indeed: Each variable would be a specification of 
only that factor; a particular factor could legitimately and 
meaningfully be interpreted in terms of the variables that load 
on it. The interpretation would not surreptitiously introduce 
other variance common to some subset of the variables in 
question. 

In empirical practice, however, variable structures are so 
overwhelmingly complex—as opposed to simple—that the 
hierarchical model functions as an obstacle to proper concep- 
tualization: The practice of interpreting factors on the basis of 
their highest loading items, which would be appropriate 
under simple structure, is quite erroneous if the condition is 
not fulfilled. For to the extent that some of the highest load- 
ing items share other common variance, factor interpretations 
become contaminated. For example, an extraversion factor 
easily receives a social interpretation (sociability, social ex- 
traversion, and the like; for an overview, see Digman, 1990) 


because many high-loading items have positive secondary 
loadings on agreeableness. 


The Alleged Broadness of Factors 


Under conditions of actual simple structure, factors could be 
called broad in a hierarchical sense, as they capture the com- 
mon variance of a number of variables. Even then, factors are 
not broad in a conceptual sense but rather more narrow than 
variables, as their internal consistency is higher and their 
angular position in the trait space is thus more fixed. A g fac- 
tor of intelligence, for example, is not a broadband but a high- 
fidelity measure of some latent trait. A fortiori, there is nothing 
broad about a Big Five factor based on a particular domain of 
trait variables. For lack of actual simple structure, it does not 
encompass a sizable number of lower level items or scales. 
The meaning of a factor, even if latent, is much more precise 
in a psychometric sense than is the meaning of the variables 
on which it is based. In that domain of variables, a set of five 
rotated principal components covers more variance than does 
any other set of five linear combinations, but “broadness” is an 
inappropriate and misleading term for that. 

In another terminology, to view the Big Five as broad fac- 
tors is to treat them as a circumplex structure. In a regular 
two-dimensional circumplex, the plane is sliced into a num- 
ber of angular segments (e.g., 12 segments). Variables within 
a segment form a homogeneous set. A special case is simple 
structure, in which “mixed” segments are empty, as in 
Figure 10.1, panel A. The actual situation, however, is closer 
to panel B, amounting to a circumplex with four segments, of 
which two are well filled. These segments contain very het- 
erogeneous sets of variables; two of those variables may even 
be orthogonal to each other. The very specific meaning of the 
factor is thus not adequately captured by the broad array of 
variables that have their primary loading on it. 
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Marker Variables 


The interpretation problem would be solved if stable marker 
variables could be found, that is, trait terms that load exclu- 
sively on one factor. Goldberg (1992a) presented such sets of 
psychometric synonyms, for example, extraverted, talkative, 
assertive, verbal, bold, and five other terms for the positive 
pole of the extraversion factor. A minor problem with this 
interpretation strategy is that markers for some factor poles 
are difficult to find, for example, markers for emotional sta- 
bility. A major problem is that marker sets appear to be no 
longer orthogonal in fresh samples or upon translation. Any 
two homogeneous sets of traits may be expected to correlate 
positively if both are desirable or if both are undesirable, 
negatively if they are opposite in that respect; neutral sets 
hardly exist. Orthogonal sets may be selected in a sample, but 
they will regress to obliqueness upon cross validation. On the 
basis of a large-scale study, Saucier (2002b) has developed 
marker scales that appear to be robustly orthogonal within his 
several data sets and might thus defy the present analysis. 
Still, one would have to wait and see how they do in another 
laboratory, for example, when transported abroad. 

The obliqueness problem (see, e.g., Block, 1995) cannot 
be answered by the truism that varimax-rotated factors are by 
definition orthogonal. The missed point is that they have no 
interpretation—not because they are broad or fuzzy, but be- 
cause any interpretation in terms of sets of variables is biased. 
To interpret a Big Five factor properly, one would have to 
perform and communicate a suppression operation, such as 
the following: Factor I is what remains of extraversion after 
suppressing any connotation of agreeableness or socialness 
that may be associated with it, however firmly; Factor V is a 
residue of intellect or openness to experience after subtract- 
ing a virtually indissoluble tinge of energy, which rather 
belongs to Factor I. That is a bit much to ask. 


(B) 


Figure 10.1 Prototypical simple-structure (A) and semicircumplex (B) configurations. 
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In conclusion, the PCA plus varimax set of operations 
leads to an inadequate representation of personality. The 
argument is not that traits are correlated, in any metaphysical 
sense: For purely predictive purposes, linear regression of 
criteria on orthogonal factors is a perfectly defensible ap- 
proach. What was stressed is the conceptual risk of starting to 
talk in Big Five terms, either among experts or with others. 
Conceivably, we could keep our mouths shut, but in practice 
that is too high a price to pay. 

The PCA plus varimax model has been imported into per- 
sonality from the domain of intelligence research. The ques- 
tion arises whether it is appropriate in that domain. I 
(Hofstee, 1994c) have argued that it is not. The empirical 
structure of intelligence variables is an n-dimensional sim- 
plex (the all-positive quadrant of an n-dimensional sphere) 
characterized by positive manifold and lack of simple struc- 
ture. Treating it as an orthogonal simple structure gives rise to 
biased conceptualizations of the underlying dimensions and 
inadequate representation of the domain. Essentially the 
same objection holds for the domain of personality. 


The Double Cone Model 


A seminal attempt at a specific structure model of personality 
in the 5-D framework is Peabody and Goldberg’s (1989) dou- 
ble cone, based on Peabody’s (1984; see also De Boeck, 
1978) work on separating descriptive and evaluative aspects 
of trait terms. It focuses on the first three Factors; the smaller 
factors IV, emotional stability, and V, intellect, are treated 
as separate axes orthogonal to the sphere that is formed by 
the bigger three: I Extraversion, II Agreeableness, and IH 
Conscientiousness. 

The double cone model may be envisaged as follows: 
Take a globe with desirability as its north-south axis, so that 
all desirable traits are on the northern hemisphere and 
their undesirables opposites are on the southern hemisphere 
in the antipode positions. Apply an orthogonal rotation to the 
Factors I, II, and III such that their angular distances to 
the desirability axis become equal, namely, 54.7 deg with 
cosine 1/3. Draw a parallel of latitude at 35.3 deg (close to 
Kyoto and Oklahoma City) through the positive endpoints of 
the Factors I, II, and III, and another one (close to Sydney and 
Montevideo) through the negative endpoints. Connect each 
possible pair of antipode points on the two circles by a vector. 
Together, these vectors form the double cone. The model rep- 
resents empirical trait variables by their projection on the 
closest model vector. 

The double cone was designed to embody a particular tax- 
onomic principle, informally referred to by insiders as the 


Peabody plot and named chiasmic structure by Hofstee and 
Arends (1994). A classical example of a chiasm is 


Thrifty 
Stingy 


Generous 


Extravagant 


In Peabody’s reasoning, this configuration arises by pitting a 
content contrast (i.e., not spending vs. spending) against a so- 
cial desirability contrast (thrifty and generous vs. stingy and 
extravagant). In the double cone model, chiasmic structure 
recurs in the shape of Xs that are formed by vertical slicings 
through the center of the double cone. On the Northern circle, 
we would have thrifty and generous at opposite longitudes; 
on the southern hemisphere, stingy and extravagant. More 
generally, descriptive and evaluative aspects are represented 
by longitude and latitude, respectively. 


Evaluation of the Double Cone Model 


The model is readily generalized to five dimensions, although 
it loses some of its aesthetic appeal in the process: Take all 10 
subsets of 3 out of the 5 factors, that is, the I X II x III, I x 
II x IV, through II X IV X V subsets, and treat each of these 
spheres in the manner just sketched. The generalized double 
cone thus consists of a Gordian knot of 10 three-dimensional 
double cones in the 5-D space sharing their vertical (desir- 
ability) axis, or 10 pairs of latitude circles. There is no valid 
reason why the range of the chiasmic structural principle 
should be restricted to a particular subset of three dimensions. 
But the model easily passes the generalizability test. 

It is not entirely clear whether the algorithm for analyzing 
the data as used by Peabody and Goldberg (1989) is consistent 
with the model. Via Peabody (1984), the reader is referred 
to an algorithm proposed by De Boeck (1978). De Boeck’s 
procedure, however, sets the I, II, and III dimensions orthog- 
onal to the desirability axis, rather than at 54.7 deg. Still, it 
is certainly possible to design an algorithm that would be 
consistent with the double-cone model. 

The next question, however, concerns fit. That may be 
tested by assessing the quality of chiasms that are generated 
by the model. Hofstee and Arends (1994, Table 1) present 
chiasms derived from Peabody and Goldberg’s (1989) mate- 
rials. An example is 


Forceful Peaceful 


Quarrelsome Submissive 


The content contrasts in this and in other examples are not 
convincing. The reasons are not hard to find. First and most 
important, the cone structure supposes an angular distance of 
only 109.5 deg between terms that should form a content 


contrast, like forceful-peaceful and quarrelsome-submissive. 
Second, 5-D factors have different angles with the desirabil- 
ity vertical axis: II+, agreeableness, for example, is much 
further north than is I+, extraversion. When these angular 
distances are forced to be equal, as in the model, content con- 
trasts become contaminated by a desirability contrast. In the 
example, peaceful is more desirable than forceful; therefore, 
to the extent that they are at all judged opposite, that is partly 
an artifact of a desirability difference. 

It is fair to conclude that the double cone does not model 
the underlying principle of chiasmic structure in an optimal 
way. One could refine the model, but there is no need to do 
so: Hofstee and Arends (1994) showed that the Abridged Big 
Five circumplex (ABSC; see Hofstee, De Raad, & Goldberg, 
1992) model to be discussed later can account for chiasmic 
structure, and generates credible chiasms: 


Daring Cautious 


Reckless Timid 


In two experiments, participants judged content contrasts 
taken from ABSC chiasms to be superior over double cone 
contrasts. This is not to say that chiasmic structure exists: 
Hofstee and Arends reiterate a point already taken by 
Peabody (1967) himself, namely, that desirability and content 
cannot be separated. So the best one can do is create a chias- 
mic illusion, as in the previous example. The algorithm goes 
as follows: Take a particular circumplex; draw a diameter 
separating desirable from undesirable traits; select two traits 
on different sides of the diameter but close to it and to each 
other, for example, cautious (slightly desirable) and timid 
(slightly undesirable); together with their opposites, they cre- 
ate the chiasmic illusion. It arises because in this case the 
alleged content contrast is formed by two terms with an an- 
gular distance that is only slightly less than 180 deg, instead 
of 109.5 deg as according to the double cone model. 


Do Chiasms Have a Future? 


The double cone model was shown to be generalizable; it 
may be possible to design a refined version by widening the 
angle between content opposites, amounting to oblique 
rotation. The more basic questions that remain, are What is 
the taxonomic status of the underlying principle of chiasmic 
structure? and What does it do to our conception of per- 
sonality? 

Whatever the refined model would be, it would focus on 
traits that are close to the equator of a hypersphere whose 
vertical axis is desirability: The model would focus on fairly 
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neutral traits. They form a small minority, so the focus 
would be on a counterrepresentative subset of personality 
variables. On the one hand, there is something venerable (to 
use Saucier’s, 1994, term) to such a value-free approach; 
personality psychologists, like everybody else, would prefer 
practicing a discipline that is not submerged in extrascien- 
tific values. On the other, desirability is not fruitfully consid- 
ered as a mere response set or other artifact that is to be 
separated from content: Hofstee and Arends (1994) empha- 
sized that even in the classical example of chiasmic structure 
cited earlier, stinginess is not merely undesirable thrift, but 
an asocial version of it, whereas generousness differs from 
extravagance in being prosocial; therefore, the evaluation 
contrast is in fact one of content, as in the AB5C model. So 
the most realistic conclusion is that chiasmic structure and 
related models cannot be central to the concept of personal- 
ity, even though they may have their place in specific 
contexts (see Saucier, 1994; Saucier, Ostendorf, & Peabody, 
2001). 

Central features of the double cone model, however, ap- 
pear to be valuable by themselves. One is the “circular pat- 
tern” (Peabody & Goldberg, 1989, p. 556), as opposed to 
simple structure, that is embodied in the model. Another is 
orienting the trait space toward desirability as its reference 
axis. These points are taken up later when developing an 
integrative family of structure models. 


Generalized Circumplexes 


In circumplex models, traits are assigned to segments of a 
circle and are thus represented by their projection on the 
bisectrix of that segment. Circumplexes picture tissues or 
networks of traits: Contrary to hierarchies, circumplexes 
have no superordinate and subordinate concepts. Eysenck 
and Rachman (1965), for example, represented Hippocrates’ 
melancholic, choleric, sanguinic, and phlegmatic types as 
mixtures of the positive and negative poles of neuroticism 
and extraversion; presumably, however, Hippocrates would 
have preferred a rotation by which an extravert is a mixture of 
the choleric and sanguinic types, neuroticism is what melan- 
cholics and cholerics have in common, and so on. Circles 
enjoy full freedom of rotation. 

Circles generalize to spheres, and spheres generalize to 
hyperspheres—particularly, in this context, to the 5-D hyper- 
sphere. An early example of a 3-D structure is Heymans’s 
(1929) temperament cube. Not until the end of the twentieth 
century, however, did 5-D researchers (Hofstee et al., 1992; 
Saucier, 1992) construct circumplexes of more than two 
dimensions. 
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Heymans’s Cube 


Heymans (1929) constructed a network model with three 
dimensions—emotionality, primary versus secondary function 
(comparable to extraversion-introversion), and activity— 
forming the axes of a cube. Types are located at each of 
the eight vertices of the cube, among which are the four 
Hippocratic types; for example, the sanguinic type is at the 
vertex where low emotionality, primary function, and high 
activity meet. 

Heymans tended to conceive the temperament space as 
unipolar: The type characterized by the absence of emotion- 
ality, activity, and secondary function is named amorphous. 
One amendment therefore is to move the origin of the trait 
space to the center of the cube. Next, it is difficult to conceive 
of activity and primary function as orthogonal; different 
dimensions (and types) would be chosen in a contemporary 
three-dimensional model. Finally, one would prefer rounding 
the cube to a sphere. On the one hand, it is thus gratifying to 
note that time has not stood still, and that Heymans’s cube is 
now obsolete by reasonable standards. On the other, it is 
equally gratifying to recognize Heymans’s model as a fore- 
runner of the generalized circumplexes that did not appear 
until the end of the twentieth century. 


Saucier’s Rhombicuboctahedron 


Saucier (1992) presented an integration of interpersonal and 
mood circumplexes and the Big Five Factors I, II, and IV. He 
drew attention to the fact that simple structure does not mate- 
rialize in these domains; many variables are interstitial in that 
they are closer to the bisectrix of the angle between two fac- 
tors than to the factors themselves. When simple structure is 
nonetheless imposed, interstitial variables are likely to be 
assigned to different factors by different investigators, even 
though the positions of variables and factors are closely 
comparable. Saucier constructed 6 bipolar scales as bench- 
marks for the interstitial positions, in addition to the 3 bipolar 
factor markers: a I+II+ versus I-II— scale (friendly vs. 
unfriendly), a I+II— versus I-II+ scale (dominant vs. sub- 
missive), and so on. He depicted the resulting trait struc- 
ture as a rhombicuboctahedron, a prism showing the 18 
Oe, 2 X [3 + 6]) unipolar benchmarks as facets. 

Saucier’s model may be alternatively conceived as an 
abridged three-dimensional circumplex, depicted by three or- 
thogonal circles based on two of the three factors at a time. 
Each circle contains two bisectrices of the angles between the 
factor axes; in the model, a variable is represented by its pro- 
jection on the vector (out of 9 bipolar or 18 unipolar vectors) 


to which it is closest. This representation has the advantage 
that it is easily carried to the fifth dimension (discussed later). 
Saucier showed that the I X II X IV sphere was the most in- 
terstitially structured of all 10 spheres that are contained in the 
5-D hypersphere; that difference, however, is quite relative in 
view of the many mixtures involving Factors III or V. 

Like Wiggins’s (1980) two-dimensional interpersonal cir- 
cumplex, Saucier’s model uses octants, which are 45 deg 
wide, corresponding to a correlation of .707. Therefore, the 
variables assigned to such a segment may still form a fairly 
heterogeneous set. Hofstee et al. (1992) distinguished traits 
that had their primary loading on one factor and their sec- 
ondary loading on another (e.g., I+II+; sociable, social) and 
traits with a reverse pattern (II+I+; merry, cheerful). This 
strategy amounts to slicing up a circle into 12 clock segments 
of 30 deg, corresponding to a correlation of .866. A reason for 
making these finer distinctions is that 30 deg is about the 
angular distance at which vectors are still given the same sub- 
stantive interpretation (Haven & Ten Berge, 1977). If this 
amendment is worked into Saucier’s model, it becomes 
identical to a three-dimensional version of the abridged 
circumplex. 


The Abridged Big Five Circumplex Model 


The AB5C model consists of the 10 circumplex planes that 
are based on 2 of the 5 factors at a time. Thus, variables are 
represented by their projections on the closest plane or, more 
precisely, on the closest of the 6 bipolar clock vectors 
(running from 12 o’clock to 6 o’clock, 1 to 7, and so on) in 
that plane. The hypersphere contains large empty spaces be- 
tween the model planes, so it may look as if the abridgement 
is rather drastic. However, varimax rotation puts the variables 
as close to the planes as possible; Hofstee et al. (1992) 
showed that it does a better job at this than at maximizing 
simple structure, which is putting the variables as close to 
the single factors as possible. Thus, representing traits by 
their two highest loadings seems acceptable; a model includ- 
ing tertiary loadings is entirely conceivable, but it would be 
much more complex and add very little. 

More aptly than by a spatial configuration, the ABSC 
tissue is depicted by a table using the 10 factor poles (I+, I—, 
I+, II-, and so on) as both warp and weft, the column de- 
noting the primary loading, and the row, the secondary load- 
ing of the traits assigned to a cell. Of the 100 cells in that table, 
the 10 combinations of the positive and negative poles of the 
same factor are void; the remaining 90 contain the unipolar 
facets generated by the model. The gain over the simple- 
structure model is enormous. That model accommodates only 


relatively pure factor markers, that is, traits assigned to the 10 
diagonal (I+I+ to V—V-—) cells of the table. If simple struc- 
ture would in fact materialize, most if not all of the variables 
would be found in those cells. If, on the other hand, the em- 
pirical structure is essentially circumplex, only 11.1% of 
the variables would find their way to the diagonal cells. In 
Hendriks’s (1997) analysis of 914 items, 105 (11.5%) ended 
up in those cells. That illustration is as dramatic as is the per- 
centage of variables that would have to be discarded in a 
proper application of the simple-structure model. 

In the discussion of the person-centered approach, I intro- 
duced a distinction between the contexts of prediction and 
communication. Against that background, it should be noted 
that the predictive gain of the off-diagonal ABSC facets over 
the five principal components is nil, as the facets are linear 
combinations of the components. However, they do serve 
conceptual, interpretive, and communicative purposes. An 
individual’s profile of scores on the FFPI, for example, may 
be typified by that person’s single most characteristic facet; 
thus, for example, a person whose highest score is on factor 
V+ and whose second highest score is on III+ may be char- 
acterized by the cluster of expressions and adjectives that 
form the V+III+ facet (Knows what he/she is talking about, 
Uses his/her brains, Sees through problems, and the many 
other items listed by Hendriks, 1997, for this “Tight Intelli- 
gence” facet). One or more of these catch phrases should be 
more effective than presenting a 5-D profile or even the 
subset based on the scores in question (“This person is pri- 
marily someone who Thinks quickly [V+], and secondarily 
someone who Does things according to plan [III+]”). 
Furthermore, at the theoretical level the AB5C model ac- 
counts for a large number of concepts that do not coincide 
with the five Factors but are quite adequately reconstructed as 
their mixtures. 

Another way to document the flexibility of the AB5C 
design is in noting that it incorporates features of both 
oblique rotation and cluster analysis on an orthogonal basis. 
Oblique rotation as such does not solve the simple-structure 
problem when the configuration of variables is essentially 
circumplex. However, the insertion of oblique model vectors 
enables one to capture relatively homogeneous clusters of 
traits. That function is also served by cluster analysis proce- 
dures, but they lose sight of the dimensional fabric of the 
structure and the recursive definitions of clusters. 

With respect to predictive purposes, the loss incurred by 
adopting the AB5C model is quite limited. First, the princi- 
pal components base maximizes the internal consistency of 
the facets (Ten Berge & Hofstee, 1999), which should be 
beneficial to their validity. Second, if factors beyond the 
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Big Five are needed to increase validity, the model is easily 
extended to include those factors. That would be more 
efficient than including separate scales for each additional 
specific concept. 


Undoing Hierarchies 


The traditional design of questionnaires is hierarchical: Items 
are grouped into subscales, subscales into scales. From the 
manuals of such questionnaires (see, e.g., Costa & McCrae, 
1992) it is easily verified that subscales actually form a 
network; they have substantial secondary correlations with 
scales other than the one they are assigned to. Upon analyzing 
the single items of a questionnaire, a similar tissue pattern 
would arise; items would appear to have all sorts of promiscu- 
ous relationships, inviting circumplex analysis of the data. 

Generalized (beyond two dimensions) circumplex analy- 
sis would proceed as follows: First, the item scores are 
subjected to PCA. The maximum number of principal com- 
ponents would be the number of subscales or facets (e.g., 30 
in the case of the NEO-PI-R). Note that these 30 principal 
components extract more variance by definition than tradi- 
tional scoring does. (In practice, it would soon become 
apparent that only a part of these principal components 
should be retained because the redundancies in the item 
tissue are captured by fewer components than the number of 
subscales.) 

At the scale level, the optimal strategy is to proceed from 
the first m principal components, m being the number of 
superordinate scales (e.g., 5), as they make more efficient use 
of the data than do traditional scale scores. If, for reasons of 
continuity, the original interpretations of the scales are to be 
simulated, target rotations of the m principal components to- 
ward these scales could be carried out. If the original scales 
are conceived to be orthogonal, as in 5-D questionnaires, the 
optimal approximation procedure would be a simultaneous 
orthogonal target rotation of the m principal components to- 
wards the set of m scales. That procedure conserves internal 
consistency (Ten Berge & Hofstee, 1999); consequently, the 
average coefficient alpha of the rotated principal components 
is maximal. Most notably, it is automatically higher than the 
average alpha of the original scales. 

Subscales of traditional questionnaires are very short; 
therefore, they are unreliable or consist of asking essentially 
the same question over and again, which is annoying to re- 
spondents and introduces unintended specific variance. If 
they are to be retained, their quality can be improved to a con- 
siderable extent by estimating subscale scores on the basis of 
(maximally) as many principal components as are postulated 
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by the questionnaire model (e.g., 30). The procedure would 
consist of a target rotation of these principal components to- 
ward the subscale in question. Thus, using the collateral 
information contained in related items would generally in- 
crease the subscale’s internal consistency, even though the 
contribution of “bloated-specific” variance to it would be 
diluted, which in itself would be a desirable side effect. 

The previous script, however, amounts to proving that the 
performance of a hierarchically conceived questionnaire can 
be boosted by placing a network model under its hood. Fol- 
lowing the script would sooner or later lead to adopting the 
generalized circumplex approach, which implies a view of 
personality structure as a tissue rather than a hierarchy. At the 
superordinate scale level, one would hit upon a large number 
of interesting AB5C facets, which are linear combinations of 
the first m principal components; at the subscale level, a great 
amount of redundancy would be found, leading to a drastic 
reduction of the conceptual rank of the data matrix. The 
future of personality structure is hyperspheric. 


Pruning the ABSC Model 


The hypersphere of personality is riddled with gaps. Upon 
presenting the AB5C model, Hofstee et al. (1992) already 
noticed that its circumplexes were not evenly filled: The 1 
o’clock versus 7 o’clock and 2 versus 8 segments attracted far 
more trait terms than did the 4 versus 10 and 5 versus 11 seg- 
ments. The former segments contain consonant mixtures of 
either two positives factor poles (e.g., I+ II+, sociable) or two 
negative poles (eg, I-II—, unsociable), whereas the latter 
contain discords combining a positive and a negative pole 
(e.g., I-II+, submissive, vs. I+II-, dominant). The scarcity 
of discords reappears in Hendriks’s (1997, p. 45) results: Of 
the off-diagonal items, only 23% combine a positive and a 
negative factor pole, whereas the number of consonant and 
discordant cells is equal. Furthermore, discords tend to have 
the smaller projections in the 5-space, as may be verified from 
the cited studies. These results reflect the classical (e.g., 
Cruse, 1965) finding that neutral trait terms are scarce. The 
lexical axiom would imply that people find the corresponding 
behaviors relatively unimportant. 

The rationale for introducing blends, in circumplex mod- 
els in general and in the AB5C model in particular, is commu- 
nicative. In the case of discords, the communicative benefits 
are unlikely to materialize. Rigid, for example, has a projec- 
tion of .29 on the I—II + vector in the AB5C model (Hofstee 
et al., 1992, p. 157); unkind and orderly have projections of 
.52 on H— and .67 on III+, respectively. Thus, the projection 
of the weighted sum of unkind and orderly on the H— IM+ 
vector would be about twice as high as the projection of rigid 


itself. “John is primarily unkind and secondarily orderly” may 
thus be expected to communicate better than “John is rigid.” 
Therefore, the discordant hyperquadrants may be deleted 
from the AB5C model. It would thereby become semicircum- 
plex: Of each circumplex, only the first and third quadrants 
would be retained. Clinicians, who tend to be sensitive to am- 
bivalences of personality, might deplore that loss. However, 
the removal might well clarify intraprofessional communica- 
tion, not to speak of communication with lay clients. 

Extending this analysis would lead to a proposal for a 
somewhat different rotational positioning of the 5-D axes in 
order to maximize the coverage of consonant variables. In 
their present definition, some factors (notably, II and III) are 
associated more closely with desirability than are others 
(notably, I and IV). Thus, the vector in the I+ by II+ 
quadrant upon which the projections of the Desirability val- 
ues of the traits would be maximal is closer to the II axis than 
to the I axis. This asymmetry is illustrated by the fact that an 
undesirable trait like unrestrained (at 2:30 on the clock) has a 
distance of only 30 deg from that bisectrix (which is at 1:30), 
whereas agreeable (at about 11:20) is more than 60 deg re- 
moved from it. A counterclockwise rotation of the two factors 
would recognize unrestrained as a discord and agreeable as a 
consonant trait, which seems appropriate. 

Applying this operation to all axes jointly amounts to a ro- 
tation to desirable manifold, mirrored by undesirable mani- 
fold. The resulting abridged Big Five semicircumplex 
(AB5SSC) model is thus contained by the 10 faces of the 
hyperquadrant centered around the desirability axis and their 
10 opposites. Each face is divided into three segments of 
30 deg, placing the model vectors at clock positions of 12:30 
versus 6:30, 1:30 versus 7:30, and 2:30 versus 8:30. (A more 
elegant representation involving a 45-deg counterclockwise 
rotation is presented later.) As no vectors recur in other semi- 
circumplexes, there are now 30 bipolar model vectors. 

In the following section the AB5SC model returns as a 
member of a family of models. That family comes about by a 
somewhat different rationale. The number five is no longer 
fixed; the emphasis shifts from 5-D models to accounts of 
trait structure that incorporate a number of principles that 
contribute to an efficient description of personality. Also, the 
factors as such disappear into the background, which is 
where they should have been from the start. 


A FAMILY MODEL OF TRAIT STRUCTURE 


What remained of hierarchical structure is the fact that each 
subsequent principal component explains less variance and is 
subordinate to its predecessors in that respect. The head of 


the trait pedigree is the first principal component, named the 
p factor of personality by Hofstee et al. (1998), in analogy to 
the g factor of intelligence, and in distinction to Eysenck’s 
(1992) psychoticism or P factor, which is intended as a lower 
level construct. A family of models may be constructed by 
adding one principal component at a time. Thus at the second 
level we have a circumplex or semi-circumplex; at the third a 
three-dimensional generalized one, and so on. 


The Primordial One 


In search of superlatives over the Big Five and the giant three 
(Eysenck, 1992), primordial appears as a good label for the p 
factor. That factor derives a mythical quality from its close 
association with desirability. It presents a fundamental para- 
dox to students of personality, whose ultimate challenge is to 
manage the potent values that nourish its roots: Not until we 
are capable of giving an overall evaluation of an individual’s 
personality in a perfectly respectful manner will we have 
mastered that challenge. 

In principle, there is nothing broad or vague about the p 
factor. Quite to the contrary, it is by definition the most inter- 
nally consistent linear combination of all traits, explaining 
some 10% to 15% of the total variance in unselected item sets 
(see, e.g., Brokken, 1978; Hendriks, 1997; Ostendorf, 1990), 
just like the g factor does in the intelligence domain. In other 
words, no scale based on any subset of the items, however 
optimally weighted, is as internally consistent as p. Its 
location in the personality sphere is almost completely fixed 
in any large data set. 

Fixing the interpretation of p across studies is another 
matter. In Hendriks’s (1997) unselected set of 914 items, p is 
best labeled as competence (Hofstee, 2001). In Saucier’s 
(2002a) study of representative sets of trait adjectives, it ap- 
pears as a character factor taking in altruism, self-discipline, 
and success. The first principal component of the FFPI, 
whose 100 items were selected to cover the five factors 
equally, is an optimism factor (Hofstee et al., 1998). In view 
of the psychometric accuracy and the statistical reliability 
that was attained in these large-scale studies, the differences 
in interpretation cannot be attributed to chance. Differences 
in composition of the item pools must be responsible. 

Saucier (2002a) interprets his first principal component as 
SD, for socially desirable qualities. On the one hand, this in- 
terpretation cannot be far off because the first principal com- 
ponent in any mixed set of positive and negative trait 
descriptors will be close to the desirability axis. On the other, 
it masks the fact that the first principal component bends to- 
ward whatever content is best represented in the item pool. 
For an extreme example, if that pool were overloaded with 
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fairly neutral extraversion and introversion items, the first 
principal component would appear as extraverted desirabil- 
ity. Therefore, active steps have to be taken to justify the 
desirability interpretation. 

I propose to define the p factor of personality as the indi- 
viduals’ Desirability. The most obvious operational definition 
of that variable consists of obtaining a score by weighting the 
items proportional to their desirability values. Both these val- 
ues and the item scores are best expressed as positive and 
negative deviations from the neutral midpoint of the scale. In 
the absence of desirability values, the first principal compo- 
nent of a heterogeneous and representative set of traits will 
closely approximate the desirability variable. The desirability 
score reflects the extent to which an individual is assessed to 
have desirable versus undesirable qualities. The result will be 
that most people are found to be desirable, although some are 
more desirable than others. A few people would be assessed 
to be undesirable. 

The implied conception of personality is literally perpen- 
dicular to the neutral view according to which, for example, 
there are no right or wrong answers to the items of a ques- 
tionnaire, and by which all people are equally desirable, just 
different. One could of course use the desirability variable 
just to partial it out, and retain a value-free, neutrally descrip- 
tive account of personality, as in Saucier’s (1994) model. 
Here, on the contrary, it functions as the pivot around which 
the personality hypersphere revolves. The present approach 
is comparable to emphasizing the g factor of intelligence, 
rather than its multidimensional conception according to 
which people are just differently intelligent (even though no 
one, to my knowledge, has gone as far as to partial out g). 
There can hardly be any doubt that capitalizing on p provides 
the most realistic account of personality. 

In the present context, the social part of social desirability 
is terminologically dubious. It could be used in opposition to 
personal desirability, but then the proper specification would 
be intersubjective versus subjective. In its actual use, SD 
refers primarily to impression management in self-report. 
This socially desirable responding (SDR) may be an interest- 
ing topic of study in its own right, but it is not at issue here. 
People have desirable and undesirable traits; they show over- 
all differences in the extent to which that is the case; there is 
substantial agreement among third persons, and even be- 
tween self and other, about someone’s desirability score; its 
heredity coefficient is undoubtedly in the same order of mag- 
nitude as with other traits, as it is a linear combination of 
them. Socially desirable responding is orthogonal to these 
individual differences: In a Persons X Assessors X Situations 
design with Desirability as the dependent variable, SDR is 
an Assessors main effect (e.g., a self-assessment may be 
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relatively socially desirable), and/or a Situations effect (e.g., 
a personnel selection context gives rise to elevated scores), 
and/or some interaction effect, but not a Persons or individual 
differences effect. The p component concerns the latter. 

Carrying out the slight rotation, if needed, to align the first 
principal component in any particular data set with the desir- 
ability variable should prove helpful in solving the vexing 
problem of indeterminate rotational positions of components. 
Saucier (2002a) has already documented that varimax rota- 
tion does not help in this respect: Across data sets, the posi- 
tions of unrotated principal components were at least as 
replicable as were varimaxed components. Among the princi- 
pal components, the first is by far the most replicable one. 
Across differently composed sets of variables, however, part 
of this stability gets lost (as discussed earlier). Anchoring p at 
the desirability values, which are external to the studies, 
should enhance replicability. 

The p-oriented model produces another taxonomic lever, 
namely, a measure of the representativeness of a set of per- 
sonality traits or items, in the shape of the correlation be- 
tween the first principal component of the set and the 
desirability variable. In a set overloaded with fairly neutral 
extraversion-introversion items attracting the first principal 
component, that correlation would be clearly below unity. In 
a heterogeneous set of neutral items, the desirability variable 
would be unstable, again lowering the correlation. In the 
spirit of the lexical axiom, such sets would be judged 
insufficiently representative. The proposed measure simu- 
lates that judgment. 


The Two-Dimensional Level 


Upon extracting p, a residual remains in the shape of a matrix 
of part scores. The first principal component of that residual 
matrix comes close to the second principal component of the 
original scores, at least in a representative set of variables. 
Taking p as the ordinate, a 45-deg counterclockwise rotation 
of the two components including p will produce an X struc- 
ture, or a flat version of the double cone. The upper and lower 
segments contain the most unambiguously positive and neg- 
ative, or consonant, traits; the left and right segments contain 
the most relatively neutral and discordant traits. The abridged 
semicircumplex structure at this level contains two bipolar 
facet vectors running from 11 o’clock to 5 o’clock and from 
1 to 7 in addition to the 12 to 6 p vector; the relatively neutral 
traits are left unaccounted for by the model, as their projec- 
tions on the vectors will be very low. 

Substantively, the plane would resemble, but not be iden- 
tical to, the interpersonal circumplex (Wiggins, 1980), the 
I XII or Agreeableness X Extraversion slice of the 5-D 


structure, Digman’s (1997) a X ß plane, and the like. In a 
perfectly representative set of traits as defined earlier, the 
model plane would be identical to the plane formed by the 
first two (rotated) principal components; this property makes 
it a good candidate for a canonical or reference structure. Its 
suitability for that purpose is enhanced by the absence of 
rotational freedom at this level: The positions of the model 
vectors are indirectly prescribed by the desirability values of 
the traits. Theoretical criteria, as in the interpersonal circum- 
plex, or the simple-structure criterion as in 5-D models, are 
insufficiently capable of serving that reference function. 

In the rationale of the semicircumplex model, the transi- 
tion from one p dimension to two dimensions means a 
spreading of the desirability component, in the manner of the 
unfolding of a fan. The primordial one becomes diluted in the 
process, like the g factor of intelligence does when it is 
spread over two or more dimensions. Following elementary 
rules of parsimony, the transition should not be made lightly; 
the burden of proof is on those who take the step. Psycho- 
metrics offers an adequate procedure for this proof: More- 
dimensional assessments of personality should be shown to 
have sufficient incremental validity over the p component. 
This requirement implies that an assessment of p, as a base- 
line variable, would have to be part of any empirical study of 
personality. 

Incremental validity of variables other than p would nec- 
essarily imply that variance orthogonal to it, thus neutral 
variance, is valid. This implication bridges the present fam- 
ily of models and those that capitalize on neutral variance, 
like Peabody’s (1984) and Saucier’s (1994; Saucier et al., 
2001). In fact, the latter model is the complement of the 
Semi-Circumplex, at the present and subsequent dimen- 
sional levels; it fills in what the present model leaves empty. 
Although the basic assumption—potential incremental valid- 
ity of neutral variance—is thus necessarily the same, a 
strategic difference remains at the executive level. In the 
semicircumplex approach, neutral variance is assessed indi- 
rectly, by suppressing the p variance from consonant traits 
rather than directly, as in Saucier (1994). The reason was 
given earlier: Discordant personality concepts are difficult to 
handle. 


Semicircumplex Spheres and Hyperspheres 


The three-dimensional member of the model family arises 
as follows: Add the second principal component of the matrix 
of residual scores (after removing p); retain the vertical ori- 
entation so that a globe is formed with the positive traits on 
the northern hemisphere and the negative traits on the south- 
ern one; perform an orthogonal rotation of the three axes 


including p so that they become equidistant (the angles being 
54.7 deg, with cosine „/1/3; further constraints are discussed 
later) from the vertical axis. All this is in correspondence with 
the double cone model. Now form three slices (circumplexes) 
by taking two rotated axes at a time. The projection of p on 
these tilted planes has the 12 o’clock to 6 o’clock direction, 
and the 3 o’clock and 9 o’clock positions are on the equator. 
Additional model vectors are constructed running from 11 
to 5 and from 1 to 6, as in the two-dimensional member of the 
model family. 

The central positions in this structure are taken by the 12 
to 6 vectors—to be labeled I/II, VIII, and IVIII—that are the 
bisectrices of the right angle between the two rotated princi- 
pal components forming the circumplex. The I, II, and III 
axes themselves merely guard the boundaries of the model 
structure; as such, they have no place or name in the model. 
The central model vectors appear to be close to p, namely, at 
a distance of 35.3 deg (with cosine .816 or ./2/3; generally, 
./2/n, where n is the number of dimensions). Note that this 
oblique structure arises as a side effect of an orthogonal rota- 
tion, not through some more liberal oblique rotation proce- 
dure as such. The central model vectors are thus much more 
saturated with desirability than are the factors themselves; at 
all dimensional levels of the model, they share exactly J/2 as 
much variance with p as do the orthogonal factors. 

What is new about this structure is that mixtures or blends 
of factors have stolen the central place that used to belong to 
the factors. Instead of being derivatives, the bisectrices of the 
factor pairs have become the central concepts. This play of 
musical chairs comes about because of the closer association 
of the central vectors with p, which entitles them to their po- 
sition. In passing, the model resolves the uneasiness of 
inserting orthogonal axes into an essentially oblique struc- 
ture; it rigorously defines oblique axes without giving up the 
convenience of an orthogonal base. The only price is that the 
number of musical chairs has to be increased, from four di- 
mensions onward: There are n(n — 1)/2 central vectors, with 
n the number of dimensions or factors. However, that exten- 
sion will be welcomed by those who have always wondered 
whether five is all there is. The model has shaken off the last 
remnants of simple-structure thinking. Parenthetically, I note 
that the model is equally appropriate in other domains, 
notably, intelligence. 

With four dimensions, the rotated factors are at an angle of 
60 deg with respect to the p factor; the central model vectors 
are at 45 deg from that pivot. With five dimensions, the fac- 
tors are at 63.4 deg, and the central vectors are at 50.7 deg. 
Still, the model rotation maximizes the sum of the correla- 
tions of the central axes with p, and in that sense minimizes 
their average neutralness. Conversely, any other orthogonal 
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rotation of these dimensions (e.g., varimax) is inferior in this 
respect: It takes in more neutral traits, which are less repre- 
sentative of the domain. 

With three or more dimensions, the model leaves freedom 
of spin. A three-dimensional structure, for example, may be 
rotated around its vertical p-axis without violating the model. 
For reasons of continuity, this freedom may be used for max- 
imizing the correspondence of the rotated factors with the 
current varimax factors, particularly, the 5-D model factors. 
This amounts to some lowering of the positive poles of the 
current dimensions I and III toward the hyperequator, and 
some lifting of the others. One may speculate, for example, 
that the American lexical extraversion factor loses its aggres- 
sive connotation and moves in the direction of sociability. 
However, it is difficult to gauge what the substantive effects 
of the joint rotation will be on all the versions in all the dif- 
ferent languages (see, e.g., Saucier et al., 2000) that have 
been proposed. The labels of the 5-D model are probably 
used in a manner vague enough to permit this twisting. 
(Agreeableness and conscientiousness, in particular, do not 
even fit their present axes; see Hofstee et al., 1992.) 

From the three-dimensional level on, there is some redun- 
dancy between model vectors at different levels. At the top 
level, there is the one vector. At the second level, two addi- 
tional bipolar vectors appear, which satisfy the requirement 
of being 30 deg removed from p. At the third, we find three 
semicircumplexes with three model vectors each; at the 
fourth, there are 6 X 3 at the fifth, the ABSSC model with 
30 vectors appears; in general, at the nth level from 3 on, 
there are 1.5n(n — 1) vectors specific to that level. In succes- 
sively adding levels, the cumulative number of model vectors 
thus becomes 1, 3, 12, 30, and 60. From the third level on, it 
appears impossible to rotate the central vectors in such a way 
that all the additional vectors stay at least 30 deg away from 
the ones at the second level. Thus some vectors would have 
indistinguishable interpretations. 

One strategy would be to settle for a particular dimension- 
ality of the trait space. That would prevent overlap and would 
simplify things in general. The foremost drawback is that 
from three dimensions onward the most central trait concepts 
would be missed. Furthermore, that strategy would only stir 
up the debate on the dimensionality of the trait space, to 
which there is no cogent solution; it would thus frustrate the 
attainment of a canonical structure rather than contribute to 
it. The other, preferable, strategy is to adopt the model family 
as a whole, including as many (or as few) levels as will ap- 
pear to be needed, and deleting concepts at lower levels that 
are virtual clones of those at higher levels. The foreseeable 
result of this strategy is maximal convergence of structures at 
each level, and maximal efficiency in communicating about 
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personality up to a particular level. In this manner, the family 
model may become a model family. 


CONCLUSION 


This probe into the credentials and future of the 5-D approach 
to personality ends in cautious optimism. Because of its re- 
liance on the questionnaire method, the 5-D paradigm stays 
at the phenotypic level; however, an efficient and coherent 
description of personality is indispensable also for research 
on genotypic and other determinants of individual differ- 
ences. The exploitation of the lexical axiom, with its rich his- 
tory dating back into the nineteenth century, in combination 
with PCA of large data sets that became feasible only in the 
last decades of the twentieth century, has provided a firm base 
for efficiently and coherently describing personality differ- 
ences. Of the several and diverging taxonomic models that 
have arisen in the 5-D tradition, I used elements to design the 
contours of an integrative structure that may serve scientific 
and applied communication. 

In the process, the penetrating evaluative aspect of per- 
sonality description exerted its influence. Its pervasiveness 
constitutes a fundamental and often frustrating problem 
to the field. I have chosen to adopt the strategy recommended 
to bridge players who find themselves in a squeeze: Relax 
and enjoy it. Desirability cannot be circumvented or sup- 
pressed without sacrificing the first principal component of 
personality description. So it might as well be squarely faced 
and put in the most central position. Giving in to the desir- 
ability component of personality will in all likelihood be 
rewarded with a more coherent, stable, and internationally 
replicable conception of personality structure. 
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Sociality is a hallmark of human functioning. Indeed, the 
survival and success of our evolutionary ancestors depended 
on their ability to form coordinated bands of interdependent 
actors (e.g., Leakey, 1978). The benefits of group living al- 
lowed a band to succeed where an individual might fail (e.g., 
Axelrod & Hamilton, 1981). Although our species has come 
a long way from the harsh and precarious conditions present 
during early hominid evolution, human beings continue to be 
utterly dependent on one another for their survival and well- 
being. It is therefore quite reasonable to assume that human 
cognitive and motivational tendencies were shaped by the 
demands of group living (e.g., Brewer, 1997; Seyfarth & 
Cheney, 1994). Some have claimed that our capacities for 
reasoning and our other higher mental functions may owe 
their very existence to the constraints imposed by sociality on 
human survival and reproductive success (Byrne, 2000). Our 
most fundamental concerns depend crucially on our ability to 
understand the characteristics, motivations, and intentions of 
others; according to Cummins (1998, p. 37), “the evolution of 
mind emerges as a strategic arms race in which the weaponry 
is ever-increasing mental capacity to represent and manipu- 
late internal representations of the minds of others.” This ca- 
pacity to understand the minds of others is so central to 
successful human functioning that when it is compromised, 
the consequences are often devastating (e.g., Baron-Cohen, 
1995). How the mind understands the social world within 
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which it functions is therefore a matter of central importance 
in psychology. It is this question that is at the center of theory 
and research on social cognition. 

Social cognition refers to the cognitive structures and 
processes that shape our understanding of social situations 
and that mediate our behavioral reactions to them. At its core, 
the fundamental assumption of social cognition research is 
the idea that internal mental representations of other persons 
and of social situations play a key causal role in shaping be- 
havior. The central task of social cognition research is thus to 
provide a specification of the nature of these mental struc- 
tures and the processes that operate on them. A simple, 
generic depiction of the theoretical space within which social 
cognition researchers work is provided in Figure 11.1. Stated 
at the most general level, a social cognition analysis incorpo- 
rates a consideration of (a) the informational cues that are 
currently experienced in the social environment; (b) mental 
representations that are constructed on the basis of current or 
previous experience; (c) the ways these representations are 
manipulated and the processes through which they influence 
other aspects of attention and cognition; and (d) the deci- 
sions, judgments, intentions, and behaviors that result from 
the application of these processes. The distinction between 
representation and process is more a matter of convenience 
than it is a reflection of a clear theoretical dissociation be- 
tween considerations of mental structure and mental process. 
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experienced = judgments, 
environmental intentions, 
input behavior 
Representations Processes 
of past or current involved in constructing 
experience and using representations 


schemas, stereotypes, selective attention, 


attitudes, exemplars, interpretation, 
prototypes, scripts, organization & storage, 
PDP networks, retrieval, 
narratives, etc. inference, etc. 


Figure 11.1 A schematic overview of the core assumptions of the social 
cognition perspective. 


In fact, as we shall see, many social-cognitive theories con- 
sist of propositions that link representational assumptions 
with particular processing tendencies that are assumed to be 
inherent within the representational format. 

In taking seriously the role of mental events in mediating 
social behavior, social cognition theorists part company from 
the radical behaviorists, who view the mind as a black box 
having little if any theoretical relevance to an understanding 
of the factors controlling and directing behavior. However, 
the form of mentalism embodied in contemporary social cog- 
nition research also parts company from the early structural- 
ists, who took the data provided by introspection to be the 
primary phenomena of psychological inquiry. Indeed, one of 
the cornerstones of social cognition is the recognition that the 
mind may be largely unaware of what it is doing; quite com- 
monly, social perceivers may have very little introspective 
access to the cognitive processes that give rise to their behav- 
ioral reactions (Nisbett & Wilson, 1977). These metatheoret- 
ical commitments create some methodological challenges for 
social cognition researchers of social cognition. On one hand, 
it is assumed that mental events have central, causal impor- 
tance in shaping social behavior. One the other hand, it is also 
assumed that people may not be able to provide accurate self- 
reports concerning the nature of these mental events. As a 
result, social-cognitive researchers have devoted consider- 
able effort to the development and adaptation of methodolo- 
gies for studying mental processes that do not rely upon 
introspection. Before commencing with our survey of social- 
cognitive theory and research, we begin with a brief consid- 
eration of the methodological underpinnings of this work. 

The obvious difficulties of explicitly studying mental 
events without falling prey to the potential biases and limita- 
tions of self-report measures have led to innovations in both the 
measurement and manipulation of social-cognitive processes. 
Two broad classes of process measures that do not rely on 


introspection have been developed. The first class consists of 
chronometric techniques that measure the speed with which 
a task can be performed (for a review, see Fazio, 1990). Build- 
ing on classic chronometric methods for analyzing mental 
processes (e.g., Donders, 1868; Sperling, 1960; Sternberg, 
1966), these techniques bring the workings of the mind into the 
scientific sphere by focusing on a directly observable property 
of mental events (i.e., their duration). Through carefully con- 
structed experimental situations, it becomes possible to use 
participants’ response times to derive inferences about a num- 
ber of theoretically important issues, such as determining the 
nature of mental associations (e.g., Bargh & Chartrand, 2000) 
and identifying the subsystems or component stages of a more 
general process (e.g., Lingle & Ostrom, 1979). The second 
class of process measures consists of techniques focusing on 
memory performance (for a review, see Srull, 1984). Through 
the study of aspects of performance such as omissions, intru- 
sions, and the serial ordering of freely recalled material, or the 
error rates observed in recognition memory, inferences can be 
drawn concerning both mental structure and process (e.g., 
Jacoby, 1998; Srull, 1981). Techniques such as these do not 

require any insight on the part of participants into the workings 
of their own minds; moreover, they are unlikely to be influ- 
enced by concerns about social desirability that can often 
contaminate self-report data. 

Another important methodological approach has involved 
the development of experimental manipulations that are de- 
signed either to activate or to interfere with hypothesized men- 
tal structures, processes, or both (for a review, see Bargh & 
Chartrand, 2000). For example, priming techniques can be 
used to study nonconscious biases in social perception (e.g., 
Bargh & Pietromonaco, 1982; Devine, 1989). In one version 
of this kind of research, general concepts (such as Blacks) are 
activated outside of perceivers’ conscious awareness, and the 
consequences for social perception and memory are exam- 
ined. If subsequent impressions of an ambiguous social target 
are more in line with the subliminally activated concept (e.g., 
more stereotypical of African Americans), then one can con- 
clude that stereotypical associations can be activated and ap- 
plied in a manner that is automatic and unintentional. Along 
similar lines, the imposition of secondary tasks can be used to 
study the efficiency or the resource dependency of the mental 
processes mediating social responses (e.g., Gilbert, Pelham, & 
Krull, 1988; Macrae, Milne, & Bodenhausen, 1994). Rela- 
tively automatic mental processes occur efficiently (i.e., they 
do not require much in the way of attentional resources for 
their successful deployment) and hence will not be disrupted 
by the imposition of a secondary task. This very brief method- 
ological sampler is merely meant to offer a taste of the general 
spirit within which social cognition research is conducted. The 


creativity with which researchers have gone about mapping 
the workings of the social mind testifies to the possibility of 
approaching the subject with a respectable measure of scien- 
tific rigor and objectivity, unhampered by the limitations of 
introspective methods. 

The study of the social mind inevitably proceeds from a 
set of (often implicit) assumptions about its fundamental 
character. The major theoretical precursors of contemporary 
social cognition research lie in the seminal research on social 
perception and attribution conducted by such pioneers as Asch 
(1946) and Heider (1958). Embodied within these historical 
approaches is a view of the human mind as largely rational and 
even—in its own naive way—scientific. Attributional models 
such as Heider’s were grounded in the assumption that per- 
ceivers seek out cues pertaining to issues such as the control- 
lability, foreseeability, or desirability of others’ behavior; 
perceivers then use these cues to logically derive assumptions 
about their mental states and about the reasons for their ob- 
served behavior. Classic models of impression formation 
(e.g., Anderson, 1965) assumed that social perceivers ascer- 
tain the likelihood that various characteristics or traits apply to 
a given target, and they then assess the favorableness of these 
traits, combining them into a composite impression in a man- 
ner dictated by familiar expectancy-value models of human 
judgment. Contemporary social-cognitive research calls this 
optimistic view of humans as rational actors into question and 
suggests a set of alternative metaphors. We mention several of 
these newer metaphors here, as a way to anticipate many of the 
major themes of the rest of this review. 


e Humans as automatons. Whereas classic social- 
psychological theories emphasized the role of rational 
analysis and active reasoning in guiding human behavior, 
much of contemporary social cognition research has em- 
phasized the role of automatic and implicit processes in 
shaping social conduct. This work certainly casts into 
doubt the assumption of pervasive rationality, and it sug- 
gests that in many (if not most) circumstances, we may be 
the slaves of mental processes that occur outside the realm 
of our ratiocinations. 


e Humans as motivated tacticians. Even when engaging in 
active thought, there is ample reason to believe that people 
seek out and use mental shortcuts rather than engage in 
a thorough and systematic analysis of relevant data. 
Because of the inherent limitations of our attentional 
capacity (Miller, 1956) and epistemic motivation (Simon, 
1967), humans are likely to be quite strategic in allocat- 
ing their mental resources to the tasks confronting them 
(Fiske & Taylor, 1991). A major theme of the research we 
review in this chapter concerns the specification of the 
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conditions under which social cognition will be likely to 
be relatively more analytical versus superficial. 


e Humans as intuitive lawyers. Whereas an intuitive scien- 
tist would be expected to be a truth-seeker, objectively 
seeking and using data concerning the state of the social 
environment, an abundant research literature shows that 
social cognition actually is subject to a wide range of 
powerful motivational biases. Rather than seeking to 
know the world as it is, we often see the world in the way 
we want it to be (e.g., Kunda, 1990). Much as a lawyer 
manipulates the available facts in a manner that is most 
flattering to a preferred conclusion, social perceivers also 
often show a rather shameless partiality in their dealings 
with the evidence relevant to their judgments, impres- 
sions, and choices. 


e Humans as affect-driven agents. The historical metaphor 
of the rational actor leaves relatively little room for the 
world of emotions, moods, and other feeling-states that 
form the real-life context of all social thought and action. 
In recent years, the importance of affective states in influ- 
encing social cognition and social behavior has been 
undeniably established (e.g., Forgas, 2001). It has thus 
become clear that affect is of integral importance in shap- 
ing the character of social cognition. In the remainder of 
this chapter, we trace the developments that have led re- 
searchers toward new conceptions of the social mind. 


MENTAL REPRESENTATION: STRUCTURE 
AND PROCESS 


With the advent of powerful technologies for studying the 
functioning of the brain in vivo, there have been many impor- 
tant advances in our understanding of the neural basis of in- 
formation processing (e.g., Gazzaniga, 2000). Nevertheless, 
there continues to be a sizable gap between our understanding 
of the low-level functioning of the central nervous system and 
the development of a satisfying theoretical account for the 
higher-order mental phenomena that are the focus of social 
cognition research. To fill the gap, theorists have hypothesized 
the existence of mental structures such as schemas and asso- 
ciative networks that can provide a relatively parsimonious 
account of how information is organized and used to meet the 
demands of a complex social world. These hypothetical repre- 
sentational constructs are best thought of as metaphors that 
capture theoretically or empirically important properties of 
social information processing. Although in a literal sense the 
nervous system may not contain schemas or other sorts of hy- 
pothesized mental structures, such constructs can be scientifi- 
cally useful to the extent that they capture some important 
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essence of whatever structures or processes actually do exist 
within the neural architecture of the brain. Because re- 
searchers have been able to account for a range of empirical 
phenomena by reference to these kinds of mental structures— 
and, indeed, have used their understanding of such structures 
to generate novel empirical predictions about the phenomena 
of social cognition—their hypothetical status has not been a 
source of great tribulation. Nevertheless, as Smith (1998) has 
pointed out, the metaphors that researchers use to understand 
mental representation can have the undesirable side effect of 
blinding them to important—even fundamental—properties 
of the how the mind works. For this reason, it is important to 
be cognizant of the background assumptions that underlie any 
particular representational model and to reevaluate these as- 
sumptions periodically. 

Before considering the most prominent models of mental 
representation (and many of their built-in assumptions), we 
can begin by summarizing some of the common ground that is 
shared by different theoretical approaches. First, all of the var- 
ious theoretical approaches are in agreement that our subjec- 
tive understanding of the social world consists of some sort 
of organized representations, and that these representations, 
whatever their nature, are definitely not merely veridical or ob- 
jective renderings of reality. These representations are filtered 
through the lens of each individual perceiver’s personality, 
motivations, knowledge, and attitudes. As such, mental rep- 
resentations are both more and less than a photographic record 
of the social world. They are less than a photographic record 
because they may fail to incorporate many aspects of the expe- 
rienced world. Some features of the informational environ- 
ment are selected for attention and subsequent processing, and 
these features are likely to be incorporated into mental repre- 
sentations of the relevant persons and events. However, many 
other features are neglected and will consequently fail to be in- 
cluded. On the other hand, the representations that are formed 
are often more than a photographic record: They may go be- 
yond the available data and incorporate aspects that were 
never directly experienced—that is, perceivers may generate 
inferences about otherwise unspecified characteristics of so- 
cial targets and then incorporate these inferences within their 
mental representations; indeed, they may subsequently be un- 
able to distinguish between actual and inferred features. These 
features of mental representation make it clear why it has as- 
sumed the central role in social cognition research: It is impos- 
sible to know what the person’s mental representation will 
consist of simply by examining the stimulus input. After a rep- 
resentation has been formed, it (and not the source information 
from which it was originally derived) will be the crucial deter- 
minant of observed reactions (e.g., Lingle & Ostrom, 1979; 
Srull & Wyer, 1983). 


A second universal assumption regarding mental repre- 
sentation is the notion that new representations are inevitably 
formed by referencing memory for relevant prior experiences 
and knowledge. It is quite obvious that we would be hope- 
lessly disoriented and quickly incapacitated if we had to treat 
every stimulus that we encounter as a novel phenomenon 
about which no preconceptions are available. Instead, we 
rely on our memories to determine such crucially important 
matters as how to interpret the meaning of different objects 
and events and how to allocate our attention to different 
aspects of the social environment. As such, our experience 
of the present is always inexorably linked to past experi- 
ences, as they are represented in memory. Clearly, what- 
ever theoretical choices one might make in accounting for the 
nature of mental representation, an understanding of the na- 
ture and determinants of social memory will be absolutely 
central to any complete account of the dynamics of social 
cognition. 

In an especially comprehensive and insightful review ofthe 
models of mental representation that have been employed by 
social psychologists, Smith (1998) identified four major 
classes of hypothesized representational mechanisms: asso- 
ciative networks, schemas, exemplars, and distributed (PDP) 
models. In the next sections, wereview each ofthese major ap- 
proaches, highlighting their key assumptions and document- 
ing the major phenomena that each approach has succeeded in 
illuminating. 


Associative Network Models 


The intellectual roots of associative network models lie in 
British empiricism (especially Locke and Hume), with its 
emphasis on the learning of simple associations between 
sensations as the foundation from which all mental capacities 
are presumed to arise. The associative network approach 
assumes that mental representations consist of nodes of 
information that are linked together in meaningful ways (e.g., 
Wyer & Carlston, 1994). For example, a mental representa- 
tion of a person named George could consist of various 
concepts that are associated with him, such as personality 
traits, occupational roles, physical appearance, and so on. 
Each attribute would constitute one node, and each node 
would be connected to a central organizing node via links. 
The strength of these links is hypothesized to vary; if certain 
attributes were especially strongly associated with George, 
for example, then the links connecting these attributes to the 
central one would be especially strong ones. The structural 
assumptions of this approach could thus hardly be simpler: 
Representations consist simply of nodes that are intercon- 
nected via links that vary in strength. 


The central process that is assumed to operate on this type of 
representational structure is the spreading of activation. Each 
of the nodes in a network can vary in its degree of activation. 
When activation levels are minimal, the information contained 
in anode is essentially dormant in long-term memory, exercis- 
ing no influence over the ongoing course of social cognition. 
However, when the level of activation rises above a critical 
threshold, the information contained in the node is assumed to 
enter working memory and to begin to influence ongoing cog- 
nition. For example, if our hypothetical friend George were 
suddenly encountered on the street, the George node in long- 
term memory would be activated and thereby brought into 
working memory. It is important that the activation that is in- 
fused into the central George node is assumed to spread along 
the available links to connected concepts, with more activation 
flowing along the stronger links. Whenever this activation is 
sufficiently high, the connected links will also enter working 
memory. Although there are variants on these assumptions, this 
brief summary provides a reasonable description of the core 
ideas of the associative network models. 

The assumptions of the associative network models have 
been used to illuminate a wide variety of social-cognitive 
phenomena. To provide a representative sample, in this chap- 
ter we focus on three domains in which such models have been 
influential: attitudes, stereotypes, and memory for expectancy- 
relevant material. Fazio (1986) proposed a model of attitude 
structure that follows from the principles of the associative net- 
work models. In his view, an attitude consists of a simple asso- 
ciative structure: a node representing the attitude object, an 
evaluative node, and a link connecting the two nodes. Of criti- 
cal importance is the strength of the connecting link. For strong 
attitudes, the link between the two nodes will be very strong, 
and any time the node representing the attitude object gets ac- 
tivated, the activation will be likely to spread to the evaluative 
node, thereby activating the associated attitude. Weak atti- 
tudes, however, will tend not to be automatically activated in 
this way, because the link connecting the attitude object to the 
evaluation is not likely to conduct enough activation to the 
evaluation node when the attitude object node gets activated. 
On the basis of this set of assumptions, Fazio was able to con- 
struct a compelling model of the determinants of attitude- 
behavior consistency. When attitudes are highly accessible 
(i.e., when the link between the attitude object and the evalua- 
tive node is strong), encountering the attitude object is likely to 
be sufficient to activate the attitude. After it is brought into 
working memory, the activated attitude can influence the on- 
going stream of information processing by biasing the process 
of interpreting the subjective meaning and perceived behav- 
ioral affordances of the immediate situation. But none of this 
will happen if the attitude is not sufficiently accessible. 
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Some prominent models of stereotyping also assume the 
operation of an associative network structure (e.g., Devine, 
1989; Dovidio, Evans, & Tyler, 1986). From this perspec- 
tive, stereotypes consist of a central node representing a 
particular social group (e.g., elderly people) that is linked to 
various concepts that are assumed to characterize group 
members (e.g., slow, forgetful). When a member of the rele- 
vant category is encountered, activation can spread along the 
links from the central identity node to the associated stereo- 
typical concepts. After these concepts enter working memory, 
they can influence subsequent impressions and reactions. One 
especially influential example is a study by Devine (1989). 
In her experiment, concepts that are part of the African 
American stereotype were activated (via a subliminal prim- 
ing procedure). It is important that none of these concepts 
dealt with the concept of hostility. However, because hostility 
is assumed to be part of the cultural stereotype of African 
Americans, Devine assumed that activating other parts of the 
stereotype would also result in the activation of the concept of 
hostility, through the spread of activation. In line with this 
assumption, it was found that priming the African American 
stereotype resulted in elevated perceptions of hostility on the 
part of a subsequently encountered, ambiguous target. This 
finding fits with the assumption that after a sufficient level of 
activation reached the hostility node (by traversing the links 
connecting it to the rest of the associative network), this 
concept entered working memory and influenced subsequent 
impressions. Devine argued that these associations are part 
of culturally ingrained belief systems, and even when people 
do not consciously endorse the relevant belief (e.g., even 
when they do not believe that African Americans are hostile), 
they are still prone to being influenced by the culturally 
learned association. 

Perhaps the most extensive development of associative 
network models by social cognition researchers has occurred 
as part of efforts to understand the impact of expectancies on 
social memory (e.g., Hastie, 1980; Srull, 1981; Wyer & Srull, 
1989). Researchers working in this tradition have attempted 
to specify the factors that determine both the strength and the 
types of links that form among activated pieces of infor- 
mation, and they have also developed models addressing how 
these associative structures are used in the process of memory 
retrieval. In the typical experiment, participants learn some 
initial facts about a particular target that establish a general 
expectancy about him or her (e.g., Tina is smart, intellectual, 
well-educated, etc.). After an expectancy has been induced, 
participants then read more detailed descriptions of the tar- 
get’s behavior. These descriptions contain three classes of be- 
haviors: those that are consistent, inconsistent, and irrelevant 
to the general expectancy about the target. When a consistent 
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behavior is encountered (e.g., won the citywide chess tourna- 
ment), itis assumed to be linked directly to the central concept 
(Tina) by a relatively strong node, because it fits with precon- 
ceptions about this target quite well. However, when an in- 
consistent behavior is encountered (e.g., got confused trying 
to figure out the subway system), it is assumed to be linked to 
the central concept in a more tenuous way because it does not 
really fit with the general image of the target. However, the in- 
congruity embodied in the inconsistent behavior is assumed to 
provoke efforts to resolve the confusion by thinking about 
how the inconsistent behavior might make sense in light of 
other known facts. This triggers the formation of inter-item 
associations among the different behavior nodes. Thus, al- 
though inconsistent behaviors are likely to be less strongly 
linked to the central person concept than are consistent behav- 
iors, the inconsistent behaviors are actually more likely to be 
linked to a variety of other behavior nodes. As a result, the in- 
consistent behaviors tend to be more memorable on average 
because they tend to have more associative links with other 
items, producing a greater number of pathways through which 
activation can spread into them and draw them into working 
memory. 

We have presented only the most general statement of how 
associative network models have been applied in the domain 
of social memory. Specific theoretical approaches have been 
much more elaborate in their assumptions—although they 
still share the key core assumptions that we have outlined. 
This general approach has been used to predict a wide and 
impressive array of empirical phenomena concerning social 
memory, including the serial order of information retrieval in 
free recall (e.g., Srull, 1981) and the influence of different 
processing goals and levels of attentional capacity on the 
probability of recalling inconsistent versus consistent infor- 
mation (e.g., Srull, Lichtenstein, & Rothbart, 1983; for a 
comprehensive review, see Wyer & Srull, 1989). Research in 
these and several other topic areas confirm the explanatory 
power of the relatively simple assumptions embodied in the 
associative network approach. 


Schemas 


Arather different view of the nature of human understanding 
emerged in Continental philosophy (particularly the ideas of 
Kant). From this perspective, simple associations are inade- 
quate to account for the complexity of human cognition. 
Instead, it is assumed that knowledge is organized into more 
elaborately structured conceptual representations. This ap- 
proach to mental representation is epitomized in schema 
theories. 

Originally introduced prominently into psychology by 
Bartlett (1932), schema theories focus on the role played 


by generic knowledge structures that organize a person’s un- 
derstanding of a particular domain. A schema can be thought 
of as a subjective theory (Markus & Zajonc, 1985) that is for- 
mulated to account for the generalities of one’s experience. 
The elements of the schema are typically thought to be orga- 
nized by more than simple association. For example, spatial, 
temporal, logical, and causal relations constrain and provide 
coherence to the schematic structure. To take a simple exam- 
ple, a face schema consists not only of a set of elements that 
are associated with faces (e.g., eyes, nose, mouth), but also of 
rules about the spatial relations among these elements. This 
general understanding of what faces are like is assumed to 
have been abstracted from experience with numerous specific 
faces over time. In addition to this inductive pathway to 
schema formation, it is often assumed that schemas can be 
learned in a more top-down manner. For example, most 
schoolchildren could, one hopes, articulate a rather detailed 
mammal schema, although they have most likely not induced 
its elements by observing particular instances. Instead, they 
have learned directly what the core elements of the schema 
are and how these elements are related to one another. 

The elements contained in a schema often function like 
variables that can take a variety of values, provided that they 
adhere to the fundamental constraints of the schema. For ex- 
ample, there is a range of acceptable colors and shapes that 
eyes can take, but they must invariably be located above the 
nose, contain a pupil and an iris, and so on. This observation 
points to the fundamental function of schemas: They serve as 
templates for understanding experience by providing preor- 
ganized, general-purpose understandings that can be adapted 
to the particulars of the current situation via instantiation. It 
is assumed that schemas will be activated spontaneously in 
situations in which they are relevant, and that this activation 
occurs in an all-or-none fashion. Thus, unlike the associative 
network models (in which some nodes in a network can be 
active while others are not), schema models assume that if 
any part of the schema has been activated, then the rest of the 
schema will also be activated. 

Schemas are thought to fulfill a variety of functions (for 
a review, see Bodenhausen, 1992). Most notably, they provide 
a basis for making inferences about unspecified elements of a 
stimulus or situation, and they can guide the interpretation of 
ambiguous features as well. Activated schemas also tend to 
guide the processes of perception and memory toward infor- 
mation that is relevant to the particular schema. One famous 
demonstration of the operation of schemas was provided by 
Bransford and Franks (1971), who showed that memory for 
ambiguous verbal stimuli (e.g., the notes were sour because 
the seam was split) was substantially enhanced when a relevant 
schema was activated that would allow for the disambiguation 
of the sentence (in this example, bagpipe). As Bartlett (1932) 


emphasized in his seminal writings, schemas also serve an 
important function in facilitating the reconstruction of the past. 
Schematic inferences undoubtedly do contribute to our memo- 
ries for past experiences in important ways. 

In many situations, competing schemas may be potentially 
applicable, and the understanding one gains of the situation 
may be substantially altered depending upon which schema 
is activated to parse the situation. Consider the famous case 
of Kitty Genovese, a New York resident who was brutally 
murdered in 1964. After observing Ms. Genovese being 
chased, screaming, by an unknown man, many witnesses 
failed to activate and apply the correct schema (i.e., homici- 
dal maniac pursuing victim) and instead applied a quite 
mistaken one (e.g., teenagers engaging in horseplay). The 
failure of other bystanders to take action only served to 
underscore the plausibility of the erroneous interpretation. 
Clearly, the meaning of observed behavior can take on a very 
different meaning—and obliges very different behavioral 
reactions—depending upon which schema is invoked. Re- 
search by Shotland and Straw (1976) subsequently showed 
that when people observe an ambiguous situation in which a 
man is harassing a woman on the street, they often assume by 
default that it is a lover’s quarrel and fail to take any steps to 
help the woman. Only when this schema was rendered inap- 
plicable (by the woman’s exclaiming, “I don’t know you!”) 
did people perceive the situation as one in which they should 
intervene. Research such as this underscores the importance 
of understanding the conditions under which particular 
schemas will be applied. 

Sometimes a relevant schema is activated because it fits 
the current situation unambiguously. But when there is any 
ambiguity and competing schemas can each afford some de- 
gree of fit to the situation, then the schema that is applied is 
likely to be the one that most accessible (Bruner, 1957). Ac- 
cessibility, in turn, is a function of relevance of the contend- 
ing schemas to the perceiver’s chronically and momentarily 
active goals, as well as the recency and frequency with which 
each of the competing schemas has been used. As such, 
schemas that are goal-relevant or that have been recently or 
frequently used will be much more likely to be applied. 
Dodge (1993) has shown, for example, that some boys have a 
chronically accessible schema for parsing social interactions, 
in which they assume that the behavior of others toward them 
is motivated by hostile intentions and disrespect. When 
confronted with ambiguous behavior, they consistently as- 
sume the worst. These schema-based impressions then lead 
to hostile reactions. Perhaps unsurprisingly, these same boys 
have a tendency to show poor social adjustment and are at 
higher risk for delinquency. In addition to dispositional 
biases in the accessibility of schemas, situational factors can 
prompt certain schemas to become more accessible. The 
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expansive literature on priming effects is built on the realiza- 
tion that schemas that have been activated in unrelated con- 
texts may continue to exert an influence on social cognition 
because their previous use has rendered them momentarily 
accessible (e.g., Higgins, 1996). 

Schema theory has been applied in a wide variety of topi- 
cal domains. One domain in which schematic models have 
been especially influential is gender. Bem (1981) proposed a 
gender schema theory, which asserts that cultural conven- 
tions regarding gender become a sort of lens through which 
perceptions of others are filtered. Bem (1993, p. 154) ex- 
plains that the gender-schematic person “has a readiness to 
superimpose gender-based classification on every heteroge- 
neous collection of human possibilities that presents itself.” 
In one of the most well-known studies of this phenomenon, 
Bem (1981) first identified individuals who were or were not 
gender schematic (i.e., based on their sex-role attitudes, they 
either did or did not appear to possess an internalized schema 
for gender appropriateness that was consistent with prevail- 
ing cultural conventions). Then she presented them with lists 
of concepts (animals, verbs, clothing) to learn, in a randomly 
mixed order. An important aspect of this study was that some 
of the concepts were pretested as being conventionally mas- 
culine (e.g., gorilla, hurling, trousers), some were conven- 
tionally feminine (e.g., butterfly, blushing, bikini), and some 
had no gender connotations (e.g., ant, stepping, sweater). The 
order in which these concepts were recalled in a memory task 
revealed that gender-schematic individuals were far more 
likely than were aschematic persons to cluster the concepts 
together in terms of their gender connotations, consistent 
with the idea that a gender schema guided the way the infor- 
mation was interpreted and organized in the minds of the 
gender-schematic participants. 

Gender is but one of many domains in which the schema 
construct has been invoked to account for the regularities 
of social cognition. Person schemas, event schemas, self 
schemas, role schemas, and many others have been proposed 
(for a review, see Fiske & Taylor, 1991). The appeal of 
schema theory as opposed to associative network models of 
mental representation appears to lie in the recognition that the 
stimuli of the social world are often quite complex, and the 
assumptions of structured organization contained within 
schema models seems more appropriate for capturing this 
complexity, compared to the comparatively simple structural 
assumptions underlying network models. Moreover, the em- 
phasis of schema approaches on processes of selective atten- 
tion and organization of social information has an undeniable 
resonance with many phenomena of long-standing interest to 
social cognition researchers. Nevertheless, schematic models 
have been criticized as being too loose and theoretically un- 
derspecified (e.g., Alba & Hasher, 1983; Fiske & Linville, 
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1980). In addition, newer approaches to mental representa- 
tion have been proposed that can account for many if not 
all of the same phenomena covered by schema theory, but 
with a much greater degree of theoretical specificity. We turn 
now to one of these alternatives to schema theory—namely, 
exemplar models. 


Exemplars 


Generic mechanisms for mental representation (e.g., 
schemas) assume that people forge abstract or prototypical 
models of reality by inducing generalities across similar, 
particular instances. A major alternative to this view was 
provided by exemplar models (e.g., Smith & Zarate, 1992), 
which hold that social cognition is based on specific repre- 
sentations of individual instances. Instead of relying on pre- 
computed generalizations, perceivers are assumed to retrieve 
and use sets of prior relevant and specific experiences to 
guide their social information processing. Consider, for ex- 
ample, how the category elderly people might be represented 
using the various mechanisms that have been discussed so 
far. In an associative network, various attributes would be as- 
sumed to be linked to the generic concept elderly people, 
with varying degrees of strength. In a schematic model, the 
same kinds of attributes would be assumed to be embedded 
within a more elaborate conceptual model, in which causal 
and other kinds of constraints provide a more integrated but 
still very generic structure. In the exemplar model, it is as- 
sumed that there is no abstract or generic elderly people rep- 
resentation at all. Instead, there would be a multitude of 
specific elderly persons (e.g., Grandma, the kindly pharma- 
cist, the doddering Senator, etc.), each represented in terms of 
how they were perceived or experienced by the individual. If 
the situation requires a person to make general judgments 
about elderly people, then he or she will retrieve relevant ex- 
emplars at that time and render judgments based on the aver- 
age features of these momentarily activated exemplars. 

This example conveys several important assumptions of 
the exemplar approach. First, it assumes that multiple exem- 
plars can be activated in parallel at the same time. The likeli- 
hood that any given exemplar will be activated depends on 
the degree of its similarity to the current retrieval cues. The 
most similar exemplars are the ones that tend to get activated. 
Upon activation, the entire set of exemplars can then influ- 
ence judgments and behavioral reactions. The assumption of 
parallel processing is an important characteristic of the ex- 
emplar approach. During retrieval of exemplars, many simi- 
lar instances tend to be retrieved simultaneously; when an 
individual makes generalizations about a concept or stimu- 
lus, multiple, simultaneously active exemplars enter into the 


judgment (with their implications being aggregated into a 
general summary). Exemplar models thus can produce and 
account for generic judgments, just as schematic models 
can, and they can also account for patterns of selective atten- 
tion and interpretation that were previously regarded as the 
hallmark of schematic processing. As soon as a set of exem- 
plars is activated, it can bias the ongoing stream of informa- 
tion processing, just as a schema is assumed to do. However, 
exemplar models are substantially more flexible than are 
schema models, because exemplar models assume that dif- 
ferent subsets of exemplars will be activated in different con- 
texts, depending on the particular retrieval cues that are most 
salient in each context. Whereas schemas have a fixed or sta- 
tic quality and are assumed to be activated in an all-or-none 
fashion, exemplar retrieval can be tuned very flexibly to the 
immediate situational context. Further, the specific exem- 
plars that are retrieved depend partly on recency and chronic- 
ity of activation. Thus, a more recently encountered elderly 
person would be more likely to be recruited into our sum- 
mary representation of elderly people than would a more 
temporally distant one. Inasmuch as exemplar models can 
account for many of the same phenomena as can other repre- 
sentational formats, yet offer a strikingly greater degree of 
flexibility, they have considerable theoretical appeal. 
Exemplar models are of relatively recent vintage, coming 
to prominence within social psychology only in the last 
decade or so; there are therefore few examples of substantive 
topics that have been shaped and guided by the assumptions 
of this representational mechanism. One case in which such 
models have taken on particular prominence is the study of 
perceptions of variability versus homogeneity in social 
groups (e.g., Linville, Fischer, & Salovey, 1989). Exemplar 
models provide a very natural way for thinking about how 
people understand and estimate group variability. By simply 
calling to mind a relevant set of exemplars, the degree of vari- 
ability can be gauged directly by making across-exemplar 
comparisons. Research findings confirm that exemplar-based 
models are better able to account for perceptions of group 
variability than can models relying on prototypic or generic 
representations of groups (such as schemas; e.g., Smith & 
Zarate, 1990). However, many researchers have concluded 
that the most sensible assumption about the mental represen- 
tation of social groups is that both specific exemplars and 
more abstract summaries are important components of such 
representations (e.g., Hamilton & Sherman, 1994; Park & 
Judd, 1990). Various ideas have been proposed concerning 
the relative prevalence and importance of each type of repre- 
sentation. For example, Sherman (1996) proposed that when 
representations of groups are initially being formed, they 
tend to be predominantly exemplar-based—but over time, 


more abstract and stable representations emerge. Park, Judd, 
and Ryan (1991) proposed that representations of groups to 
which one oneself belongs (i.e., in-groups) tend to contain 
more information about specific exemplars, whereas repre- 
sentations of out-groups tend to consist mainly of generaliza- 
tions (see also Sherman, Klein, Laskey, & Wyer, 1998). 
Hybrid representational models such as these provide even 
more theoretical flexibility because they can account for both 
the flexibility of social cognition that is emphasized in the ex- 
emplar approach and the stability (sometimes even rigidity) 
of social cognition that is emphasized in models relying on 
more generic forms of representation such as schemas. 


Distributed Memory Models 


As Smith (1998) notes, one potentially important distinction 
between exemplar models and earlier approaches to repre- 
sentational mechanisms lies in the fact that the schema and 
associative network approaches suggest the existence of a 
discrete, stable, enduring cognitive entity—a thing that is 
stored, accessed, used, and stored away again. In contrast, the 
exemplar approach suggests the dynamic construction of rep- 
resentations on the spot, depending on which particular ex- 
emplars happen to get recruited for processing in a particular 
context. After such a representation is formed from its con- 
stituent exemplars, it does not remain as a stable entity; 
rather, it is used and then deconstructed back into its underly- 
ing elements, which may never come together in quite the 
same way again. From this perspective, exemplar-based 
representations are more like a transitory state than like an 
enduring entity. This idea is taken even further in the latest 
representational mechanism to catch the fancy of social- 
cognitive researchers: parallel distributed memory (often 
called PDP) models. 

Based on models of distributed cognition developed by 
cognitive psychologists, this approach to mental representa- 
tion has been developed in a rather elaborate manner. An ex- 
cellent summary geared toward social-cognitive researchers 
was provided by Smith (1996). A detailed presentation of the 
assumptions of this approach is beyond the scope of this re- 
view, but a good general sense for the ideas embodied in this 
way of thinking is provided by a simple analogy used by 
Smith (1998). In distributed representations, a concept is rep- 
resented as a pattern of activation across a set of low-level 
processing units, in much the same way that a television 
screen image arises from a pattern of electrical activity occur- 
ring across a set of pixels. Any individual pixel does not con- 
vey meaningful information, but a very complex and richly 
meaningful representation can emerge from the pattern cre- 
ated by the activation’s occurring across multiple pixels. 
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Moreover, just as with the pixels in a television screen, the 
same processing units can be involved in numerous, very dif- 
ferent representations; meaning never resides in the process- 
ing units per se, but instead emerges from the overall pattern 
of their activity. According to this view, representations are 
clearly momentary states rather than enduring entities. In fact, 
whereas the other representational approaches make a dis- 
tinction between representational structures and the processes 
that operate on these structures, such a distinction is mean- 
ingless in the PDP approach because the process of activation 
itself is the representation from this perspective. According 
to the PDP approach, representations are not stored and re- 
trieved; instead, they are constructed and (approximately) 
reconstructed based on the relevant input cues. 

Extremely new to social psychologists, distributed- 
memory mechanisms have not yet won widespread applica- 
tion. Nevertheless, interest in their theoretical potential is 
growing, and PDP-based accounts for a diverse range of phe- 
nomena such as attribution, cognitive dissonance, and person 
perception have begun to appear (e.g., Kashima, Woolcock, & 
Kashima, 2000; Schulz & Lepper, 1996; Smith & DeCoster, 
1998; Van Overwalle, 1998). As Smith (1998) describes in 
careful detail, the PDP approach can provide an impressive 
degree of theoretical coherence because it can account for vir- 
tually all of the phenomena previously explained by asso- 
ciative network, schema, and exemplar models. Ultimately, 
Smith makes a convincing case that these alternative versions 
of mental representation are not really competitors, but in- 
stead are complementary windows, each with its own particu- 
lar theoretical usefulness. 


AUTOMATIC AND CONTROLLED PROCESSES 
IN SOCIAL COGNITION 


A great deal of social cognition theory and research is con- 
cerned with questions about the degree to which social infor- 
mation processing involves active, conscious analysis of the 
social environment. Historical models of person perception 
and attribution regarded the perceiver as operating as a “lay 
scientist” (e.g., Heider, 1958; Kelley, 1967), examining evi- 
dence and reasoning about its logical implications; research 
in this tradition was largely mute, however, with respect to 
whether these putative mental processes involved the con- 
scious application of deductive principles or processes of a 
more preconscious variety. As Gilbert (1998) observes, it is 
quite possible for a mental system to follow a reasoning 
algorithm without requiring that the conscious mind know or 
consciously apply the relevant principles. Mental processes 
that do not involve active, conscious ratiocination have come 
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to be called automatic or implicit social cognition and have 
been the subject of a massive amount of recent research. 

The contrast between conscious, effortful, controlled men- 
tal processes on one hand and unconscious, automatic ones on 
the other became a prominent issue in cognitive psychology 
largely due to influential papers by Posner and Snyder (1974), 
Shiffrin and Schneider (1977), and Hasher and Zacks (1979), 
yet there is quite a history of interest in the extent to which the 
mind might be operating in ways unknown to the conscious 
self. For example, researchers interested in human perfor- 
mance have long been interested in the processes involved in 
skill acquisition, whereby an initially novel task that requires 
considerable effort and attention becomes relatively auto- 
matic with practice (e.g., Fitts & Posner, 1967). After they be- 
come automated, skills can be triggered and used without 
much involvement of the conscious mind. In a different vein, 
psychoanalytically oriented researchers have been interested 
in how unconscious motivations might shape processes of 
perception and cognition (e.g., Erdelyi, 1974). Cognitive re- 
search of this sort addresses profound questions concerning 
who is running the show. Does the conscious self call the 
shots, or is the brain going about its business without much in- 
terference from the conscious thinker? In this section, we first 
review research on automatic aspects of social cognition, and 
then we consider the case that can be made for the capacity of 
the conscious mind to control and regulate processes of social 
cognition. Finally, we consider some of the ways in which au- 
tomatic and effortful processes can interact to determine 
jointly the course of perception, thought, and action. 


Automatic Social Cognition 


The foundations for social-psychological treatments of the 
issue of automaticity have been established in the work of 
Bargh (e.g., 1982; Bargh & Chartrand, 1999; Bargh & 
Ferguson, 2000). Synthesizing the insights emerging from 
disparate research areas touching on the issue of automatic- 
ity, Bargh (1994) argued that the notion of automatic mental 
processes is complex and multifaceted. He argued that the 
term has been used to refer to four distinct qualities of infor- 
mation processing: awareness, intention, efficiency, and con- 
trol. That is, a process tends to be considered automatic if it 
(a) occurs without the person’s awareness, (b) occurs without 
the person’s intention, (c) occurs with great efficiency and 
does not require much mental capacity, or (d) occurs in a 
manner that is difficult to prevent or stop. Not all four criteria 
are necessary for a process to be considered automatic. When 
one or more of these characteristics is present, the relevant 
process is often deemed to be relatively automatic. 


A particularly compelling and influential demonstration of 
the implicit operation of the mind was provided by Warrington 
and Weiskrantz (1968). Their research documented that indi- 
viduals suffering from anterograde amnesia, who are unable 
to consciously recollect their recent experiences, nevertheless 
showed a clear benefit from that experience in the perfor- 
mance of indirect tests of memory, such as completing word 
fragments. Although these patients have no explicit memory 
for the words they saw during a study period, they neverthe- 
less were better able to complete word fragments when the 
corresponding word had indeed been previously studied. This 
research clearly indicates that memories can be quite influen- 
tial even when there is no conscious awareness of the relevant 
prior episodes. 

Social cognition researchers have sought to investigate the 
role of awareness in social cognition in several ways. One 
approach has simply been to demonstrate that individuals 
are often unable to articulate accurately the factors that are im- 
portant in shaping their behavioral choices (e.g., Nisbett & 
Wilson, 1977). This fact obviously implies that people are 
generally unaware of the processes at work behind the scenes 
in the preconscious mind. Another approach to documenting 
that some processes occur without awareness has been 
adopted in research on priming. The basic idea of priming 
research is quite straightforward. Individuals are exposed to 
a task or environmental context that is designed to activate a 
particular mental representation. Then a second, ostensibly 
unrelated task is performed, and the researcher seeks to deter- 
mine whether the previously activated representation exerts 
any influence on information processing in the second task. 
Research of this sort conclusively demonstrates that concepts 
that have been activated in one context can continue to influ- 
ence social cognition in subsequent, unrelated contexts, by 
virtue of their enhanced accessibility (Higgins, 1996). Acom- 
mon effect of such priming is that subsequently encountered 
information is assimilated toward the activated concept. For 
example, Srull and Wyer (1979) showed that activating hostile 
concepts in a language-processing task caused participants to 
form more negative impressions of an ambiguous social target 
in a subsequent impression formation task, compared to par- 
ticipants who never had the hostile concepts activated in the 
initial task. It is typically assumed that this assimilation 
process occurs because the fortuitously activated concepts are 
used to disambiguate later information, and the perceiver is 
presumed to be oblivious to the fact that it is occurring. 

Perhaps the best evidence that priming effects occur with- 
out the perceiver’s awareness comes from research that 
employs subliminal priming techniques. In this research, 
concepts are activated by exposing participants to extremely 


brief stimulus presentations (see Bargh & Chartrand, 2000, 
for procedural details). Although perceivers are unable to 
describe the stimuli to which they have been exposed, they 
nevertheless show evidence of priming effects. We have 
already described one experiment by Devine (1989) that 
showed that subliminal activation of words associated with 
the African American stereotype caused perceivers to view 
an ambiguously aggressive target as more hostile, compared 
to individuals who had not been primed with the stereotypic 
concepts. Similar findings have been reported by other re- 
searchers (e.g., Bargh & Pietromonaco, 1982), confirming 
that priming effects can occur outside of the perceiver’s 
conscious awareness. 

It is usually assumed that for these assimilative priming 
effects to occur, not only must the relevant concept be accessi- 
ble, but it must also be applicable (Higgins, 1996). In line with 
this proposition, Banaji, Hardin, and Rothman (1993) demon- 
strated that priming gender stereotypes resulted in more 
stereotypical impressions of ambiguous targets, but only when 
the target was a member of the relevant gender group—that is, 
activating masculine concepts resulted in the perception of 
ambiguous male targets in a more stereotypical manner, but it 
largely did not affect perceptions of female targets. Con- 
versely, activating feminine concepts resulted in perceiving 
ambiguous female targets in a more stereotypical manner, but 
it did not affect perceptions of male targets. Although priming 
effects do operate under the constraints of applicability, the 
processes involved in using or failing to use activated concepts 
as a basis for disambiguating social targets appears to operate 
largely without any awareness on the perceiver’s part. 

It is not inevitably the case that priming results in assimi- 
lation to the primed concepts. For example, Herr (1986) 
demonstrated that when activated concepts are sufficiently 
extreme, they can produce contrast effects. A contrast effect 
is said to occur when an object is judged more extremely 
in the direction opposite to the activated concept. For exam- 
ple, if an ambiguous target were judged to be significantly 
less hostile after an African American stereotype had been 
activated (compared to an unprimed control group), this 
would constitute a contrast effect. The mechanism produc- 
ing contrast effects involves using the activated concept as a 
comparison standard rather than as an interpretive frame. 
Thus, in the case of Herr’s research, for example, the target 
person is compared to the activated standard and is conse- 
quently seen as relatively less hostile, given the extremity 
of the standard. The question of whether contrast effects 
occur automatically has been a matter of continuing theoreti- 
cal dispute (e.g., Martin, Seta, & Crelia, 1990; Stapel & 
Koomen, 1998). 
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Another hallmark of automatic processing is the occur- 
rence of unintended effects. The assimilative priming effects 
just reviewed certainly meet this criterion of automaticity, be- 
cause it is clearly not the case that individuals intend to use 
subliminally activated concepts to guide subsequent impres- 
sions. Another domain providing compelling evidence for 
unintended aspects of impression formation is research on 
spontaneous trait inferences. The question at stake in this re- 
search concerns whether social perceivers spontaneously 
infer that observed behavior implies that the actor has a 
corresponding personality trait. In historical models of this 
process of dispositional inference (e.g., Jones & Davis, 
1965), it was typically assumed that perceivers engage in a 
fairly extensive deductive reasoning process to determine the 
trait implications of observed behavior, comparing the effects 
of the observed behavior with the simulated effects of not 
performing it or of performing an alternative option. In con- 
trast, more recent research on spontaneous trait inferences 
suggests that perceivers automatically infer the trait implica- 
tions of behavioral information, even if that is not their con- 
scious intention. For example, Winter and Uleman (1984) 
presented participants with behavioral descriptions (e.g., 
Billy hit the ballerina) and subsequently asked participants to 
recall the presented descriptions with the aid of cues. The 
cues were either semantically related to the theme of the de- 
scription (e.g., dance) or were related to the trait implications 
of the behavior (e.g., hostile). Cued recall performance was 
markedly better when trait cues were available. In a different 
paradigm, Uleman, Hon, Roman, and Moskowitz (1996) 
showed that people spontaneously made trait inferences 
when processing behavioral descriptions, even when such in- 
ferences actually impaired performance of their focal task. In 
this paradigm, participants read behavioral descriptions on a 
computer screen. Immediately after the presentation of a de- 
scription, a word appeared on the screen and participants had 
to indicate whether that exact word had appeared in the pre- 
ceding sentence. When the target word was a trait that was 
implied by the behavioral description, reaction times were 
slower and error rates were higher than they were when the 
same target words followed similar descriptions that did not 
imply the traits in question. This kind of evidence suggests 
that fundamental aspects of social perception can occur quite 
spontaneously, without any conscious instigation on the part 
of the perceiver. 

Trait inferences are but one manifestation of unintended so- 
cial cognition. In a growing program of research, Bargh and 
colleagues have shown that without the formation of any 
conscious intention, primed or salient stimuli can trigger spon- 
taneous behavior (e.g., Bargh, Chen, & Burrows, 1996). For 
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example, Bargh et al. showed that activating stereotypes about 
elderly persons resulted in slower rates of walking. Similarly, 
Chen and Bargh (1997) showed that subliminal presentation 
of African American (as compared with European American) 
faces resulted in more hostile behavior in a subsequent verbal 
game played with an unprimed partner. Moreover, the un- 
primed partner’s behavior also became more hostile as a con- 
sequence, showing that self-fulfilling prophecies can emerge 
in a very automatic manner—even when participants are un- 
aware that stereotypical concepts have even been activated 
and have formed no conscious intention to act in a manner 
consistent with these concepts. Although the precise mecha- 
nisms responsible for these fascinating effects have not been 
isolated, the very existence of the phenomenon provides a 
potent demonstration of the potential automaticity of not only 
social thought, but also interpersonal interaction. 

A principal advantage of automatic reactions lies in the 
fact that they are largely not dependent on the availability of 
processing resources. Because of the great efficiency with 
which they unfold, automatic processes do not require much 
investment of attentional capacity or perceiver motivation. 
Whereas novice drivers can find it harrowing to coordinate all 
of the requisite activities (shifting gears, monitoring traffic, 
steering, braking, etc.), after the process has been automated, 
not only can these tasks be easily performed, but the driver 
may also have sufficient reserve capacity available for 
singing along with the stereo or engaging in mobile phone 
conversations. Empirical confirmation of the resource- 
conserving properties of automatic mental processes was pro- 
vided in a series of experiments by Macrae, Milne, and 
Bodenhausen (1994). In one of their studies, they asked 
participants to engage in two tasks simultaneously: a visual 
impression-formation task that involved reading personality 
descriptions of four different persons, and an audio task that 
involved listening to a description of the geography and econ- 
omy of Indonesia. For half of the participants, stereotypes 
were activated in the impression-formation task (by provid- 
ing information about a social group to which each target be- 
longed). Some of the personality information was consistent 
with stereotypes about the relevant group, and the rest was ir- 
relevant to such stereotypes. One might expect that giving 
these participants an additional piece of information to inte- 
grate would simply make their task all that much harder—but 
in fact, the introduction of the stereotype provided a frame- 
work that participants could spontaneously use to organize 
their impressions, making the process of impression forma- 
tion much more automatic and efficient. As a consequence, 
participants who knew about the group memberships of the 
social targets not only recalled more information about the 
targets (as revealed in a free recall measure), they also learned 


more information about Indonesia (as revealed in a multiple- 
choice test). The automatic reactions triggered by stereotype 
activation provided a clear functional benefit to perceivers by 
making the process of impression formation more efficient, 
thereby freeing up attentional resources that could be devoted 
to the other pressing task. 

When automatic effects of these sorts occur without 
awareness, intention, or much attentional investment, is there 
any hope of preventing them or stopping them after they start? 
In the realm of automatic stereotyping effects, Bargh (1999) 
has argued that the prospects for controlling such effects are 
slim to none. Indeed, the final hallmark of an automatic 
process is its imperviousness to control. In line with Bargh’s 
assertion, the previously described research of Devine (1989) 
showed that even low-prejudice individuals who disavow 
racist stereotypes are still prone to showing automatic effects 
of stereotype activation. Similarly, Dunning and Sherman 
(1997) found that implicit gender stereotyping occurred inde- 
pendently of participants’ level of sexism. However, other re- 
search has begun to suggest that at least some of the time, it 
may be possible to develop control over automatic processes. 
Uleman et al. (1996), for example, found that with practice, 
people could learn to avoid making spontaneous trait infer- 
ences. Similarly, it seems that egalitarian individuals can also 
learn to control automatic stereotyping effects, at least under 
some circumstances (e.g., Wittenbrink, Judd, & Park, 1997). 
It is toward the processes through which mental control can be 
achieved that we now turn our attention. 


Controlled Social Cognition 


The process of controlling thought and action, at least in rel- 
atively novel and unpracticed domains, requires attention. 
Whereas automatic processes occur efficiently and thus re- 
quire little expenditure of mental resources, effortful, con- 
trolled processes come with an attentional price to pay. 
Moreover, controlled processes typically require intentional 
deployment, and they occur in a manner that is at least par- 
tially accessible to the conscious mind. Whereas many com- 
putational processes of implicit cognition are regarded to be 
massively parallel, attention and consciousness represent a 
processing bottleneck that results in highly selective and 
serial information processing (e.g., Simon, 1994). As Simon 
notes, connecting one’s motives to one’s thought processes 
requires a system that can cope with the constraints imposed 
by limitations of attentional capacity. 

Attentional capacity has turned out to be a major theoreti- 
cal construct in social cognition research (for a review, see 
Sherman, Macrae, & Bodenhausen, 2001) precisely because it 
plays such a fundamental role in determining whether it 


will be possible for the perceiver to engage in controlled 
processing. Without sufficient mental resources, automatic 
mental processes are presumed to operate in an unchecked 
manner, and it is difficult or impossible for perceivers to im- 
pose their will and exercise control over the workings of their 
own minds. Early theorizing about attentional capacity as- 
sumed a simple, unitary structure to the mental resources that 
are used in conscious, controlled information processing. 
However, advances in cognitive neuroscience have made it 
possible to identify a more differentiated set of working mem- 
ory resources (e.g., Roberts, Robbins, & Weiskrantz, 1998). 
Baddeley (1998) proposed that there are three principal facets 
to working memory, each with a limited capacity for holding 
information: a phonological buffer, a visuospatial sketch pad, 
and a central executive. It is the latter resource that is most im- 
portant to social-cognitive theorizing, because it is the central 
executive that governs the conscious planning, execution, and 
regulation of behavior. When these executive resources are in 
ample supply, individuals are generally able to exercise a 
considerable degree of control over their conscious thought 
processes and behavioral responses; when these finite re- 
sources have been usurped by other ongoing processes, how- 
ever, the resulting executive dysfunction can put perceivers in 
the position of failing to produce intended patterns of thinking 
and responding. Under this circumstance, thought and action 
will be dictated more by potent automatic reactions than by the 
force of the conscious will. 

Research on mental control has undergone a dramatic resur- 
gence in the past decade (for an excellent sampling of research 
topics, see Wegner & Pennebaker, 1993). Wegner’s research 
on thought suppression has been a major impetus for this ex- 
plosion of research attention (e.g., Wegner, 1994; Wenzlaff & 
Wegner, 2000). In this research, the prospects for mental self- 
control have been investigated by providing participants with a 
self-regulatory injunction to consciously pursue (e.g., dont 
think about white bears or don t be sexist). Success is measured 
simply by the number of times the unwanted response is gen- 
erated, and success rates can be considerable—provided that 
the person has ample attentional resources. However, if a cog- 
nitive load is imposed on the person (e.g., a secondary task 
must be completed simultaneously, such as rehearsing an 
eight-digit number), not only are unwanted responses likely to 
emerge, but they are also likely to occur with even greater fre- 
quency than they would if the person had never tried to sup- 
press them in the first place (i.e., a rebound effect). 

Wegner (1994) proposed a theoretical account for this 
state of affairs; his account rests on the assumption that men- 
tal control reflects the operation of two separate processes. A 
monitoring process is responsible for checking to see whether 
undesired responses (e.g., sexist thoughts) are occurring. If it 
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should detect such responses, an operating process is trig- 
gered that serves to squelch the unwanted response by finding 
an acceptable substitute response (e.g., thoughts about a tar- 
get’s occupation rather than her gender). Crucial to his model 
are two additional assumptions. First, the monitoring process 
can do its work in a relatively automatic manner, but must of 
necessity keep active in memory (even if only at a relatively 
low level) a representation of the undesirable response so that 
it can be recognized if it should appear. Thus, the monitoring 
process ironically keeps an unwanted thought or response 
salient in the perceiver’s mind. This recurrent activation of 
the undesired target stimulus is not a big problem, so long as 
the operating process can counteract the unwanted response 
whenever it does exceed the threshold necessary for con- 
scious awareness. However, a second assumption of the 
model is that the operating process is relatively effortful and 
requires sufficient attentional resources. Hence, if these re- 
sources are being depleted by other tasks (e.g., rehearsing a 
digit string), the enhanced accessibility created as a byprod- 
uct of the monitoring process cannot be effectively checked, 
and the stage is set for rebound effects. 

These assumptions have been explored in the domain of 
stereotype suppression by several researchers. In the contem- 
porary social world, it has become largely taboo to respond to 
many stigmatized social groups in terms of negative stereo- 
types and prejudices that have historically been prevalent. In 
the previous section, we reviewed several pieces of evidence 
suggesting that stereotypes can exert numerous automatic ef- 
fects on information processing. If so, what are the prospects 
for success when perceivers strive to follow the dictates of 
cultural injunctions against thinking discriminatory thoughts 
about these stigmatized groups? In an initial demonstration, 
Macrae, Bodenhausen, Milne, and Jetten (1994) showed 
that individuals who strive to prevent stereotypical reactions 
from entering their thoughts can succeed as long as they are 
actively pursuing that objective. However, consistent with the 
implications of Wegner’s ironic model of mental control, this 
process rendered the unwanted thoughts hyper-accessible, 
and Macrae et al. found that after the suppression motivation 
had dissipated, rebound effects emerged when subsequent 
members of the stereotyped group were encountered. That is, 
participants reported even more stereotypical reactions to the 
subsequent group members than did individuals who had 
never engaged in any previous stereotype suppression. These 
findings confirm that intentionally suppressing stereotypes 
ironically involves repeatedly priming them, albeit at rela- 
tively low levels—and this in turn renders the stereotypes all 
the more accessible. If the operating process that is commis- 
sioned to direct attention away from unwanted thoughts 
should be compromised either by the imposition of a cognitive 
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load or by the dissipation of the motivation required for its 
activity (being a relatively effortful, controlled process), this 
in turn can lead to rebound effects. 

Additional ironic implications of stereotype suppression 
were uncovered in subsequent research. For example, trying 
not to think stereotypical thoughts about an elderly target re- 
sulted in better memory for the most stereotypical characteris- 
tics displayed by the target (Macrae, Bodenhausen, Milne, & 
Wheeler, 1996). Moreover, these effects are not limited to sit- 
uations in which an overt, external requirement for thought 
suppression is imposed; even when suppression motivation 
was self-generated in a relatively spontaneous manner, ironic 
effects were observed to result (Macrae, Bodenhausen, & 
Milne, 1998). Other research suggests that rebound effects of 
this sort are more likely to emerge in high-prejudice persons 
(Monteith, Spicer, & Toomen, 1998) and in situations in 
which the perceiver is unlikely to have chronically high levels 
of suppression motivation (Wyer, Sherman, & Stroessner, 
2000). These qualifications are quite consistent with general 
idea that even the process of mental control itself is subject to 
some degree of automation. With practice, the initial effortful- 
ness of stereotype suppression may be replaced by relative 
efficiency. 

Another form of controlled processing that has received 
considerable attention from social cognition researchers is 
judgmental correction. When perceivers suspect that their 
judgments have been contaminated by unwanted or inappro- 
priate biases, they may take steps to adjust their judgments in 
a manner that will remove the unwanted influence (e.g., 
Wilson & Brekke, 1994). Whereas the initial processes 
that produced the bias are likely to be automatic ones, the 
processes involved in correcting for them are usually consid- 
ered to be effortful. Hence, they require perceiver motivation 
and processing capacity for their deployment. One particu- 
larly noteworthy domain in which such hypotheses have been 
investigated is research on person perception. In particular, it 
has long been established that people are susceptible to a cor- 
respondence bias, in which they tend to perceive the behav- 
ior of others to be a reflection of corresponding internal 
dispositions—even when there are clear and unambiguous 
situational constraints on the behavior (e.g., Jones & Harris, 
1967; Gilbert & Malone, 1995). The previously described 
research on spontaneous trait inference is consistent with the 
idea that people often immediately assume that behavior 
reflects the actor’s dispositions. In an influential theoretical 
assessment of this bias, Gilbert (e.g., 1998) proposed that dis- 
positional inferences involve three distinct stages. In the cat- 
egorization stage, the observed behavior is construed in 
terms of its trait implications (e.g., Hannah shared her 
dessert with her brother could be categorized as kind). Then 


the inferred trait is ascribed to the actor in the characteriza- 
tion stage. Both of these stages are assumed to be relatively 
automatic —that is, they occur spontaneously, efficiently, and 
without intention. In a third correction stage, individuals may 
consider the situational constraints that might have influ- 
enced the behavior (e.g., Mommy threatened Hannah with 
retribution if she failed to share her dessert) and adjust their 
dispositional inferences accordingly (e.g., perhaps Hannah 
isn't so kind after all). This correction process is assumed to 
be a controlled activity that requires motivation and process- 
ing capacity for its execution. 

In numerous experiments, Gilbert and colleagues have 
pursued the implications of this model by demonstrating that 
situational constraints are often not taken into account when 
perceivers are given a taxing mental task to perform that oc- 
cupies their central executive resources (e.g., rehearsing a 
random digit string). For example, when watching a nervous- 
looking woman, people spontaneously assume that she is an 
anxious person; only subsequently do they correct this initial 
assumption in light of the fact that she is in an anxiety- 
provoking situation (e.g., a job interview). If they have to 
watch the seemingly nervous person while rehearsing a digit 
string, they still automatically infer the trait of anxiety, 
but they no longer engage in corrective adjustments in light 
of the situational constraint. This pattern of results is quite 
consistent with the idea that correction is a controlled, 
resource-dependent process. When attentional resources are 
diminished, the automatic tendencies of the system remain 
unchecked by more effortful control mechanisms. 

A more general treatment of the nature of correction 
processes has been provided by Wegener and Petty (1997) in 
their flexible correction model. According to this model, cor- 
rection processes operate on the basis of lay theories about 
the direction and extent of biasing influences. When people 
suspect that they may have fallen prey to some untoward in- 
fluence, they rely on their intuitive ideas about the nature of 
the bias to make compensatory corrective adjustments. For 
example, if they believe that their judgments of a particular 
person have been assimilated to stereotypes about the per- 
son’s gender group, then they would adjust those judgments 
in the opposite direction to make them less stereotypical in 
nature. Conversely, if they believe that their judgment of a 
target has been contrasted away from a salient standard of 
comparison, they will make adjustments that result in judg- 
ments in which the target is seen as more similar to the com- 
parison standard. Several points are important to keep in mind 
with regard to this correction process. First, it requires that 
the perceiver detect the biasing influence before the process 
can initiate (Stapel, Martin, & Schwarz, 1998; Strack & 
Hannover, 1996). Many automatic biasing influences are 


likely to be subtle and hence escape detection; as a result, 
no correctional remedy is pursued. Second, as a controlled 
process, it is likely to require motivation and attentional 
capacity for its successful execution. Third, if correctional 
mechanisms are to result in a less biased judgment, the per- 
ceiver must have a generally accurate lay theory about the 
direction and extent of the bias. Otherwise, corrections could 
go in the wrong direction, they could go insufficiently in the 
right direction, or they could go too far in the right direction, 
leading to overcorrection. Indeed, many examples of overcor- 
rection have been documented (see Wegener & Petty, 1997, 
for a review), indicating that even when a bias is detected 
and capacity and motivation are present, controlled processes 
are not necessarily effective in accurately counteracting auto- 
matic biases. 

Wegner and Bargh (1998) categorize several ways in 
which automatic and controlled mental processes interact 
with one another. The examples we have just described fall 
into the category of regulation—when a controlled process 
overrides an automatic one. When an automatic process over- 
rides a controlled one, as in the rebound effect, intrusion is 
said to occur. Controlled processes can also launch automatic 
processes that subserve the achievement of the actor’s mo- 
mentary intentions, and this is termed delegation. For exam- 
ple, delegation would be said to occur if a conscious goal to 
go to the shopping mall triggered the many automatic aspects 
of driving behavior. Conversely, automatic processes can 
serve an orienting function in which they launch controlled 
processes, as in Wegner’s model of mental control: When the 
automatic monitoring process detects an unwanted thought, it 
triggers the more effortful operating process to banish the 
thought from conscious awareness. Finally, controlled 
processes can be transformed into automatic processes via 
automatization, as when perceivers become so skilled at sup- 
pressing stereotypes that it happens automatically, and auto- 
matic processes can be transformed into controlled processes 
via disruption, as when one starts thinking too much about 
the steps involved in a well-learned task and subsequently 
performs the task more poorly. 

In many ways, the tension between automatic and con- 
trolled processes has become the heart of social cognition 
research. Most contemporary social cognition research pro- 
grams are oriented toward this issue in a fundamental way. 
One of the key insights to emerge from this research is that 
our perceptions of and reactions to the social world are often 
shaped by rapid, automatic processes over which we com- 
monly exercise very little control. By virtue of their very au- 
tomaticity, the impressions that are constructed on this basis 
often have the phenomenological quality of being direct rep- 
resentations of objective reality. We feel, for example, that 
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Mary is objectively a kind and caring person rather than 
recognize the role that our own biases (e.g., gender stereo- 
types) may have played in shaping this necessarily subjective 
interpretation. It may be possible to exercise control over 
these processes. If we pause long enough to entertain the pos- 
sibility that our perceptions of the world may contain system- 
atic biases, we can engage in suitable corrective action. This 
action, however, requires awareness, motivation, and atten- 
tional capacity. Without them, we may function more like 
automatons than like the rational agents we often fancy 
ourselves to be. 


SOCIAL COGNITION IN CONTEXT: 
MOTIVATIONAL AND AFFECTIVE INFLUENCES 


A common question asked of social cognition researchers is 
How is social cognition different from “regular” cognition? 
A common answer to this question is that whereas cognitive 
psychologists often study cognitive processes in a manner 
that is divorced from the real-life contexts in which these 
mechanisms operate, social-cognition researchers muddy the 
waters by attempting to add back some of the real-life context 
into their experiments. In real life, our mental processes 
occur within a complex framework of motivations and affec- 
tive experiences. Whereas most cognitive psychology exper- 
iments attempt to eliminate the role played by these factors, 
social cognition researchers have had to increasingly recog- 
nize that an understanding of how the social mind works 
must include a consideration of how basic processes of per- 
ception, memory, and inference are influenced by motivation 
and emotion. 

There have been a series of interesting debates in social 
psychology that take the form of questioning whether a par- 
ticular phenomenon can be explained in purely cognitive 
terms, or whether one must invoke motivational processes in 
order to account for it. One case in point is the tendency for 
people to form negative stereotypes about minority groups. 
This phenomenon has been studied for quite a long time, and 
many explanations for it focus on the perceiver’s motivations 
that are gratified by engaging in stereotyping of this sort. For 
instance, maybe perceivers derive feelings of superior self- 
worth by looking down on members of other groups (e.g., 
Adorno, Frenkel-Brunswik, Levinson, & Sanford, 1950) or 
by viewing their own group as positively distinct from other 
groups (Tajfel & Turner, 1986). Alternatively, negative 
stereotypes might arise in order to forestall feelings of guilt 
about social inequality (Jost & Banaji, 1994). Could the ten- 
dency to stereotype minority groups negatively ever be ex- 
plained in purely cognitive terms, without appealing to these 
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kinds of motivational explanations? Hamilton and Gifford 
(1976) produced an influential affirmative answer to this 
question. In a paper that stimulated dozens of subsequent in- 
vestigations, they argued that a simple cognitive principle 
could account for the tendency to believe that minority 
groups are more likely to possess negative characteristics. 
Specifically, they proposed that one could view stereotyping 
merely as a natural by-product of our tendency to pay more 
attention to distinctive events. 

In their experiments, they presented participants with in- 
formation about members of two groups of differing sizes. 
Although the information was predominantly positive, 
negative information was provided about both groups. An im- 
portant feature was that within each group, the ratio of nega- 
tive to positive information was exactly the same; thus, there 
was objectively no correlation between group membership 
and positivity of the available information. Hamilton and 
Gifford argued that if people have a tendency to attend more 
to distinctive information, they will (a) tend to pay more at- 
tention to information about the smaller (minority) group, be- 
cause it is more rare and hence distinctive; and (b) tend to pay 
more attention to negative information than to positive infor- 
mation, because it is also relatively rarer. This means that the 
negative information about the minority group will be espe- 
cially (doubly) distinctive, and it should tend to stand out 
more in perceivers’ memories and impressions of the groups. 
This pattern was in fact empirically observed. The partici- 
pants tended to form illusory correlations, in which the 
minority group was perceived to be characterized by negative 
qualities greater than those of the majority group. 

Although it has been debated whether distinctiveness per 
se is the factor that produces this pattern of illusory correla- 
tion (e.g., Fiedler, 1991), the interesting point is that in this 
situation, negative stereotypes arise in the absence of the 
usual motivational forces that had previously been assumed 
to be causally responsible for prejudice-related phenomena. 
It thus appears that sometimes the basic properties of our in- 
formation-processing apparatus can produce biases and 
distortions that have important consequences; this can occur 
independent of the perceivers’ particular motivational orien- 
tation. Yet anyone who has observed the phenomena of 
stereotyping and prejudice would instantly recognize that this 
account is at best only part of the story. The unsavory signa- 
ture of motivated distortion is written in a variety of stereo- 
typing phenomena (e.g., Fein & Spencer, 1997; Sinclair & 
Kunda, 2000). More generally, purely cognitive explanations 
for virtually any socially interesting phenomenon are likely 
to fall well short of providing a satisfying explanatory ac- 
count. It seems to us undeniable that the study of cognitive 
processes must ultimately be situated within the context of 


the mind’s affective and motivational dynamics, because 
there is no compelling way in which these various facets of 
mind can be meaningfully divorced from one another. In the 
present section, we attempt to provide a representative survey 
of the extensive evidence indicating that motivational and af- 
fective forces are indeed of central importance in understand- 
ing the dynamics of social cognition. In fact, it becomes 
impossible to think of motivation and cognition as separable 
phenomena after one develops an appreciation for the impli- 
cations of this evidence. 

As a Starting point for understanding the motivational un- 
derpinnings of social cognition, it is useful to consider the 
general categories of motives that have come under theoreti- 
cal and empirical scrutiny in social cognition research. We 
focus here on three such broad categories: epistemic motives 
(pertaining to the need to understand the social world), de- 
fensive motives (pertaining to the need to view oneself and 
one’s environment in pleasing and desirable rather than 
threatening ways), and social-adjustive motives (pertaining 
to the need for the acceptance and approval of others). 


Epistemic Motivation 


A fundamental motivation thought to underlie all of social 
cognition to various degrees is the desire to understand the 
people and events we experience in our daily lives. This mo- 
tivation undoubtedly arises from our basic desire to feel safe 
and in control of our lives. When we have a sense of under- 
standing the social world, interactions seem predictable and 
manageable. When we do not understand what is happening 
around us, we quite naturally feel disoriented and relatively 
helpless. This core need to figure things out must be balanced 
against a variety of constraints, most notably the constraints 
imposed by the information-processing limitations of our 
nervous systems (e.g., the finite capacity of working mem- 
ory). Given that we cannot engage in active processing of all 
potentially relevant evidence before forming an impression 
or making a judgment, our efforts to understand the world 
must necessarily be selective and rely to a certain extent on 
inference and supposition. Of primary importance is the sub- 
jective sense that we have a reasonably clear understanding 
of the situations we face in everyday life. 

An assumption shared by many social-cognitive theories 
is the idea that epistemic motivation varies across persons 
and situations, and this variability has important implications 
for the types of information-processing strategies that per- 
ceivers are likely to use in making judgments and choices 
(see Chaiken & Trope, 1999, for numerous relevant reviews). 
Stated simply, the assumption is that when such motivation 
is high, perceivers are likely to engage in more sustained, 


effortful, and detail-oriented analysis of the social envi- 
ronment. However, when epistemic motivation is low, per- 
ceivers are likely to rely on their immediate reactions, which 
often arise via the operation of automatic processes, the exer- 
cise of relatively simple judgmental heuristics, or both (e.g., 
Tversky & Kahneman, 1974). These fast, top-of-the-head 
reactions often seem to provide an adequate basis for under- 
standing the situation, so unless there is some particularly 
pressing reason to do so, pursuing more effortful and analytic 
strategies may seem unnecessary. 

One prototypical context within which these contingen- 
cies have been much studied is the domain of stereotyping. 
Recognizing the extent to which stereotypes can provide 
rapid, efficient appraisals of others, Lippmann (1922) argued 
that preconceptions about social groups serve a vital knowl- 
edge function for perceivers. Instead of getting to know each 
individual in terms of his or her own unique constellation of 
characteristics (which would certainly be a daunting task in 
complex, socially dense environments), we can rely to a great 
extent upon generalities that subjectively seem to be suffi- 
ciently accurate. Only in circumstances in which it is really 
imperative to know an individual with particular accuracy do 
we need to devote the extra time and energy necessary for 
going beyond a stereotypical impression. Many situational 
moderators of accuracy motivation have been investigated. 
For example, telling people that they will be held accountable 
for their judgments makes them much less reliant on stereo- 
types (e.g., Bodenhausen, Kramer, & Siisser, 1994). Being in 
a position of interdependence with (or dependency on) an- 
other person can provide an impetus to know the other person 
more accurately and can thereby also reduce reliance on sim- 
ple generalizations (e.g., Fiske & Dépret, 1996). The personal 
relevance or importance of a judgment also can raise fears of 
invalidity, motivating perceivers to invest more effort into the 
judgment-making process (e.g., Kruglanski & Freund, 1983). 
Finally, experiencing a loss of control can motivate greater 
information-processing effort in subsequent judgment con- 
texts (Pittman & D’ Agostino, 1989). Presumably, the desire 
to restore a sense of subjective control motivates careful 
attention to the details of the environment in order to provide 
a maximally accurate assessment of its contingencies. 

Epistemic motivation also varies across persons; certain 
types of individuals show a more chronic orientation toward 
relatively effortful and detailed impressions of the social 
world. Cacioppo, Petty, Feinstein, and Jarvis (1996) re- 
viewed an extensive body of research on an individual differ- 
ence variable that directly captures this tendency, the need for 
cognition. Persons high in need for cognition are likely to 
engage spontaneously in more elaborate analysis of social 
information, in part because they may simply enjoy figuring 
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things out. Whereas others may be quite content to rely on 
simple heuristics and stereotypes, individuals who possess a 
higher need for cognition are unlikely to do so, provided that 
they have the time and attentional resources available to think 
more deeply about the judgmental situation. Another disposi- 
tional quality that can motivate individuated rather than 
stereotypical impressions is found in persons who experience 
chronic loss of control (e.g., Edwards & Weary, 1993). Just as 
individuals who have experienced a situational loss of con- 
trol are motivated to repair their feelings of vulnerability by 
taking care to know the environmental setting accurately, so 
do persons experiencing more chronic problems with loss of 
control. Individuals who experience a chronic fear of invalid- 
ity for any reason are likely to have generally higher levels of 
baseline motivation for systematic or effortful information 
processing (see Kruglanski, 1996). Thus, greater accuracy 
motivation can arise either because of (positive) enjoyment 
of engaging in mental analysis or (negative) anxiety about 
making a mistaken judgment or choice. 

Apart from accuracy motivation, epistemic motivation 
also derives from the extent to which persons feel a strong 
need for closure (or its avoidance). According to Kruglanski 
and Webster (1996), the need for closure involves the desire 
for definite knowledge and the desire to avoid ambiguity and 
uncertainty. This need, which also varies across persons and 
situations, lends a sense of urgency to information processing, 
leading to a tendency to seize on initial reactions rather than 
pursue a more detailed analytical course. In addition, this need 
also invokes a desire to maintain closure. Situational varia- 
tions in need for closure can be created, for example, by ma- 
nipulating the deadline for judgments. Many investigations of 
the need for closure have examined more enduring, disposi- 
tional variations. Consequences of high need for closure have 
now been documented in many social settings. For example, 
de Dreu, Koole, and Oldersma (1999) showed that persons 
high in the need for closure were more likely use simplifying 
heuristics in a negotiation setting. Shah, Kruglanski, and 
Thompson (1998) showed that need for closure increased in- 
group favoritism and out-group derogation—both processes 
that can be attributed to the invocation of simple evaluative 
heuristics. When individuals want an answer quickly, they are 
thus unlikely to go beyond their initial reactions; to do so 
might necessitate addressing complexities and ambiguities 
that would only undermine the press for closure. 


Defensive Motivation 


The desire for predictability and control is presumably best 
served by the formation of accurate representations and 
judgments. Yet sometimes the truth hurts, and we would feel 
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better if we could avoid it. Thus, our need to know the social 
world inevitably involves a tension between accuracy and de- 
fensive motives (e.g., Taylor & Brown, 1988). Defensive mo- 
tives reflect our desire to see ourselves and our social worlds 
in desirable, positive ways and to avoid unflattering or threat- 
ening realities. This tension is reflected in research examin- 
ing the perceiver’s need to feel like a reasonable, rational 
agent. Although previously described research has suggested 
that people often apply stereotypes as a sort of default, only 
going beyond a stereotypical impression when accuracy mo- 
tivation is high and need for closure is low, there are some 
cases in which this tendency may be undermined by a differ- 
ent set of concerns. 

Sometimes individuals may be reluctant to apply stereo- 
types in their judgments of others because such stereotypes 
are considered socially undesirable or inaccurate. As re- 
viewed in the previous section, this kind of situation can mo- 
tivate effortful attempts to suppress stereotypes or otherwise 
correct for their influence on judgments. More generally, 
people may be reluctant to render judgments about others un- 
less they feel they have a defensible basis for doing so (e.g., 
Yzerbyt, Leyens, & Corneille, 1998). For example, if pre- 
sented with a male versus female target (e.g., just a picture 
and no other information) and asked to judge the person’s 
suitability for an engineering job, judges would probably be 
very reluctant to rely on sexist stereotypes. Under these cir- 
cumstances, they would very likely feel that they were not 
entitled to judge the person. However, if given a résumé to go 
along with the photo, perhaps containing evaluatively mixed 
credentials, they may then feel entitled to judge (and might 
very well rely on their sexist stereotypes under this circum- 
stance). In a different vein, some individuals typically do not 
rely on social stereotypes because they do not view persons 
(or groups) as having very stable, enduring qualities (Levy, 
Stroessner, & Dweck, 1998). For these persons, simply 
knowing a person’s group membership does not seem like a 
very informative basis for forming impressions, so they must 
satisfy their epistemic motivations by seeking out other kinds 
of data. Perceivers thus must balance their tendency to use 
simplifying generalizations with their desire to feel that they 
have a valid and reasonable basis for judging others. This 
latter desire can derive as much from defensive as from epis- 
temic motivations. 

Perhaps the most classic example of a defensive motive is 
the desire for self-enhancement. People want to think well 
of themselves and avoid confronting their own shortcomings. 
This powerful motivation has been examined in innumer- 
able psychological studies (for a review, see Pittman, 1998). 
The obvious implication for social cognition is that people are 
motivated to form self-serving impressions, and this tendency 


has been documented in many ways. To pick but one exam- 
ple, it has been found that people are more likely to activate 
and apply negative stereotypes when self-enhancement needs 
have been aroused by a recently experienced threat to self- 
esteem (Spencer, Fein, Wolfe, Fong, & Dunn, 1998). As pre- 
viously noted, one fundamental motivation for prejudice and 
stereotyping may be the fact that their application can provide 
a mechanism whereby the perceiver can feel superior to oth- 
ers (e.g., Fein & Spencer, 1997). In addition to economizing 
cognition, stereotyping thus can simultaneously gratify other 
motivational constraints. 

In addition to wanting to feel superior to others, we also 
want to feel impervious to harm and to believe that the world 
is fair and just. The phenomenon of “blaming the victim” 
(e.g., Lerner, 1998) is one important by-product of these 
profound needs. If bad things can happen to good people, this 
has disturbing implications for our senses of safety and 
justice. Consequently, we may come to view the victims of 
unfortunate circumstances as possessing qualities that pre- 
cipitated or otherwise can explain their unhappy fate. Lerner 
argues that these beliefs often operate in a primitive, implicit 
manner in shaping our impressions and blame reactions, 
rather than through a more conscious application of deduc- 
tive reasoning. Seen in this light, applying negative stereo- 
types to members of socially disadvantaged groups can be 
seen as a way of bolstering our sense that the existing system 
of social inequality is just and appropriate (see also Jost & 
Banaji, 1994). 

Perhaps the greatest threat to our sense of safety and 
invulnerability comes from the recognition of our own mor- 
tality. Research on terror management (e.g., Pyszczynski, 
Greenberg, & Solomon, 1999) suggests that we have a fun- 
damental motivation to defend ourselves against confronting 
our own eventual demise. One strategy for coping with this 
unpleasant reality lies in the creation and maintenance of 
broader worldviews that imbue life with a sense of meaning 
and purpose that extend beyond the life of the individual. In a 
series of studies, it has been shown that reminding people of 
their own mortality results in the motivation to bolster one’s 
cultural worldview. One way in which this can be accom- 
plished is by disparaging individuals who threaten or contra- 
dict one’s worldview, such as the members of other social 
groups (Schimel et al., 1999). Given the wide array of defen- 
sive motivations that are addressed by forming negative and 
hostile impressions of out-groups, the enduring manifesta- 
tions of intergroup conflict around the world may seem all the 
more intractable. 

Research of this sort shows that although accurate percep- 
tions are important to attainment of control, other powerful 
needs operate, pushing us toward perceiving the world in 


ways we want it to be (Kunda, 1990; MacCoun, 1998). Fortu- 
nately for the social perceiver, given the often-considerable 
ambiguity of social stimuli, the need to feel that one has 
accurate knowledge can often be met while simultaneously 
pursuing the need to feel good about oneself. But just what 
mechanisms are available to produce the desired self-serving 
impressions and judgments? There are many such mecha- 
nisms. First, perceivers may selectively attend to stimuli 
in ways that provide desired outcomes. In one recent demon- 
stration, Mussweiler, Gabriel, and Bodenhausen (2000) 
showed, for example, that when put in the threatening position 
of having been outperformed by another person, people tend 
to strategically focus on aspects of their own identity that 
serve to differentiate them from the upward comparison stan- 
dard. For example, a European American woman who is out- 
performed by an Asian woman may activate self-definitions in 
which her ethnicity is more salient. People generally find sim- 
ilar others to be more relevant bases for social comparison, so 
by emphasizing an aspect of her identity that differentiates her 
from a potential comparison standard, she renders that stan- 
dard less diagnostic for self-evaluation. Use of this identity 
differentiation strategy is indeed associated with greater posi- 
tive affect and enhanced situational self-esteem following an 
upward comparison. 

A particularly powerful demonstration of motivated selec- 
tivity in the use of identity dimensions was provided by 
Sinclair and Kunda (1999). In their research, they presented 
individuals with evaluative feedback that ostensibly came 
from a source that was simultaneously a member of both a 
positively stereotyped and a negatively stereotyped group. 
For example, the participants were either praised or criticized 
by an African American doctor. Having been criticized, par- 
ticipants were motivated to discredit the evaluator, and they 
tended to activate African American stereotypes while at the 
same time inhibiting doctor stereotypes. Conversely, having 
been praised, participants were motivated to imbue the eval- 
uator with credibility, so they tended to activate doctor 
stereotypes while simultaneously inhibiting African Ameri- 
can stereotypes. This research suggests an important mecha- 
nism whereby desired conclusions can be reached: By 
inhibiting stimulus dimensions that could challenge the pre- 
ferred impression, perceivers do not have to face their un- 
wanted implications. Selective attention is clearly a hallmark 
of motivated social cognition. 

Perceivers can also selectively sample from their memo- 
ries in order to reach desired conclusions. For example, 
Sanitioso, Kunda, and Fong (1990) showed that after receiv- 
ing information indicating that introverts (or, alternatively, 
extroverts) are more likely to enjoy positive academic and 
social outcomes, people selectively recalled past behaviors 
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that were consistent with the desirable characteristic. There 
are many ways in which we selectively construct autobio- 
graphical memories in order to confirm our desired 
beliefs about ourselves (e.g., Ross & Wilson, 2000). Similar 
processes may operate in our perceptions of others. That is, 
we may selectively remember the “facts” differently about 
liked versus disliked others, giving the benefit of the doubt to 
those toward whom we feel an affinity by recalling their most 
favorable moments; however, when we pause to think about 
those to whom we feel enmity, we may conjure up episodes 
when they were at their worst. Moreover, if confronted with 
an irrefutable set of facts, perceivers always have the option 
of explaining the facts in different ways. For instance, a liked 
individual (or group) will be assumed to be more responsible 
for a positive event than a disliked entity would be, whereas 
negative events may be seen as more situationally caused for 
liked (versus disliked) social entities (e.g., Pettigrew, 1979; 
Regan, Straus, & Fazio, 1974). Further, the perceived trait 
implications of a behavior can depend critically on whether 
we are motivated to think well or ill of the actor. An ambigu- 
ously aggressive behavior may be seen as disgraceful hostil- 
ity when performed by an African American, yet the same 
behavior may be seen as a playful interaction when per- 
formed by a European American (e.g., Sagar & Schofield, 
1980). Again, the inherent ambiguity of many social events 
lends itself to creative and selective interpretations and 
reconstructions. 

Perceivers can also apply differential evidentiary standards, 
depending on the desirability of the implied conclusion. Natu- 
rally, a more stringent criterion of proof is required for un- 
wanted or unpleasant conclusions compared to pleasing ones 
(Ditto & Lopez, 1992). That is to say, if an initial consideration 
of the evidence supports a desired conclusion, we may be quite 
content to stop, but if the initial implications are displeasing, 
we may sort through the evidence much more extensively and 
subject the counterevidence to our desired conclusions to par- 
ticularly harsh scrutiny. In this way, effortful reasoning can be 
engaged in the service of producing desired impressions and 
judgments. We also may estimate the likelihood of events at 
least partially in terms of their desirability. This form of wish- 
ful thinking appears to be a ubiquitous source of bias in belief- 
based reasoning (McGuire, 1960). However, the fact that our 
expectations tend to covary with our desires can also reflect 
the simultaneous operation of a mechanism whereby desires 
are constrained by reality—that is, just as we may want to 
think that desirable events are more probable, we may also 
determine what it is that we desire in part by assessing its 
attainability. 

It is thus evident that the wily social perceiver has many 
strategies for getting what he or she wants. Via selective 


276 Social Cognition 


attention, memory, and interpretation, the world can be seen 
as a flattering, safe, desirable place. These positive biases 
may provide important coping resources for us (Taylor & 
Brown, 1988). However, it is important to recognize there are 
always some reality constraints in operation when we per- 
ceive the social environment. It is only when a suitable justi- 
fication can be constructed that the perceiver is free to 
indulge in these positive illusions. Given the typical degree 
of ambiguity in social reality and the range of motivational 
strategies that are available, it may only rarely be the case 
that reality constraints are completely impervious to the dis- 
torting influence of defensive motives. 


Social-Adjustive Motivation 


The need for belonging and interpersonal acceptance is an- 
other powerful motivational force acting on social perceivers, 
as decades of research on normative social influence have 
documented (e.g., Baron, Kerr, & Miller, 1992). A major im- 
plication of this body of research is that social perceivers will 
be motivated to perceive the world in ways that win them ac- 
ceptance and approval and that make them feel like worthy 
members of their social groups. One major component of this 
tendency is simple conformity to the impressions and judg- 
ments of others. For instance, hearing information that con- 
dones or criticizes prejudice can influence the types of 
attitudes that an individual expresses (e.g., Blanchard, Lilly, & 
Vaughn, 1991). One might argue that such an effect merely re- 
flects simple compliance with clear situational demands and 
does not necessarily reflect motivated distortion of the per- 
son’s true inner judgments and impressions. However, similar 
findings have emerged even when relevant social norms are 
activated in very subtle and indirect ways, and when there is 
no audience that will be aware of whether the person con- 
formed or failed to conform to the apparent social consensus 
(Wittenbrink & Henley, 1996). 

There is also interesting evidence that belongingness 
needs can direct social attention and memory. Gardner, 
Pickett, and Brewer (2000) had participants engage in inter- 
actions in a computer chat room. The nature of the interaction 
was manipulated so that the participants would have social 
acceptance or rejection experiences. Following social rejec- 
tion, belongingness needs were expected to be activated and 
to guide subsequent information processing. After the chat 
room experience, all participants read a diary that contained 
information about both social and individual events. As ex- 
pected, in a subsequent memory task, the individuals who had 
experienced exclusion in the chat room were significantly 
more likely to remember the social information contained in 
the diary. This finding confirms the long-standing claim that 


the momentary needs and goals of the person are likely to 
play an important directive role in social cognition (e.g., 
Bruner, 1957; Jones & Thibaut, 1958; Klinger, 1975). The 
pursuit of belonging is just one of many possible goals that 
can serve this directive function, and a recent focus of empir- 
ical attention has been on the mechanisms through which 
goals guide cognition down a path toward desired outcomes 
(e.g., Bargh & Barndollar, 1996; Gollwitzer, 1990). 


Affective States 


The study of emotion is intimately tied up with the study of 
motivation. Just as the perceiver’s motives can influence the 
extent and direction of social cognition, so too do affective 
states play a regulatory role in shaping the course of social in- 
formation processing. Moods and other emotional states can 
direct memory toward affectively congruent material (e.g., 
Forgas, 1995), influence which dimensions and attributes 
of objects are salient (e.g., Niedenthal, Halberstadt, & 
Innes-Ker, 1999), and lead perceivers to interpret ambiguous 
social stimuli in a manner that is consistent with the implica- 
tions of their affect (e.g., Keltner, Ellsworth, & Edwards, 
1993). Affective states can influence the perceived likelihood 
of events (e.g., Johnson & Tversky, 1983) and can them- 
selves be used as information directing judgments when per- 
ceivers interpret their affect as being a reaction to the object 
of judgment (e.g., Clore, Schwarz, & Conway, 1994). 

In keeping with major themes of the present review, affec- 
tive and arousal states have also been hypothesized to influ- 
ence attentional capacity and epistemic motivation. Thus, 
they may play a role in determining the extent to which social 
impressions are based primarily on relatively automatic, 
immediate reactions or instead are based on more controlled, 
analytic assessments. Evidence consistent with these possi- 
bilities has emerged in many domains of social cognition, 
including the study of stereotyping. For example, several 
studies suggest that happiness is associated with a tendency to 
think less extensively about the social environment. Instead, 
happy people often appear content to rely on their generic 
knowledge about social groups rather than taking the trouble 
to engage in extensive individuation of particular group mem- 
bers (e.g., Bodenhausen, Kramer, & Siisser, 1994; Park & 
Banaji, 2000; for a review, see Bodenhausen, Mussweiler, 
Gabriel, & Moreno, 2001). Happiness may confer a sense of 
confidence in initial top-down impressions that makes effort- 
ful thought processes seem subjectively unnecessary. Fluctu- 
ations in arousal can also influence information-processes 
resources and thereby moderate the extent of reliance upon 
stereotypical generalizations. For example, Bodenhausen 
(1990) showed that stereotype-based discrimination covaried 


with circadian fluctuations in mental energy. When “early 
birds” were tested in the morning, they showed little evidence 
of reliance on stereotypes, but later in the day they were much 
more likely to render stereotypical judgments. Conversely, 
“night owls” were highly likely to make stereotypical judg- 
ments in the morning, but not in the afternoon or evening. 
These findings suggest that low levels of circadian arousal 
represent a risk factor for intergroup discrimination because 
perceivers will lack the mental resources to marshal the effort 
necessary for forming more individuated impressions. At the 
other end of the spectrum, it is also the case that excessive 
amounts of arousal can prompt greater reliance on stereo- 
types, presumably by disrupting attentional processes (e.g., 
Kim & Baron, 1988). 

One emerging trend in the literature on affect and social 
cognition is the examination and comparison of the effects of 
integral versus incidental sources of affect (e.g., Bodenhausen 
et al., 2001). Most of the previous research has focused on 
moods and other affective states that were triggered in con- 
texts unrelated to the current information-processing situa- 
tion. This incidental affect can be contrasted with feelings and 
emotions that arise in reaction to the present situation itself. 
This latter, integral affect, comes in two important varieties. 
Chronic integral affect refers to enduring feelings one has 
about the individual(s), group(s), or setting present when an 
interaction is transpiring, whereas episodic integral affect 
refers to momentary feelings triggered in a particular interac- 
tion. For example, if one has a fear of dentists, a trip to the 
dentist’s office will be imbued with chronic negative integral 
affect; however, if this particular trip happens to go very well, 
the episodic integral affect may end up being quite positive. 
The affective dynamics of social behavior are very likely to 
involve both of these kinds of integral affect, as well as the in- 
cidental affective background of moods that are brought into 
an interaction from previous unrelated events. Whereas a rich 
set of theory and data has emerged to study the incidental side 
of the picture, the role of integral affect in social cognition is 
only beginning to be explored (e.g., Moreno & Bodenhausen, 
2001; Perrott & Bodenhausen, 2002). In any case it is clear 
that the affective and motivational context of social cognition 
will continued to be explored with great vigor as researchers 
attempt to reunite thinking and doing with feeling and 
wanting. 


CONCLUSION 


There have been many debates about the appropriate defini- 
tion of social cognition—many reflect attempts to circum- 
scribe the content domain or topics that fall within its purview. 
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In our view, social cognition is not so much a topic area as a 
general perspective that can be applied to virtually any social 
psychological topic in which one is interested. In keeping with 
this perspective, we have reviewed the central conceptual 
themes of social cognition research, including the form and 
nature of mental representations, the automatic and effortful 
use of such representations, and the ways in which these 
processes are modulated by the motivational and affective 
context within which they occur. Although we produced ex- 
amples of the use of these general principles from a limited 
number of topic areas (often focusing on stereotyping as a pro- 
totypical example), they could be (and have been) applied in a 
host of content domains, including group decision making, 
interpersonal conflict, relationship development, social influ- 
ence, political judgment, marketing and consumer behavior, 
academic and athletic performance, and countless others. The 
fruitfulness of these various applications shows that much ex- 
planatory power can be gained when psychologists explore 
the workings of the so-called black box, using objectively ob- 
servable aspects of task performance to derive and test infer- 
ences about how the mind goes about its business. Inevitably, 
much of that business is social in nature. The business of 
studying social cognition is to unravel the mysteries of our 
socially embedded minds. 
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In everyday conversation, Spaniards occasionally describe 
someone as being emocionado(a). To be emocionado means 
to be emotional, but this translation is misleadingly simple. 
Whereas English speakers use the phrases to be emotional 
and to have an emotion largely interchangeably, Spaniards 
make a clear distinction between estar emocionado and sen- 
tir una emociön. Emocionado is perhaps better rendered into 
American English metaphorically as “to be touched” or “to 
be moved” (as a psychological state); emocionado can be 
used in either positive or negative contexts. Spaniards recog- 
nize different expressive behaviors for emocionado and 
emociön, even when both occur in a positive context. For 
example, a Spanish journalist described two medal winners 
on an Olympic podium, one smiling and the other crying. 
The journalist described the smiling woman as alegre 
(joyful) and the crying woman as emocionada (Fernandez- 
Dols & Ruiz-Belda, 1995). Emocionado is an emotional state 
distinct from specific emotions such as anger or joy. In fact, 
as early as 1921, Gregorio Marañón, a Spanish doctor, 
pointed to Spaniards’ use of emocionado as a recognition 
of the nonspecific nature of visceral changes in emotion 
(Ferrandiz, 1984). If emocionado denotes an emotional state 
not recognized clearly in English, Spanish may segment 
emotional experience in a subtler way than does English. 
Ethnographers’ and historians’ descriptions of remote or 
past cultures reveal many more examples of different ways of 
talking about emotion. For example, Tahitians lack the word 
sadness entirely (Levy, 1973). Even where a similar word 
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exists, it may cover different experiences—just as the Eng- 
lish word sadness has covered different experiences during 
different historical periods (Barr-Zisowitz, 2000). 

Another observation from the ethnographic record is vast 
differences even when an emotion word appears the same. 
Consider two societies, both of whom have words easily 
translatable as anger. In Never in Anger, Briggs (1970) de- 
scribes an Utku family in the Canadian Arctic; the Utku smile 
and laugh off situations that would make most of us angry. 
They endure with patience and humor situations that would 
drive us to fury. The clearest case of an Utku’s anger recorded 
by Briggs was particularly telling. A group of visiting 
Kapluna (White) sports fishermen borrowed a canoe and 
damaged it. It was one of only two canoes the Utku band pos- 
sessed. The fishermen later asked to borrow the other canoe. 
Damage to this second canoe would endanger the future 
livelihood of Briggs’s Utku family. Briggs was the interpreter, 
and she refused the fisherman’s request, becoming overtly 
angry with them. The Utku elder for whom she was translat- 
ing did not react with anger toward the fishermen, who were 
to be shown indulgence and forgiven, as a child would be. But 
he did react with ningag to Briggs. He found her angry out- 
burst so inappropriate that she was ostracized for several 
months. The Utku never see anger (ningag) as justified. The 
Utku believe that angry feelings, by themselves, with no 
mediation, can harm others or even kill them. For an Utku, to 
experience ningaq is to experience oneself as unjustifi- 
ably harboring murderous feelings—this in a society in which 
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kindness and tolerance are expected of all adults and are even 
considered to define what it means to be a mature functioning 
human (Kaplunas—White people—are suspected of being 
descended from dogs and to have the minds of children). 

Our second case comes from the Ilongot, a group indige- 
nous to the Philippines and studied by Michelle Rosaldo 
(1980). Their word commonly translated as anger is liget. A 
young man is restless, frustrated, mulling over past insults. 
He is envious of the privileges of his elders and the successes 
of his peers. His liget mounts and weighs down on him. 
There will likely be other such young men, competing, envi- 
ous of one another, and heavy with frustration and boredom. 
Led by an elder man, a small group of such youths conducts 
a raiding party, sometimes against a known enemy, but more 
often against a random victim (man, woman, or child). Dur- 
ing the raid, the young men beat their heads to increase their 
liget. The liget mounts and weighs on them. It is felt as 
heavy and oppressive, an unrelieved yearning. Finally, the 
victim is selected and killed (it does not particularly matter 
by whom). The young men rush at the victim, slashing and 
mutilating. The victim’s head is severed, and one youth 
throws the head in the air. Now the liget is felt as “burning 
joy.” The young men feel lightened, freed from the heavy 
burden. Filled with liger, the party is intent on mutilation. 
They slash, mutilate, and toss heads. They then raid the 
home of the victim, destroying property. As they return to 
their homes, to keep their liget hot, they drink hot drinks. 
They return in triumph as men. 


QUESTIONS ARISE FROM 
CULTURAL DIFFERENCES 


Whatever the validity of these specific observations, these 
and many more like them have been important in the psychol- 
ogy of emotion. For one, reactions to such observations reveal 
diverse assumptions. One reaction to claims of this sort has 
been to conclude that emotions among Spaniards, Tahitians, 
Tlongot, and Utku are different from emotions in English- 
speaking societies. Another reaction has been to conclude that 
perhaps each language has a different way of describing emo- 
tions (but that does not mean that the emotions themselves are 
different). Another reaction has been to conclude that emo- 
tional experience is culture-specific but that emotions are not. 
Yet another reaction is to dismiss much of the ethnographic 
and historical evidence as concerning mere talk—nothing 
important for those who would study emotion. 

When confronted with claims of cultural and historical 
differences in the concept of sadness or with anecdotes of 
cultural variability of anger, psychologists must face difficult 


questions: Is anger of the Utku the same emotion as anger in 
the Ilongot? How could that sameness be empirically tested? 
What, if anything, is the anger behind the manifest differences? 
Is anger universal? Or could it be a cultural artifact? Answers 
to such questions follow predictable theoretical positions on 
fundamental issues such as what is real and what is not, what 
are legitimate topics in science, and what is the relationship 
between mind and body and between language and reality. 

The issues raised by reports of cultural differences have 
not been settled by available evidence. There is no consensus 
on such matters. Or even on how they could be settled. In- 
stead, different researchers assume different positions, based 
on deeply held philosophical assumptions. These often- 
unvoiced assumptions then guide scientific theorizing, divid- 
ing the field into camps that each pursue different goals with 
different methods. For example, an approach to emotions as 
universal natural entities independent of language and culture 
stems from (or resonates with) a basic philosophical position 
that could be labeled as ontological realism. In contrast, an 
approach to emotions as cultural products created through 
language stems from (or resonates with) a philosophical po- 
sition that could be labeled as nominalism. Psychological the- 
orists may not endorse all the traditional consequences of 
their philosophical assumptions, and most theorists introduce 
ways to accommodate data grounded on other assumptions. 
Nevertheless, exposure of these philosophical assumptions 
can help us understand some of the sources of strength and 
weakness in current research on emotion. 

In this chapter, we first outline four philosophical posi- 
tions (in necessarily overly simplified and stark terms) that 
seem to underlie different research programs on emotion and 
that center on the issue of the relation between language and 
reality. (Does language accurately describe reality? Influence 
reality? Constitute reality? Or does everyday language con- 
ceal and obscure reality?) We then explore one way that these 
research programs might be integrated. 


Ontological Realism 


The ontological realist assumes that words such as anger and 
sadness are simply labels for preexisting entities. Emotions 
are like rivers or lakes or other things in the natural world. 
They are self-contained and distinct from any other thing. 
They have a concrete localization inside human beings and 
other animals (“inside” nowadays often means “in the brain”). 
With scientific effort, emotions will be isolated, localized, 
measured, and manipulated. From this point of view, it would 
not be surprising if each specific emotion were discovered to 
correspond to a single neural center, neural circuit, peptide, or 
some other specific physical entity. 


Emotions existed long before culture or language. At best, 
language can provide labels for different emotions. Of course, 
different languages provide different labels, but these point to 
the same preexisting reality: Just as luna and moon are differ- 
ent labels for the same entity, so are anger, ningagq, and liget 
just different labels for the same entity. Language is useful 
only in providing labels, and most talk about emotion is of lit- 
tle interest to the scientist and can often obscure or conceal 
reality behind the words, as in romantic or metaphoric talk 
about the moon. 


Nominalism 


Nominalism is thought by some to have started with the me- 
dieval philosopher Ockham, who broke with many of the 
philosophical assumptions of his contemporaries. Ockham 
taught that there exist only individual events and things (such 
as Briggs’s reaction that day to the Kapluna fishermen). Indi- 
vidual events or things (even those called by the same name) 
do not share with each other some Platonic essence. Names 
for general classes of events or things (e.g., emotion or anger) 
are therefore misleading. Sometimes some events look simi- 
lar enough for an observer to group them together and give 
them a common name—hence nominalism. Through lan- 
guage, people can name general groups of these individual 
events and talk about the type in general. Nevertheless, such 
groupings are always arbitrary, in the sense that the only 
thing real is the individual. A nominalist position is thus 
skeptical about any claims about reality outside individual 
events and words themselves. 

In a modern version of nominalism, the emphasis is on the 
role of words. Words differ from one society to the next or 
one historical era to the next, and that is the reality to be ana- 
lyzed. As words, anger, liget, and ningaq are important in 
their own right, rather than as labels for a common entity 
(Harre, 1986). An extreme version of this approach asserts 
that these words lack any denotation. Instead, they are simply 
cultural practices (e.g., Lutz, 1988). Another version would 
be the belief that there do exist individual events, but these 
individual events take on the meaning of anger, liget, or 
ningag only by being labeled. For example, one approach to 
emotion words is to study them only as part of discourse and 
focus on pragmatics of their use. (What is the consequence, 
in Utku society, of accusing someone of being ningaq?) 
Emotion words, as part of discourse, create an object (the 
emotion) that exists only in the context of the speaker’s 
construed social reality: Words create a cultural, idiosyn- 
cratic illusion that is the emotion itself. Anger liget, and 
ningag are therefore not comparable and cannot be under- 
stood outside the culture in which they fulfill an important 
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role in the regulation of everyday interaction. From this 
point of view, much of the psychology of emotion is the im- 
position of a Western construction on other cultures, which 
ignores the implicit symbolic structure that gives shape and 
meaning to each potential candidate for the label emotion in 
that culture (Shweder & Haidt, 2000). 


Conceptualism 


A conceptualist position shares with ontological realism its 
assumption that emotion words refer to a nonlinguistic real- 
ity, its interest in that reality rather than in words, and its skep- 
ticism about the ability of language to reveal that reality. The 
conceptualist, however, takes such words as anger and liget 
as concepts rather than as labels for entities. There are many 
ways to construe reality. Thus, any inference to emotions as 
independent, real entities, while possible, is suspect. The na- 
ture of the reality so conceptualized is the focal question. For 
example, one might hold that when people use an emotion 
word, they are pointing to a physiological, behavioral, or sit- 
uational event—something observable—and not to any emo- 
tional entity. The scientist’s job is to search for an objective 
account of the actual processes commonly conceptualized as 
anger, liget, or ningaq. Behaviorist, functionalist, and situa- 
tionist approaches to emotion arise from this philosophical 
background. 


Formalism 


The formal approach treats emotion words as formal objects, 
much like numbers or logical operators. As in the nominalist 
approach, the focus is again on language, although in this 
case it is on the semantics rather than pragmatics of language. 
Emotion and emociön are first and foremost words. What are 
the necessary and sufficient features for emotion and emo- 
ción? Or for anger, liget, or ningaq? These terms may have 
both common and distinguishing features, which would re- 
veal universal and language-specific aspects of these words, 
respectively. Rather than simply assume that anger = liget = 
ningag, the researcher seeks to provide a formal analysis of 
each word. Words are linguistic phenomena, parts of a partic- 
ular language. Each specific language is a cultural product, 
but language in general has universal aspects. 


DIFFERENT RESEARCH PROGRAMS 
ON EMOTION 


The study of emotion is guided by deep assumptions that res- 
onate with old philosophical debates. The result is that differ- 
ent and apparently incompatible research programs have 
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arisen that provide different frameworks for research and ap- 
plications in the field. From one program to the next, there is 
no agreement about the meaning and scientific usefulness of 
words such as anger, sadness, and the like or even of emotion 
itself. In this section we describe several of these programs. 
Although we emphasize the philosophical assumptions 
guiding each program, we do not imply that individual theo- 
rists endorse these philosophies explicitly, consistently, or 
exclusively. 


Emotions as Entities 
The Facial Expression Program 


Ontological realism comes close to the philosophical assump- 
tions of the person in the street and remains the dominant po- 
sition in the psychology of emotion. (See Lillard, 1998, for a 
discussion of the ontological realist assumptions of the con- 
cept of mind.) Emotions are natural entities. By “natural,” we 
mean that emotions are now viewed as biological products of 
evolution. By “entity,” we mean (a) that an emotion could, at 
least in principle, be isolated from its surrounding context 
(i.e., from its eliciting stimulus and behavioral and physio- 
logical consequences) and still be the emotion that it is and 
(b) that an emotion has causal powers (fear causes flight and 
love makes one care for the loved one). Thus, in the days of 
faculty psychology, emotion was a faculty. 

The ontological position can be seen in much of the re- 
search conducted on emotion, but its major theoretical repre- 
sentatives today were inspired by Silvan Tomkins (1962, 
1963). Tomkins was a psychiatrist with a vast range of inter- 
ests and a formidable intellectual curiosity. Tomkins’s influ- 
ence on two creative, enthusiastic scientists, Carroll Izard 
and Paul Ekman, was a powerful tool in spreading his ideas. 
Together, they created the Facial Expression Program (FEP; 
Russell & Fernändez-Dols, 1997), arguably the most influen- 
tial network of assumptions, theories, and methods in the 
psychology of emotion. The FEP combined ontological as- 
sumptions about emotion with modern scientific concerns 
about the evolutionary origins, neural mechanisms, and pre- 
cise physiological correlates of emotion. 

In this framework, the kinds of cultural differences with 
which we began are acknowledged. Ekman (1972) named 
his own theory neurocultural. Culture influences the observ- 
able elements surrounding emotion, but not the unobservable 
emotion itself. Members of different societies learn to have 
different emotions in given situations: A food that produces 
pleasure in one society can produce disgust in another. And 
society regulates (through display rules) the observable man- 
ifestations of each emotion: A society might believe that boys 


should show a brave face even when sad or frightened. These 
cultural differences are not taken to challenge the reality or 
universality of the emotions themselves. 

Izard and Ekman traced their intellectual roots to Charles 
Darwin. Darwin’s Expression of the Emotions in Man and 
Animals was an extended argument for human evolution and 
against the then-popular belief that most muscles of the 
human face were God’s creations designed exquisitely for the 
expression of emotion (Montgomery, 1985). Darwin’s strat- 
egy was to show that expressions are not simply expressions 
at all but vestiges of formerly instrumental actions. (A facial 
expression of anger with bared teeth does not simply express 
anger but is a genetically transmitted habit of baring the teeth 
when preparing to bite.) Emotions and movements (expres- 
sions) were described according to the everyday categoriza- 
tion of nineteenth-century English society. Darwin was a 
great empirical scientist, but his views on emotion were com- 
monsense assumptions in the tradition of academic treatises 
of the seventeenth and eighteenth centuries. His three princi- 
ples of expression do not mention emotion, and his book is 
focused mostly on the physiology of expression. 

In the hands of Izard and Ekman the emphasis shifted 
back to facial movements as genuine expressions and even 
more to the emotions expressed. Darwin’s research became 
the search for universal entities (now called basic emotions) 
behind human faces. His findings of similar movements 
across cultures, ages, and species became a finding of similar 
emotions across cultures, ages, and species. Ekman and Izard 
transformed Darwin’s vague and open-ended list of emotions 
(e.g., meditation, hunger, determination, love, low spirits, 
despair) into a closed list of basic emotions. Ekman (1972) 
included happiness, fear, sadness, anger, surprise, and dis- 
gust and more recently added contempt (Ekman & Friesen, 
1986) and shame (Keltner, 1995). Basic emotions are 
prepackaged neural programs that can be detected in all 
human beings as well as in other species. Other emotions, 
such as love, jealousy, shame, emocionado, liget, or ningaq 
are blends, mixtures, subcategories, or synonyms of the basic 
emotions. 

Although different theorists have proposed somewhat dif- 
ferent theories, a list of the prototypical principles of the FEP 
would include the following: 


1. There is a closed (although revisable) list of basic 
emotions. 


2. Basic emotions are discrete entities. 
3. Basic emotions are genetically determined and universal. 


4. Each basic emotion produces a coherent and unique pat- 
tern of facial and vocal signals, conscious experience, 
instrumental action, and physiological changes. 


5. All emotions other than the basic ones are subcategories 
or mixtures of the basic emotions. 


6. Signals for basic emotions are recognized by any normal 
human being. 


7. Voluntary facial expressions are deceptive and culturally 
determined. 


The FEP stimulated the gathering of a vast quantity of 
data, much of it aimed at establishing one fact: Across a range 
of ages and cultures, human beings can attribute the same 
emotions to select facial configurations. But this fact (assum- 
ing it is a fact; cf. Russell, 1994) would establish only one of 
the basic principles of the FEP. For example, research has not 
yet shown that an allegedly universally recognized facial ex- 
pression is a manifestation in all human societies of the very 
emotion recognized. Indeed, available data make this as- 
sumption doubtful (see Camras; 2000; Fernändez-Dols & 
Ruiz-Belda, 1997; Fridlund, 1994). 

The FEP has generated much valuable data on how people 
associate emotion names with facial expressions and on 
physiological or vocal patterns of those said to have those 
emotions. Curiously, no data have been gathered to establish 
the existence of anger, fear, and other basic emotions beyond 
the facial configurations, vocal and physiological patterns, 
and so on from which the emotion is inferred—that is, be- 
yond the emotion’s observable manifestations. Instead, the 
emphasis has been on the importance (in Darwinian terms, 
the adaptiveness) of emotion. This approach is reminiscent of 
another argument in the ontological tradition: Anselm’s onto- 
logical argument for the existence of God, by which the 
meaning of the word God implies the necessity of his exis- 
tence. By definition, God is perfect, but nonexistence would 
be an imperfection and therefore a contradiction. Emotions 
are important (adaptive), but a nonexistent entity could not be 
important. 


Appraisal Theories 


The clearest candidate for the research program that is re- 
placing FEP as dominant in the psychology of emotion today 
is known as appraisal theory. Appraisal theory (see Frijda, 
1986; Lazarus, 1991; Scherer, Schorr, & Johnstone, 2001) 
shares with FEP the assumption that emotions are adaptive 
entities that have evolved to respond quickly to recurring 
important circumstances. Appraisal theories can be thought 
of as a development of FEP in which emphasis is put on a 
cognitive step between those circumstances and the emotion 
(event — appraisal — emotion). (Some versions of appraisal 
theory assume that appraisals are a part of the emotion). An 
appraisal provides an explanation for (a) which situations 
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elicit which emotions and (b) individual differences in the 
stimulus-response link. (For example, if you appraise dogs as 
threats, then, for you, dog — threat — fear; but if you like 
dogs, then dog — good — happy.) The main question then 
becomes the nature of appraisal. In the earliest versions, ap- 
praisal was a simple evaluation (Arnold, 1960). In later ver- 
sions, appraisals became increasingly complex and took into 
account a person’s plans, beliefs, desires, values, and so on 
(Lazarus, 1991). 

One appraisal theorist, Smith (Smith & Kirby, 2001), noted 
the ontological assumptions of appraisal theories. These theo- 
ries generally share with FEP assumptions about emotions as 
entities, but they also began by assuming that appraisals are 
also entities (which are capable of producing emotions). This 
ontological predisposition can be seen in the primary method 
used: If a subject could label an appraisal with a word such as 
threat or good, then these specific appraisals were assumed to 
exist and to trigger the emotion. Smith and Kirby called for 
more circumspect inferences from such methods and for the 
use of methods that focus on the actual processes that consti- 
tute appraisals. 


Concluding Comment 


The ontological approach has been an enormous success— 
indeed, sometimes a victim of its own successes. For exam- 
ple, data generated by the FEP unveiled extraordinary 
complexity within “basic” emotions. Facial, vocal, and 
instrumental behavior, as well as cognitive appraisal, subjec- 
tive experience, and physiological changes all show much 
more variability within each emotion than anticipated 
(Ortony & Turner, 1990; Smith & Scott, 1997). Further, these 
separate components do not correlate with each other as 
highly as anticipated (Lang, 1994). As a second example, the 
ontological approach has relied heavily on human judgment 
studies (e.g., in the studies on recognition of emotion from 
faces or in questionnaire studies on appraisal). This method 
did not yield the simple patterns anticipated, but it did pave 
the way for the study of a completely different topic: the cog- 
nitive representation of emotion. 


Emotion as Discourse 


A very different reaction to observations about cultural dif- 
ferences comes from a loosely related group known as social 
constructionists (e.g., Averill, 1982; Harré, 1986; Kemper, 
1978; Lutz, 1988; Parkinson, 1995). Social constructionist 
ideas show the influence of nominalism and focus on dis- 
course about emotion. They also emphasize cultural differ- 
ences in the observable antecedents and consequences of 
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emotions—but then these differences were acknowledged by 
such ontological realists as Ekman (1972). In contrast, the 
social constructionist takes the role of culture to be deeper, 
extending to emotion itself. 

Return to the Ilongot’s liget and Utku’s ningag. A nomi- 
nalist would argue that instances of liget and ningaq are 
merely similar enough for an observer to give them a com- 
mon name (anger). The word anger admits the head-hunting 
Tlongot youth and the Utku elder who ostracized Briggs. 
But this judgment is in the eye of the beholder—in this case, 
an outsider’s third-person point of view. There is no entity 
shared by the Ilongot youth and the Utku elder. Further, 
there is no entity shared by different examples of liget within 
Ilongot society, or shared by different examples of ningaq 
within Utku society, or shared by different examples of anger 
within an English-speaking society. Nothing, that is, except 
the label. 

It is clear that the causes and consequences of liget differ 
from those of ningaq. For the social constructionist, there is a 
difference as well in the conscious subjective experiences of 
the Ilongot youth’s liget and of the Utku elder’s ningaq. The 
two experiences are similar in some ways, but they differ in 
other ways and do not share any essence. Although the Utku 
elder might share with the Ilongot youth some of the same 
raw ingredients (and this remains to be demonstrated), he ex- 
periences ningag rather than liget or anger. To experience 
ningag is to experience something that human beings should 
not experience. In contrast, to experience liget is to experi- 
ence the most important force in life, something vital to life. 
For an Ilongot, to feel liger and to head-hunt as a result are the 
most natural thing. 

An analogy may make the nominalist position clearer. Con- 
sider a baby nursing, a Jew celebrating a seder at Passover, and 
a gourmand savoring a meal at Maxim’s in Paris. The word 
eating admits all three experiences, yet their experiences are 
quite different. They might have some of the same lip move- 
ments, physiological processes, and raw sensations. However, 
the meaning given to the behaviors, physiological changes, 
and sensations would be different. Experience is a complex 
web of associations that draws on expectations, history, norms 
of what is proper, and so on. Suppose that we give the Utku 
elder and the Ilongot youth a meal at Maxim’s. The expe- 
riences would be different again. Imagine the Utku elder being 
engaged in Ilongot head-hunting. He would likely experience 
this state as abnormal and unnatural. In contrast, the Ilongot 
youth experiences himself in line with his ancestors as doing 
something completely natural, almost inevitable. 

Some social constructionists view terms such as liget, 
ningaq, and anger are names for interpretative schemas or 
scripts (Shweder, 1994). Emotional experience is based on a 


narrative constructed with the help of this cultural script, 
which gives meaning to the experience. By sharing a script, 
members of a society create similar narratives. Sometimes 
narratives in different societies are similar enough to an out- 
side observer that they can all be called by the same name. 
People form widely applicable concepts and talk about them in 
general. From this view, emotional experiences are cultural 
products. To be sure, physiological changes, facial move- 
ments, and actions are also real and might even be universal, 
but these tend to be viewed as raw ingredients, devoid of in- 
herent meaning. 

The nominalist perspective is easy to apply to the experi- 
ences of those most foreign to us, but it applies equally to our 
own emotions. In his study of road rage in Los Angeles, Katz 
(1999) emphasized that road rage fits a highly regular narra- 
tive that shapes the driver’s experience in a characteristic 
way (although to an external observer road rage can be as 
mysterious and frightening as the Ilongots’ liget). When cut 
off by another driver, the driver becomes morally outraged, 
insults the other driver (even though the other driver cannot 
hear the insult), makes obscene gestures, and feels the need to 
retaliate in order to teach the offender a lesson (sometimes 
thereby increasing the danger). 


Emotion as Process 


A line of thinking about emotion that resembles a conceptu- 
alist philosophical stance began with William James. James 
wrote disparagingly of thinking of emotions as entities or of 
giving credence to distinctions embedded in everyday lan- 
guage (such as anger vs. irritation vs. annoyance, etc.). His 
view opened the door to asking about the actual process that 
occurs when an emotion is said to occur. He suggested that 
the actual process is quite different from what is suggested by 
common sense. James famously argued that bodily changes 
(e.g., crying, running) produce rather than follow the experi- 
ence of emotion. 

Marapon (1924, 1950) tested James’s hypothesis about the 
role of bodily changes in the experience of emotion. Marañon 
injected epinephrine (adrenalin) into 210 hospital patients. He 
observed two different results. Some (29%) of the patients re- 
ported a strong “genuine” emotion, but most (71%) reported 
an “as-if’ emotion. That is, they felt as if they were having an 
emotion but denied having any real emotion. Marafion con- 
cluded that James’s hypothesis was not confirmed. Instead, 
Marañon suggested that different reactions to the same epi- 
nephrine-induced bodily changes were related to a patient’s 
specific medical condition, such as hyperthyroidism. 

Cantril and Hunt (1932) challenged Marafon’s interpreta- 
tion by replicating his study with 22 students and professors 


without medical problems. They found a similar split between 
reports of genuine and as-if emotions. Cantril and Hunt 
pointed not to medical conditions, but to the unique situational 
circumstances of each subject. In cases of genuine emotion, 
the subject’s current situation bore a “logical relationship” 
to the emotion reported; in cases of as-if-emotion, no such 
situation was present. Landis and Hunt (1934) also replicated 
Marafion’s experiment, this time with psychiatric patients, 
and obtained similar results. Landis and Hunt therefore con- 
cluded that emotion was influenced by “environmental” fac- 
tors and “higher intellectual and perceptual functions.” 

Cantril (1934) placed subjects in four successive negative 
situations (e.g., watching photographs of mutilated war 
victims or hearing sudden loud noises as the lights were unex- 
pectedly turned off). Subjects went through the four situations 
in different orders. Each was injected four times, getting a 
placebo for the first three trials and epinephrine for the fourth. 
In comparison with the placebo, epinephrine increased the 
subjects’ ratings of their emotional reactions in fear situations 
but decreased their rated emotional reaction in disgust situa- 
tions. Cantril suggested that “the awareness of some object or 
situation around which the emotion is intellectually organized 
is the immediate cause for the emotional experiences” 
(p. 578), and that “the quality of an emotion is primarily de- 
pendent upon the attitude aroused in the [subject] by the stim- 
ulus” (p. 579). In this way, Cantril, Hunt, and Landis moved 
away from a view in which emotion is an entity triggered by a 
stimulus and defined by bodily changes, as assumed in early 
ontological theories. They moved toward a view in which 
emotion depends not just causally but logically on a complex 
situation intellectually organized in the context of bodily 
arousal. 

An interesting development of this view was Nina Bull’s 
(1951) attitude theory of emotion. Bull begins with a simple 
framework: Situations elicit actions. Action consists of two 
successive stages: (a) a preparatory phase and (b) a consum- 
matory movement (e.g., fight or flight). The first stage is a 
motor attitude or action readiness and includes involuntary 
changes in posture and in various organs. This phase has both 
a direct and an indirect consequence: The direct consequence 
is the particular action (the second stage) for which the first 
phase prepares. The indirect consequence is a feeling. It is 
this feeling that is usually known as emotion. The feeling of 
an emotion is thus an epiphenomenon of the sequence in 
which a motor attitude becomes action. Although Bull shares 
with ontological thinkers the attempt to identify the one 
event that is the emotion, she also moved in the direction of 
thinking of emotion as something that can be understood only 
in terms of a process that necessarily includes both situation 
and action. Indeed, unless the feeling of the emotion is 
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equated with the emotion itself, there is no emotion per se 
within Bull’s theory of emotion. 

Schachter (1964) further articulated and developed 
this perspective in his two-factor theory of emotion. 
Schachter’s specific ideas were an application of Lewinian 
principles (Ross & Nisbett, 1991): Emotion is the result of a 
tension between environmental constraints and cognitive 
construals. The environmental constraints were both situa- 
tional (mainly others’ behaviors) and internal (nonspecific 
arousal). The cognitive construals were originally cognitive 
labels but shortly afterward became attributions (Nisbett & 
Schachter, 1966). Schachter’s theory of emotion dominated 
the study of emotion in social psychology for several decades. 
The combination of nonspecific arousal and (mis)attribution 
inspired important theoretical models of aggression (e.g., 
Zillmann & Bryant, 1974), helping (e.g., Piliavin, Piliavin, & 
Rodin, 1975), interpersonal attraction (e.g., Dutton & Aron, 
1974), environmental behavior (e.g., Anderson & Anderson, 
1984), and attitude change (e.g., Zanna & Cooper, 1974). 
Mandler (1984) developed a related theory of emotion that 
dominated the study of emotion in cognitive psychology 
around the same time. 

Ginsburg and Harrington (1996) recently proposed an ac- 
count of emotion along more purely conceptualist lines. The 
concept of emotion refers to an action in a context. The con- 
text has two structural features. The first is hierarchical: a 
broad system of events and social relationships that are nec- 
essary to give meaning to the action. The second feature of the 
context is linear; that is, it includes a sequence of actions un- 
folding over time. Prior actions lead to (or, in Lewinian terms, 
create a channel for) subsequent actions. Actions also alter 
bodily state and felt experience. The entire sequence of ac- 
tions in context with its accompanying bodily and mental 
state is construed (conceptualized) as emotion. In this way, 
there is no emotion in addition to the action in context. No sin- 
gle event within the sequence can be equated with emotion. 

Ginsburg and Harrington described the proper study of 
emotion as descriptive. They suggested creating natural 
histories of specific emotional episodes. In turn, these emo- 
tional episodes are to be understood as a subsystem of larger 
and more complex systems. The search for universal entities 
is abandoned in favor of an exhaustive description of such 
systems relevant for a particular culture. 


Formal Definitions for Emotion Terms 


Much philosophical work on emotion has been aimed at a 
formal analysis of emotion terms. Solomon’s (1976) inspired 
analysis was a precursor to appraisal theories. Wierzbicka’s 
(1992, 1999) linguistic analysis provides a formal framework 
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for any word in any language. She developed a contemporary 
version of an ancient philosophical dream: the creation of a 
universal language based on fundamental concepts indis- 
pensable for thought. Wierzbicka developed a list of univer- 
sal semantic primitives (J, you, someone, something, know, 
good, bad, maybe, feel, etc.). These, together with a mini- 
grammar specifying the rules of their combination, constitute 
a universal language. This universal language can then be 
used to analyze any emotion word in any language. The in- 
teresting result of her analyses so far is that emotion words 
(anger, liget, ningaq) and emotion itself have all been found 
to be culture-specific but, nevertheless, definable in terms of 
her universal semantic primitives, especially feel, good, and 
bad. Even words with a similar etymology, such as emotion, 
the Italian emozione, and the Spanish emociön, have not been 
found to be equivalent. 

Psychologists have also offered formal analyses of the 
emotion lexicon, although they are limited to the English lan- 
guage. Oatley and Johnson-Laird (1990) defined emotion as a 
disjunctive set of five semantic primitives: happy, sad, fear, 
anger, and disgust. Any emotion term is then defined by ref- 
erence to one or more of these five. This approach clearly 
also has ties to the position of ontological realism. Ortony, 
Clore, and Collins (1988), in contrast, define all English emo- 
tion terms as referring to a valenced reaction to an event. Dif- 
ferences between terms are defined by cognitive differences 
in interpreting that event (along the lines of appraisal theo- 
ries). Their approach has ties to a conceptualist perspective. 

An alternative formal analysis began with Wittgenstein 
and entered psychology largely through the work of Rosch 
(e.g., Rosch, 1978, 1987). This analysis is skeptical of the 
classical search for necessary and sufficient features to define 
such everyday words as emotion or anger. Various nonclassi- 
cal alternatives have been proposed (e.g., Fehr & Russell, 
1984), but all share the idea that membership in the category 
labeled by a word is a matter of degree and that the border be- 
tween members and nonmembers is fuzzy. 


TOWARD INTEGRATION 


No one research program has been able to achieve consensus. 
The persistence of competing and possibly incommensurate 
programs is frustrating, but at the same time fascinating and 
potentially useful. Differences force us to question assump- 
tions and to notice ignored questions. Competing approaches 
thus create the grounds for a qualitative shift in our under- 
standing of emotion. This shift might take the form of an in- 
tegration of two or more of the various paradigms or even of 
a revolutionary change in our understanding of emotion. In 


this chapter we offer neither a revolutionary theory nor even 
a complete integration of available paradigms, but we do 
offer the beginnings of one possible integration. We describe 
a new descriptive framework deliberately built on all of these 
paradigms. 

So far, we have perhaps overemphasized the limitations of 
these various paradigms. Their longevity indicates that each 
addresses some aspect of the topic. All of them have made sub- 
stantial contributions to the understanding of emotion. Indeed, 
all of them are necessary for raising—if not answering— 
essential questions about those very important events that 
are labeled emotion. 

We also believe that all can be integrated within a com- 
mon framework. What has prevented integration in the past is 
the assumption that each of these research programs is deal- 
ing with the same thing, namely, emotion. If the word emo- 
tion denoted a homogeneous, well-defined set of events, then 
different theories of emotion would, indeed, be in direct con- 
flict with each other over the same territory. Scientific analy- 
sis would long ago have settled major disputes. If, instead, 
emotion is a heterogeneous, poorly defined mix of qualita- 
tively different events (originally grouped together by our 
hunter-gatherer ancestors and modified with each era to suit 
cultural concerns), then different theories could be about dif- 
ferent topics within that loose domain. Selected evidence 
could easily find support for each such theory. 

We therefore begin by abandoning emotion as a scientific 
term. It remains here only as an everyday term and as a fig- 
urehead, a convenient symbol for the general domain of 
study, but it is not allowed to set the boundary for the set of 
events that any theory in this domain must explain. In fact, 
our proposed integrative framework extends beyond the tra- 
ditional boundaries of emotion by including such states as 
fatigue, drowsiness, and calm. It is especially important to 
underscore that abandoning emotion as a scientific term does 
not mean abandoning the study of those very real and very 
important events now called emotion. 

Abandoning emotion as a scientific term allows us to bor- 
row from each of the established research programs on emo- 
tion (as diagrammed in Figure 12.1). Programs based on an 
ontological realist position embody the traditional scientific 
search for basic entities that underlie all the varied manifest 
differences in a domain. They rightly emphasize empirical 
examination of physiological and behavioral details. Pro- 
grams based on a nominalist position emphasize the unique- 
ness and complexity of each emotion event and of emotional 
experience. They also emphasize the role of meaning systems 
shared by members of a culture. Unique events are under- 
stood (both by a scientist and a nonscientist) through the 
mediation of concepts (which are mental processes that 


Ontological: 
Pre-emotional primitives 
that are not necessarily 
“emotion.” 


Conceptualist: 

The pre-emotional primitive 
is manifested in events. The 
experience of these events 
gives rise to categories such 
as “emotion.” 


Nominalist: 

Events are unique and 
constitute the ground of 
everyday knowledge about 
emotion. 


Formal: 
People’s talk about emotions 
can be deconstructed into 


bc) 
concepts that allude to 


emotional primitives. 


Figure 12.1 Four research programs on emotion. 


group or order unique events). Programs based on a concep- 
tualist stance hold that nonscientists and scientists alike hold 
conceptualizations of reality. The history of science teaches 
that common-sense conceptualizations can be improved and 
ultimately replaced with scientifically honed ones. And 
programs based on a formalist position suggest possible al- 
ternative universal primitives (such as feel, good, and bad) 
and bolster our claim that emotion is a heterogeneous cluster 
of events. 


CORE AFFECT AS A POINT OF DEPARTURE 


Next, we search for primitive entities. One reason that basic 
emotions are ill suited to serve as emotion primitives has 
been established by research from the basic emotions per- 
spective: They are too complex. For example, they typically 
consist of separable components (Izard, 1977) and are di- 
rected at an object (i.e., one fears, loves, hates, or is angry 
with something). Oatley and Johnson-Laird (1987) pointed 
out that an emotional primitive should be free of this some- 
thing (the object) because of the cognitive involvement that 
the object implies. Curiously, then, our search for emotional 
primitives begins with moods and other simple feelings that 
lack an object. In this way, Oatley and Johnson-Laird created 
an important new theory based on a categorical perspective. 

Here we explore that same approach but from a dimen- 
sional perspective. The goal in dimensional studies is to find 
what is common to various emotions, moods, and related 
states. Methods have included multivariate analyses of self- 
reported feelings, introspection, the semantic differential, 
and various biological techniques. This research has regu- 
larly found such broad dimensions as pleasure-displeasure 
and activation-deactivation. We refer to any state that can be 
defined simply as some combination of these two dimensions 
as core affect. 
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Core affect is similar to Thayer’s (1986) activation, Watson 
and Tellegen’s (1985) affect, and Morris’s (1989) mood; it is 
also translatable into the everyday term feeling. In its most 
primitive form, core affect is free-floating. That is, it lacks 
an object. For example, one can feel anxious (unpleasant acti- 
vation) about nothing in particular and without knowing why 
one feels that way. Core affect thus fits the ontological re- 
quirements for a primitive, elemental, and simple emotional 
ingredient. Biological research has often found that the most 
basic levels of emotional behavior are better conceptualized 
as dimensions than as discrete emotions (Cabanac 1990; 
Caccioppo, Gardner, & Berntson, 1999; Davidson, 1992a, 
1992b; Gray, 1994; Lang, 1979; Rozin, 1999; Shizgal & 
Conover, 1996; Thayer, 1996). For example, in their review of 
studies on the peripheral physio-logical changes in emotion, 
Cacioppo, Berntson, Larsen, Poehlmann, & Ito, (2000) em- 
phasized the existence of a primitive and fast response catego- 
rizing stimuli as hospitable or hostile (see also Carver, 2001). 

More formally, core affect is that neurophysiological state 
consciously accessible as the simplest raw feelings evident in 
(but not limited to) moods and emotions, such as feeling 
good or bad, energized or enervated. In line with nominalist 
ideas, core affect does not correspond with any word in a nat- 
ural language (just as the physicist’s concept of force cannot 
be easily translated into lay terms). Core affect consists 
of all possible combinations of pleasure-displeasure and 
activation-deactivation and therefore includes states that 
would not be called emotions, such as calm, fatigue, or 
drowsiness. Indeed, a person is always in some state of core 
affect, which can be extreme, mild, or even neutral. Core af- 
fect is part of most psychological processes. 

Specifically, core affect is one part of those events people 
call emotion (and which we call emotional episodes). Self- 
reports of emotion persistently yield two large general factors 
interpretable as pleasure and arousal (e.g., Russell & 
Mehrabian, 1977; Watson & Clark, 1992). Furthermore, the 
manipulation of arousal by drugs influences self-reported 
discrete emotions (Cooper, Zanna, & Taves, 1978; Gerdes, 
1979; Schachter & Latané, 1964; Schachter & Wheeler, 
1962). Feldman Barrett and Russell (2000) explored this 
hypothesis further in a study of self-reported emotions. In 
one condition participants were asked to describe how they 
currently felt. In a second condition they were asked to search 
their memory for the very last time they had an emotion. In 
the third condition they were asked to search their memories 
for a strong, clear emotion. In all three conditions the plea- 
sure and arousal dimensions accounted for substantial 
variance in the intensity of self-reported emotions. However, 
as the event to be described became more restricted to clearer 
and stronger cases of emotion, the amount of variance 
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accounted for declined, though not to zero. For example, the 
variance accounted for by pleasure and arousal in a scale of 
anger was .80, .63, and .68 in the three conditions, respec- 
tively. Thus, pleasure and arousal remained a part of strong, 
clear emotions, but other components played a larger role. 

Core affect also guides behavior. Core affect leads us to 
expose ourselves to affect-congruent situations (Bower & 
Forgas, 2000), thereby playing a role in action preparation and 
behavioral choice. Pleasure-displeasure influences our way of 
assessing resources when planning or deciding on action. 
Pleasure and displeasure are thus not restricted to emotional 
behavior and are currently found in the explanation of differ- 
ent kinds of action, including aggression (Berkowitz, 1993), 
eating (Pinel, Assanand, & Lehman, 2000), sex (Abramson, & 
Pinkerton, 1995), and drug abuse (Solomon, 1977). The di- 
mension of arousal is one’s state of readiness for action. For 
example, feeling enthused (high pleasure and arousal) gives a 
person a sense of optimism in choosing goals and plans. 
Arousal has been the basic component of the most popular sit- 
uationist theory of emotion in social psychology (Schachter, 
1964). The existence of core affect complements rather than 
contradicts the characterization of emotions as action patterns, 
provided that action patterns too are thought to be parts of 
rather than the whole of or essential to emotion. 

Core affect provides a way of comparing qualitatively dif- 
ferent scenarios by representing them on a single dimension, 
thereby solving a common human problem: The events en- 
countered and the choices available are often qualitatively 
different. Occasionally, one chooses between the larger and 
smaller dessert, but more often the choice is between two 
qualitatively different options: dessert or a film. The dimen- 
sion of pleasure-displeasure is a psychological currency that 
provides a yardstick for such comparisons (e.g., Mellers, 
2000). 

A final advantage of thinking in terms of core affect is that 
the psychology of emotion is more easily integrated with the 
rest of psychology. The concept of emotion has led writers to 
think of emotions as stemming from a separate faculty. In con- 
trast, the concept of core affect is compatible with a growing 
body of evidence that links it to other psychological processes. 
For example, core affect has been found to guide cognitive 
processes such as attention, perception, thinking, judgment, 
mental simulation, and retrieval from memory (e.g., Baron, 
1987; Blaney, 1986; Bower, 1992; Eich, 1995; Forgas, 1995; 
Forgas, Bower, & Krantz, 1984; Izard, Wehmer, Livsey, & 
Jennings, 1965; Mayer, Gaschke, Braverman, & Evans, 1992; 
Schiffenbauer, 1974). Pleasure and displeasure facilitate the 
accessibility of positive and negative material respectively; 
the more pleasant core affect is, the more positive are evalua- 
tive judgments (Schwarz & Clore, 1988) and the more 


optimistic is one’s simulation of the future (Sanna, 1998). 
Arousal could also have a similar effect; high or low arousal 
facilitates the accessibility of high and low arousal material 
respectively (Clark, Milberg, & Ross, 1983; for a dissenting 
opinion see Bower & Forgas, 2000). Core affect also influ- 
ences the quality and type of cognitive processing. Arousal af- 
fects the quality of cognitive performance (Humphreys & 
Revelle, 1984) and attention selectivity (Easterbrook, 1959; 
Eysenck, 1982). Pleasure affects heuristic processing and 
problem solving (see Aspinwall, 1998; Isen, 1993; Lerner & 
Keltner, 2000; Niedenthal, Halberstadt, & Setterlund, 1997; 
Park & Banaji, 2000; Schwarz & Bless, 1991). 


A VOCABULARY FOR A SCIENTIFIC 
FRAMEWORK FOR EMOTION 


Core affect is not simply another term for emotion, and a vari- 
ety of additional concepts are needed to deal with those events 
called emotion. Some of these new concepts are generated by 
thinking in terms of core affect, and others are simply bor- 
rowed from other branches of psychology. Core affect is thus 
a departure point for anew vocabulary in the study of emotion. 
It can be used to define some common terms and to generate a 
set of secondary concepts that covers various emotion-related 
events. 

Mood is defined as prolonged core affect without an ob- 
ject, and affect regulation is any attempt to alter core affect 
directly. Individuals typically (though not always) seek plea- 
sure and avoid displeasure. Individuals also seek a level of 
arousal appropriate to the task at hand (e.g., looking for 
pleasant relaxation when stressed, but for excitement when 
bored). Exercise, coffee, cigarettes, looking for particular 
companions, and listening to music are at least in part ways 
of regulating core affect. 


Affective Quality 


Just as the objects and events in our perceptual world emerge 
into consciousness already interpreted, they emerge affec- 
tively interpreted. Core affect should be distinguished 
from the affective qualities of the stimuli we perceive on at 
least two grounds. First, unlike core affect, which is object- 
less, affective quality is linked to a particular stimulus. Sec- 
ond, phenomenologically, core affect resides in the person 
who feels it, whereas affective quality resides in the stimulus; 
it is the odor that is pleasant (a fragrance) or unpleasant (a 
stench). Although core affect and affective quality are usually 
linked, each can change without the other: Core affect can be 
altered chemically, and a depressed patient can acknowledge 


that something is pleasant but report no changes in actual 
core affect. Various terms from the literature (e.g., evalua- 
tion, affective judgment, affective appraisal, affective reac- 
tion, or primitive emotion) are similar to the perception of 
affective quality (see Cacioppo et al., 1999; Zajonc, 1980, 
2000). Several experiments suggest that an initial perception 
of affective quality of a stimulus takes place automatically 
within 25 ms of encountering the stimulus (Bargh, 1997; 
Bargh, Chaiken, Govender, & Pratto, 1992; Bargh, Chaiken, 
Raymond, & Hymes, 1996; Fazio, Sanbonmatsu, Powell, & 
Kardes, 1986). 


Attribution 


People seek the cause of any change in core affect that they 
experience. They attribute core affect to someone or some- 
thing or some condition. In this way, core affect takes on an 
object: One moves from simply feeling bad to grieving over 
the loss of a friendship. Attributions are complex perceptual- 
cognitive processes and entail the possibility of misattribu- 
tion. Although the object typically is an obvious thing or 
event, it can be invented (fear of ghosts), hallucinated, re- 
membered, or anticipated. The object is a psychological con- 
struction that includes past and future. 


Motive 


Attributing core affect to an object becomes a motive for 
action—for example, attributing negative core affect 
(displeasure) to a deprivation (e.g., attributing discomfort to 
the lack of a cigarette constitutes a motive to smoke). Mo- 
tives may or may not result in action. 


Liking and Disliking 


These everyday concepts include both occurrent (actual, brief) 
events and dispositions to those events. An occurrent instance 
of liking (e.g., tasting a novel soup and liking it) is the experi- 
ence of pleasure attributed to the liked object (the soup). A per- 
son’s disposition to like something (e.g., Joe likes soup) is that 
person’s tendency to derive pleasure from that thing. 


Categories of Emotion 


Core affect, perception of affective quality, and the corre- 
sponding attributions to an object describe a huge variety of 
phenomena usually called emotion. Nevertheless, a dimen- 
sional affect system should also explain all these cases in 
which psychologists and laypeople prefer to speak in terms 
of specific categories such as fear, sadness, and so on. 


A Vocabulary for a Scientific Framework for Emotion 293 


Categorization is a basic cognitive process. Rather than con- 
sider each event encountered as unique (as we are encouraged 
to do by the nominalists), people group them together on the 
basis of perceived similarity. Thus, one notes a resemblance 
between some actual event and a stored representation of a 
group of events. On one theory, an emotion category is men- 
tally represented by a script of the components of that emo- 
tion, unfolding in a causally linked sequence (Fehr & Russell, 
1984; Fischer, 1991; Lakoff, 1987; Russell, 1991; Russell & 
Fehr, 1994). Categories are also linked to one another in a 
complex net of associations, and categorization is implicated 
in the perception of emotion both in others and in self. 


Emotional Episode 


Our term that comes closest to emotion is emotional episode. 
It is any actual event that resembles the mental representation 
of an emotion category sufficiently to count as a member of 
that category. Resemblance is a matter of degree, and no sharp 
boundary separates members from nonmembers. We define a 
prototypical emotional episode as an emotional episode for 
which the resemblance is especially close. Our notion of 
emotional episode as a pattern among simpler ingredients 
(including those already described, such as core affect and 
attribution) is congruent with much current conceptual 
and empirical analysis of emotion as the integration of 
simpler components through a process of attribution (Bem, 
1972; Blascovich, 1990; Higgins, 1987; Keltner, Locke, & 
Audrain, 1998; Ohman, 1999; Olson, 1990; Schachter, 1964; 
Weiner, 1985). 

An emotional episode typically begins with a real or imag- 
inary event, which has a perceived affective quality. (This ini- 
tial estimate of affective quality is included in appraisal 
theories, usually as a first evaluative step; Arnold, 1960; 
Smith & Ellsworth, 1985.) Core affect changes and prompts 
an attributional process. In most cases, the eliciting event is 
readily identified, but ambiguous cases can give rise to mis- 
attributions (Nisbett & Schachter, 1966). Whatever event is 
identified as the cause is thereby seen as the source of current 
core affect—and therefore as a problem to be solved or an op- 
portunity to be seized. Behavior follows accordingly. 


Emotional Meta-experience 


It is one thing to undergo an emotional episode, another to no- 
tice that this is happening. Emotional meta-experience is the 
perception of oneself as having a specific emotion. It is similar 
to what is commonly called subjective emotional experience. 
The prefix “meta-” draws attention to the notion that the raw 
data (affect core, affective quality, action, somatic sensations, 
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attribution, etc.) on which emotional meta-experience is based 
are themselves consciously accessible experiences. On this 
account, to perceive oneself as “angry” is acomplex process of 
self-categorization based on the everyday category of anger. 
The hypothesis of emotional meta-experience fits well with 
recent findings that conscious emotional feelings follow 
and monitor rather that precede other emotional ingredients 
(e.g., Gray, 1999; LeDoux, 1996; Ohman, 1999). 


THE UNIVERSAL AND THE CULTURAL 


In the debate between universalists and cultural relativists, 
the psychology of emotion inherited a version of the peren- 
nial nature-nurture controversy. As long as this question is 
posed about the heterogeneous cluster called emotion, more 
debate than resolution can be expected. When emotion is 
replaced with the variety of concepts proposed here, we hope 
that a resolution is nearer. In this chapter, we have come 
down squarely both on the side of nature and on the side of 
nurture. In principle, every psychological event is a joint 
product of genetic and epigenetic influences. In searching for 
elementary processes, we sought those whose existence ap- 
pears to be as much a part of a universal human nature as pos- 
sible. We offered core affect (and the specific dimensions of 
pleasure and arousal), perception of affective quality, attribu- 
tion, categorization, and so on as candidates. Specific out- 
comes of some of these universal processes, however, might 
show variability caused by epigenetic factors. For instance, 
the event to which core affect is attributed and the affective 
quality perceived in a specific stimulus might show measur- 
able epigenetic variability. 

Emotional episodes are patterns among these ingredients 
and might show more variability caused by epigenetic influ- 
ences. Behavior, for example, draws on prepackaged modules 
that are coupled or decoupled to suit the specific antecedent 
event and one’s goals, plans, social role, norms, values, and 
so forth. An emotional episode in response to frustration will 
bear a family resemblance to all human responses to frustra- 
tion. Still, it might more typically resemble the script for 
liget among the Ilongot, but more the script for ningag among 
the Utku. As a consequence, concepts formed in one so- 
ciety can be expected to differ from those formed in another 
(Russell, 1991). 

On our account, emotional meta-experience (although hy- 
pothesized to be a universal process) allows the greatest 
cultural diversity in content. For example, the concept of 
emocionado is available and readily accessible for Spaniards. 
They are easily able to conceptualize, label, and report states 
that resemble emocionado. Perhaps all persons experience a 


core affect combined with thoughts and behaviors that do not 
fit well into a specific emotion category, but Spaniards expe- 
rience this state in terms of emocionado. Doing so places that 
state within a culture-specific network of meaning. In con- 
trast, a person who lacks this concept might also have the 
same raw ingredients but would, nevertheless, not experience 
the resulting Gestalt in the same way. 


A COMPARISON OF CORE AFFECT 
WITH EMOTION 


Emotion is an old and rich term that refers to a variety of fas- 
cinating phenomena that are not as closely related to each 
other as one might think. The gap between emotion and 
nonemotion is fuzzier and smaller than was once thought. As 
a consequence, the psychology of emotion is fragmented into 
many largely independent areas. Research even on a suppos- 
edly single emotion is fragmented. For example, research on 
fear includes clinical research on anxiety, social psychologi- 
cal research on the effect of fear on attitude change, and ex- 
perimental research on fear as a basic emotion; each of these 
areas has its own traditions. Articles in one tradition rarely 
reference an article in another. These considerations suggest 
any number of strategies for the future. One suggestion is to 
take stock of the ecology of emotion events. Another is to 
move to a much lower level of analysis. Fear, sadness, and the 
like consist of components that can be studied in their own 
right. A search for patterns among the components would re- 
place assumption with empirically established patterns. 

Our proposal of a new framework and vocabulary for re- 
search on emotion should not be understood as a new theory 
about emotion but as an outline for the integration of old the- 
ories. The concept of emotional episode has several advan- 
tages over the old concept of emotion. It encourages the study 
of individual components and thus allows researchers to ex- 
plain and include in their theories the huge variability of 
emotional behavior, expression, experience, and physiology 
that has been uncovered in research on basic emotions. Be- 
havior probably does not divide naturally into two qualita- 
tively different classes, the emotional and the nonemotional. 
Core affect, affective quality, and attribution all occur outside 
emotional episodes as well. 

Unlike “basic emotions,” which are self-contained entities, 
emotional episodes consist of ingredients that can be shaped in 
a variety of ways. For example, Bugental (2000) proposed that 
socialization is not simply a general process of social influence 
but an acquisition of effective procedures (algorithms) for 
solving problems in five specific domains (attachment, power 
in hierarchies, mating, coalitions in groups, and reciprocity). 


Effective procedures in one domain may be unrelated to those 
in another. Considered to be emotional episodes, love and jeal- 
ousy involve not just such processes as core affect and attribu- 
tion but also these specific algorithms for social life—roles, 
strategies, tactics, stances, norms—that shape behavior and 
social interaction in a dynamic way. 

Although an amalgam of prior theories, our framework 
results in a picture of emotional life different from what is 
currently available. The events highlighted in previous 
paradigms—prototypical emotion episodes, discourse about 
emotion, concepts—all occur and are important. Neverthe- 
less, rather than incompatible approaches to one thing, they are 
interacting parts of a larger system. This system also includes 
many other related parts, such as moods, evaluations, and un- 
attributed core. Whatever the fate of our specific framework, a 
new study of emotion that goes beyond current assumptions 
promises to yield an even richer field than we see today. 
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Popular culture seems obsessed with the concept of attitude. 
Entering the word attitude into an Internet search engine gen- 
erates many listings, including “Art with Attitude,” “Animals 
with Attitude,” “Attitude Bikes,” and “Spice Girls—Spicy 
Attitude.” Moreover, the importance of attitude is frequently 
cited in promotional media (e.g., gym posters), self-help 
books (e.g., Russell-McCloud, 1999; Ryan, 1999), and even 
large-scale business conferences (e.g., Wal-Mart Canada, 
1997). All of these examples support (albeit indirectly) 
Gordon Allport’s (1935) famous assertion that attitude is one 
of the most indispensable constructs in social psychology. 

In this chapter, we review social psychological research 
and theory about attitudes. In the first portion of the chapter, 
we define attitudes and compare this construct to other impor- 
tant social psychological constructs. Next, we discuss differ- 
ent theories about the psychological structure of attitudes, 
focusing on the theories’ implications for measuring attitudes 
and the evidence supporting or refuting them. Third, we 


299 


ATTITUDE FORMATION 311 
Cognitive Processes 311 
Affective Processes 311 
Behavioral Processes 312 
Biological Processes 312 
ATTITUDES AND INFORMATION PROCESSING 313 
Selective Attention 313 
Selective Perception 313 
Selective Memory 314 
Attitude Polarization 314 
ATTITUDES AND BEHAVIOR 314 
Compatibility of Attitude and Behavior Measures 315 
Nature of the Behavior 315 
Nature of the Attitude 316 
Personality Variables 316 
Composite Model of Attitude-Behavior Consistency 317 
Applications to Social Behavior 317 
CONCLUSIONS 318 
REFERENCES 318 


examine the psychological functions served by attitudes. 
Fourth, we consider the relations among attitudes and between 
attitudes and higher-order constructs such as ideologies. Fifth, 
we identify important ways in which attitudes vary. Sixth, we 
address briefly how attitudes form. Seventh, we discuss the ef- 
fects of attitudes on information processing. Finally, we con- 
sider the relation between attitudes and behavior. 


WHAT ATTITUDES ARE AND WHAT 
ATTITUDES ARE NOT 


When they define attitudes, social psychologists focus on 
the tendency to like or dislike an attitude object or behavior. 
That is, attitudes are defined as tendencies to evaluate objects 
favorably or unfavorably (Bem, 1970; Eagly & Chaiken, 
1993; Fazio, 1990; Olson & Zanna, 1993; Petty, Wegener, & 
Fabrigar, 1997; Wood, 2000). Attitudes can be directed 
toward any identifiable object in our environment, including 
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groups of people (e.g., ethnic groups), controversial issues 
(e.g., legalized abortion), and concrete objects (e.g., pizza). 
In fact, the potentially unlimited range of attitude objects 
sometimes causes confusion about the relations between atti- 
tudes and other social psychological constructs. For example, 
there is conceptual overlap between attitudes and values, 
which are abstract ideals that people consider to be important 
guiding principles in their lives (e.g., freedom; Rokeach, 
1973; Schwartz, 1992). The importance component of values 
makes them distinct from attitudes (Feather, 1995; Maio & 
Olson, 1998), because positive attitudes do not imply that the 
targets are important guiding principles in life. 

One fundamental attribute of attitudes is that they are 
subjective—that is, they reflect how a person sees an object 
and not necessarily how the object actually exists. Conse- 
quently, attitudes should be considered a part of the subjective 
self, which is the stream of thoughts, feelings, and actions that 
govern how someone lives (James, 1890). 


STRUCTURE OF ATTITUDES 


The relevance of attitudes to the subjective self suggests that 
attitudes may be connected to thoughts, feelings, and actions. 
This hypothesis raises the question of how attitudes are struc- 
tured in the human mind. Understanding the mental structure 
of attitudes is potentially as important to attitude research as 
identifying the structure of DNA was to biological research. 
Uncovering the internal structure of attitudes can facilitate 
our understanding of how attitudes form, strengthen, and 
change. 

In this section, we describe four well-established perspec- 
tives on attitude structure and their implications for attitude 
measurement. Two perspectives focus on the content of atti- 
tudes. These perspectives examine how attitudes may express 
more elemental psychological constructs, such as beliefs and 
emotions. The other two perspectives examine the dimension- 
ality of attitudes—that is, these theories consider precisely 
how attitudes summarize positivity and negativity toward the 
attitude object. After reviewing attitude content and dimen- 
sionality, we describe some alternative attitude measures and 
the concept of implicit attitudes. 


Attitude Content 


Two perspectives have dominated research on the content 
of attitudes: the three-component model and the expectancy- 
value model. For both models, we describe their chief 
characteristics, implications for attitude measurement, and 
supporting evidence. 


Three-Component Model 


Guiding Assumptions The three-component model hypoth- 
esizes that attitudes express people’s beliefs, feelings, and past 
behaviors regarding the attitude object (Zanna & Rempel, 
1988). For example, people might form a positive attitude 
toward eating spaghetti because spaghetti tastes good (affec- 
tive component) and they believe that spaghetti is nutritious 
(cognitive component). Moreover, through the process of 
self-perception (Bem, 1972; Olson, 1992), people may decide 
that they like spaghetti because they can recall eating it often 
(behavioral component). Thus, this model suggests that peo- 
ple have positive attitudes toward an object when their beliefs, 
feelings, and behaviors express favorability toward an object, 
whereas people have negative attitudes toward an object when 
their beliefs, feelings, and behaviors express unfavorability to- 
ward the object. 

We think it important to note, however, that the three-com- 
ponent view also regards attitudes as being distinct from the 
beliefs, feelings, and behaviors that influence them— 
following the adage that the whole is not simply the sum of its 
parts (Eagly & Chaiken, 1998; Zanna & Rempel, 1988). The 
attitude per se is a net evaluation of an attitude object; people 
can experience this evaluation when they encounter the atti- 
tude object, and they can store their attitude as a statement in 
memory (e.g., ice cream is good). Similarly, the attitude object 
can evoke the component beliefs, feelings, and behaviors, and 
the components can be subjectively represented in memory. 
Nonetheless, these components are more circumscribed in 
their focus. Beliefs are perceived associations between an ob- 
ject and its attributes, which may be evaluative in nature (e.g., 
ice cream is fattening); feelings are experiences of pleasant or 
unpleasant mood, which may be evoked by particular objects 
(e.g., ice cream makes me relaxed); and behaviors are overt 
acts that involve approaching or avoiding the object in some 
way (e.g., I buy ice cream often). 


Measurement The three-component model indicates 
that it is possible to obtain measures of overall attitudes with- 
out attempting to assess attitude-relevant beliefs, feelings, 
and behaviors. For example, attitudes are frequently mea- 
sured using an attitude thermometer, which asks participants 
to use a thermometer-like scale to indicate the extent 
to which they feel favorable versus unfavorable toward the 
attitude object (Campbell, 1971; Haddock, Zanna, & Esses, 
1993; Maio, Bell, & Esses, 1996; Wolsko, Park, Judd, & 
Wittenbrink, 2000). Using this scale, people can indicate a 
general evaluation, which may be derived from attitude- 
relevant beliefs, feelings, behaviors, or some combination of 
all three. 


Nonetheless, such measures do not utilize the three- 
component model as rigorously as do measures that assess 
directly the attitude-relevant beliefs, feelings, and behaviors. 
Breckler (1984) provided an excellent example of the direct 
assessment of attitude-relevant beliefs, feelings, and behav- 
iors. His research measured attitudes toward snakes and used 
a variety of verbal and nonverbal indicators. The verbal mea- 
sures asked participants to rate (using self-report scales) their 
beliefs, feelings, and past behaviors toward snakes. The non- 
verbal measures assessed attitude-relevant affect and behav- 
ior, using recordings of participants’ heart rate and behavior 
in the presence of a live snake. The verbal and nonverbal 
measures for each component were then aggregated to form 
overall indices for each attitude component. 

Open-ended measures offer another method for assess- 
ing the three components of attitudes. These measures ask 
participants to list their beliefs, feelings, and behaviors re- 
garding the attitude object (Esses & Maio, 2002; Haddock & 
Zanna, 1998). Participants then rate the valence of each 
response by using a semantic-differential scale. This approach 
makes it necessary for respondents to indicate responses 
that are accessible to them, rather than simply rate agree- 
ment with responses that the researcher presents (Esses & 
Maio, 2002). 


Evidence In support of the three-component model, re- 
search has found that people’s beliefs, feelings, and behav- 
iors toward an attitude object are correlated but distinct. For 
example, Breckler (1984) found that people’s beliefs, feel- 
ings, and behaviors toward snakes were moderately corre- 
lated when the components were assessed using verbal and 
nonverbal measures in a context in which a snake was pre- 
sent. His use of verbal and nonverbal measures provides a 
good test of the three-component model, because this tech- 
nique corrects for the systematic measurement error that 
would occur if either technique were used alone. (In fact, the 
components were highly intercorrelated when verbal items 
alone were used in the absence of a snake.) 

Using primarily pen-and-paper measures, additional re- 
search has examined the distinction between the cognitive 
and affective components, and such research has found mod- 
erate correlations for attitudes toward a large variety of ob- 
jects (e.g., birth control, blood donation, microwaves; 
Breckler & Wiggins, 1989; Crites, Fabrigar, & Petty, 1994; 
Haddock & Zanna, 1998). Further, Trafimow and Sheeran 
(1998) found that attitude-relevant feelings and beliefs were 
clustered separately in memory. 

Given the evidence that the cognitive and affective com- 
ponents are distinct, attitudes in different domains may be 
uniquely related to one or the other component. Consistent 
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with this prediction, cognitive responses are strong predictors 
of attitudes toward a variety of controversial issues (e.g., 
capital punishment, legalized abortion, nuclear weapons; 
Breckler & Wiggins, 1991; Crites et al., 1994), whereas 
affective responses are strong predictors of attitudes toward 
blood donation (Breckler & Wiggins, 1989), intellectual 
pursuits (e.g., literature, math; Crites et al., 1994), smoking 
(Trafimow & Sheeran, 1998), and politicians (Glaser & 
Salovey, 1998). 


Belief-Based Attitudes 


Guiding Assumptions It is also possible to view attitudes 
as evaluative responses to an object that are influenced by be- 
liefs alone (e.g., McGuire, 1960; Wyer, 1970). From this per- 
spective, it is important to understand exactly how beliefs are 
interrelated and how beliefs are linked to affective responses. 
For example, a message might argue that it is good to reduce 
waste, and therefore that people should recycle waste. The 
message is persuasive if the message recipient accepts both 
the premise of the argument (i.e., reducing waste is good) 
and the implied link between the premise and the conclusion 
De, recycling will reduce waste). Notice that the evaluative 
nature of the premise (reducing waste is good) introduces an 
evaluative bias into the conclusion—that is, people should 
become more favorable toward recycling because of its 
desirable implications for reducing waste. In this manner, 
attitudes can be evoked by beliefs Ge, premises) that are 
evaluative in nature. 

The notion that attitudes reflect the acceptance or rejec- 
tion of evaluative premises is central to the well-known 
expectancy-value perspective on attitudes (e.g., the theory of 
reasoned action; Fishbein & Ajzen, 1975). According to this 
approach, an attitude is the sum of all of the evaluative beliefs 
regarding the attitude object. For instance, if people believe 
that recycling is easy and that recycling helps the environ- 
ment, people should hold a positive attitude toward recy- 
cling. This attitude is positive because both beliefs link 
positively valued attributes to the behavior. Of course, beliefs 
are rarely held with absolute certainty. For example, a person 
may be only 70% certain that recycling is easy, but also be 
100% certain that recycling helps the environment. Accord- 
ing to the expectancy-value model, beliefs have less impact 
on attitudes when they are less certain. This reasoning is fre- 
quently summarized in a well-known equation: A = Xbe,, 
where A is the total attitude toward the attitude object, b; is 
the subjective belief that the object possesses attribute i (e.g., 
the probability that recycling helps the environment), and e, 
is the evaluation of attribute i (e.g., the positive value at- 
tached to the environment). 
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Measurement The expectancy-value model prescribes 
a method for measuring attitudes: Participants must first 
consider a list of potential attributes of an attitude object and 
then for each attribute rate (a) the probability that the object 
possesses the attribute, and (b) the desirability of the attribute. 
In most research, the probability ratings are made using scales 
from —3 (very improbable) to +3 (very probable) or from 
0 (not at all) to 1 (definitely). The evaluative ratings are 
made using evaluative scales from —3 (e.g., very bad) to +3 
(e.g., very good). To derive the overall attitude, the product 
of the probability and evaluative ratings is computed for 
each attribute, and the products are summed across all of the 
attributes. 

The expectancy-value model is also compatible with an 
open-ended thought-listing procedure for measuring atti- 
tudes. In this procedure, participants list their beliefs about 
the attributes of the attitude object, and they rate the desir- 
ability of each attribute. An overall index of attitude is then 
obtained by summing the desirability ratings. The thought- 
listing procedure does not require probability ratings because 
it elicits attributes that participants perceive as being highly 
associated with the attitude object (Esses, Haddock, & Zanna, 
1993; Esses & Zanna, 1995). 


Evidence Research has examined the utility of the 
expectancy-value model by testing whether people’s reports 
of their own attitudes are correlated with the summed products 
of the attitude-relevant expectancies and values. Results indi- 
cate that there are at least moderate correlations between atti- 
tudes and the expectancy-value products (e.g., Budd, 1986; 
van der Pligt & de Vries, 1998), although there have been sta- 
tistical and methodological criticisms of these findings (e.g., 
Bagozzi, 1984; Sparks, Hedderley, & Shepherd, 1991). 

To test the expectancy-value model directly, it is necessary 
to examine experimentally the causal impact of beliefs and 
evaluations on attitudes. Fortunately, studies of persuasion 
have yielded some support for the idea that persuasive mes- 
sages influence evaluative beliefs, which influence attitudes 
(e.g., Maio, Bell, et al., 1996), although this effect may occur 
only when people are motivated and able to process persua- 
sive messages in a systematic manner (Chaiken, Liberman, & 
Eagly, 1989; Petty & Cacioppo, 1986; E. P. Thompson, 
Kruglanski, & Spiegel, 2000). 


Reconciling the Three-Component 
and Belief-Based Models 


The comprehensiveness of the expectancy-value model is 
challenged by the findings noted earlier—that affect and past 
behavior predict attitudes independently of beliefs. This 


challenge can be met by the argument that affective reactions 
and past behaviors are simply different types of beliefs about 
the attitude objects. For example, the three-component model 
suggests that people may form a positive attitude toward an 
object that makes them feel happy. The expectancy-value 
model can account for this process by suggesting that people 
believe that the object makes them happy and value their 
own happiness—the effect of happiness is reduced to an 
expectancy-value product. Nevertheless, affective beliefs 
and behavioral beliefs are made salient only by considering 
the three-component model. Thus, at the very least, the three- 
component model spurs the expectancy-value formulation to 
consider different types of beliefs. 

On the other hand, the three-component model would be 
more compelling if the relations between the three attitude 
components and attitudes fell into a discernible pattern that 
could be explained by prior theory. Discovering such a pat- 
tern is difficult, partly because there is conflicting evidence 
for some attitude objects. For example, Esses et al. (1993) 
found that affect played the greatest unique role in predicting 
attitudes toward two groups (French Canadian and native 
people), whereas beliefs about out-group values played the 
greatest unique role in predicting attitudes toward two other 
groups (Pakistani and homosexual people). These researchers 
also obtained evidence that the relative dominance of emo- 
tions and cognitions depended on individual differences and 
situational factors. 

These findings indicate that there is a need for theory de- 
scribing when one component should be more influential than 
another. Such a theory would need to consider evidence that 
the roles of affect and cognition may depend on the psycho- 
logical functions fulfilled by attitudes (see Maio & Olson, 
2000). For example, affective reactions may be stronger pre- 
dictors of attitude when the attitude object has a hedonic pur- 
pose than when the object has a utilitarian purpose (Kempf, 
1999). In addition, attitudes toward social partners become 
more imbued with affect as people get older and when they are 
diagnosed with a critical illness—conditions that presumably 
increase the importance of close affective ties with others 
(Carstensen, Isaacowitz, & Charles, 1999). 

It is also important to consider that affect and cognition 
may have different processing requirements. For example, 
affective associations may be more accessible (Verplanken, 
Hofstee, & Janssen, 1998) and they may be processed more 
easily (Reeder & Pryor, 2000). Perhaps the ease of affective 
processing explains why (a) affective reactions exert a 
stronger influence on attitudes when there is a conflict between 
affect and cognition (Lavine, Thomsen, Zanna, & Borgida, 
1998), (b) affect has a stronger influence on mental represen- 
tations of others in general (Jussim, Nelson, Manis, & Soffin, 


1995), and (c) affect is more closely related to the importance 
attached to social values (Maio & Olson, 1998). 


The Dimensionality of Attitudes 


The three-component model and the expectancy-value model 
describe the manner in which attitudes are related to beliefs, 
feelings, and behavior. Neither model, however, specifies 
precisely how attitudes summarize positivity and negativity 
in memory. There are two prominent perspectives on this 
question: the unidimensional model and the bidimensional 
model. 


Unidimensional Model 


Guiding Assumptions The traditional perspective regards 
attitudes as being unidimensional evaluations, which express 
sentiments ranging from extreme unfavorability toward the 
attitude object to extreme favorability toward the attitude 
object. In other words, the unidimensional perspective as- 
sumes that attitudes can take the form of (a) favorability, 
(b) unfavorability, or (c) neither favorability nor unfavorabil- 
ity toward the attitude object. Thus, a person may feel either 
positively or negatively about the object, but not both at the 
same time. 


Measurement The most common measures of attitudes 
are based on the unidimensional perspective. These measures 
include bipolar semantic-differential scales, which are an- 
chored by a negative adjective at one end (e.g., bad) and a pos- 
itive adjective at the other end (e.g., good). For example, 
respondents could be asked to rate their attitude toward cen- 
sorship using a 7-point scale from —3 (very unfavorable) to 
+3 (very favorable), with 0 (neither favorable nor unfavor- 
able) in between. Respondents may be given many semantic 
differential scales, anchored by different adjective pairs (e.g., 
good Vs. bad; negative vs. positive). To yield an overall index 
of attitudes, responses are averaged across the scales. 

Another common procedure uses Likert-like scales. This 
technique utilizes many statements expressing varying 
degrees of favorability or unfavorability toward the attitude 
object. Examples might be Censorship unfairly restricts 
access to information and Censorship is necessary to keep 
obscene material from children. People respond to each item 
ona scale from —2 (strongly disagree) to +2 (strongly agree). 
To yield an overall index of attitudes, responses to the items 
that imply unfavorability toward the attitude object are reverse 
coded (e.g., +2 changes to —2), and responses to all items are 
then averaged. 
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Evidence To some extent, the unidimensional model is 
supported by findings that unidimensional measures of atti- 
tude exhibit substantial criterion validity. That is, semantic- 
differential and Likert scales yield attitude scores that predict 
behavior (Ajzen & Fishbein, 1977; Kraus, 1995). In addition, 
the unidimensional model is consistent with Judd and Kulik’s 
(1980) observation that people are faster at identifying their 
agreement or disagreement with extreme attitude positions 
than with neutral attitude positions. These researchers argued 
that this result should occur if people easily represent strong 
positivity (without any negativity) and strong negativity 
(without any positivity) in their minds. 


Bidimensional Model 


Guiding Assumptions The bidimensional model rejects 
the notion that attitudes exist only on a single evaluative 
continuum from negativity to positivity. Instead, the bidimen- 
sional model suggests that attitudes subsume an evaluative 
tendency that varies in positivity and a separate evaluative 
tendency that varies in negativity. Consequently, attitudes 
can take the form of (a) favorability, (b) unfavorability, (c) nei- 
ther favorability nor unfavorability, and (d) both favorability 
and unfavorability toward the attitude object. 


Measurement To measure attitudes from the bidimen- 
sional perspective, the positive and negative responses must 
be assessed separately. Kaplan (1972) suggested that any sin- 
gle semantic-differential scale could be split to yield separate 
positive and negative dimensions. For example, researchers 
could use a semantic-differential scale from —3 (very bad) to 
0 (neutral) and a semantic-differential scale from 0 (neutral) 
to 3 (very good), rather than use a single semantic-differential 
scale from —3 (very bad) to 3 (very good). In this manner, sep- 
arate negative and positive dimension scores are obtained. 
This approach prevents ambiguous neutral responses (Kaplan, 
1972). That is, in single semantic-differential and Likert items, 
neutrality may stem from an absence of both positivity and 
negativity toward the attitude object, or it may stem from 
the simultaneous presence of both positivity and negativity; 
the split scales can differentiate between these two types of 
neutrality. 

Split scales may be unnecessary when an attitude measure 
includes many items that assess both positive and negative 
attributes of the attitude object. For example, open-ended 
measures of attitude ask participants to list their beliefs about 
an attitude object and the emotions that the object elicits in 
them (see Esses & Maio, 2002; Haddock & Zanna, 1998). 
Using a traditional semantic-differential scale, participants 
then rate the valence of each response. This approach enables 
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respondents to indicate some beliefs and emotions that are 
positive and some beliefs and emotions that are negative. 
Using this technique, a positive-dimension score can be de- 
rived from the sum or average of the positive ratings, and a 
negative-dimension score can be derived from the sum or 
average of the negative ratings (Bell, Esses, & Maio, 1996; 
Maio, Esses, & Bell, 2000). 

Separation of the positive and negative dimensions en- 
ables the calculation of ambivalence, which is the simulta- 
neous existence of positivity and negativity toward the 
attitude object (Kaplan, 1972; Olson & Zanna, 1993). Am- 
bivalence is calculated using formulas that are designed to 
assess the extent to which there are high amounts of positiv- 
ity and negativity rather than a high amount of positivity or 
negativity alone (e.g, Bell et al., 1996; Priester & Petty, 
1996; M. M. Thompson, Zanna, & Griffin, 1995). We find it 
interesting, however, that the scores derived from these for- 
mulas exhibit only moderate correlations with subjective 
self-reports of ambivalence (approximately r = .40; Priester 
& Petty, 1996). Thus, although the objective and subjective 
measures possess some convergent validity, they must be 
tapping psychological processes that are at least somewhat 
distinct. 


Evidence If the bidimensional view is valid, people’s 
favorability toward an attitude object should at least some- 
times be largely unrelated to their unfavorability toward the 
object. In contrast, the unidimensional view suggests that 
there should be a strong negative correlation between posi- 
tivity and negativity. In support of the bidimensional view, 
past research has found only moderate negative correlations 
between positivity and negativity, across a variety of attitude 
objects (e.g., different ethnic groups; Bell et al., 1996; 
Kaplan, 1972; I. Katz & Hass, 1988; M. M. Thompson et al., 
1995; cf. Jonas, Diehl, & Brömer, 1997). 

Cacioppo, Gardner, and Berntson (1997) observed that 
positivity and negativity toward an object do not change in 
parallel: (a) There is a tendency for people to initially pos- 
sess more positivity than negativity toward attitude objects, 
and (b) positivity increases more slowly than does negativ- 
ity. Therefore, it is plausible that positivity and negativity 
summarize different mental processes. Also, if the positive 
and negative dimensions are distinct, they should exhibit 
somewhat different correlations with other variables. Unfor- 
tunately, researchers have not yet systematically examined 
this issue. 

Finally, if the bidimensional view is valid, the simultane- 
ous existence of positivity and negativity (i.e., ambivalence) 
should have unique psychological consequences that are 
not predicted by the unidimensional model. And, indeed, 


researchers have found unique consequences of ambivalence 
(see the section on characteristics of attitudes later in this 
chapter). 


Reconciling the Unidimensional and 
Bidimensional Perspectives 


Despite the empirical support for the bidimensional view, it 
should be noted that most researchers have not examined the 
correlations between positivity and negativity while simulta- 
neously controlling random and systematic measurement 
error. Failure to control for both sources of error can artifac- 
tually decrease the magnitude of the observed correlation 
(Green, Goldman, & Salovey, 1993), leaving the impression 
that the positive and negative dimensions are less strongly re- 
lated than they actually are. 

Even if future evidence supports the bidimensional model, 
it is plausible that the unidimensional model and bidimen- 
sional model are valid at different psychological levels. For 
instance, the bidimensional model may apply to attitude for- 
mation, in which people perceive the attitude object on both 
positive and negative dimensions; these dimensions might 
then be integrated to form a single, unidimensional evalua- 
tion (see Cacioppo et al., 1997). Alternatively, the unidimen- 
sional model may lose predictive validity as knowledge 
about the attitude object becomes more complex, because it 
becomes difficult to integrate the object’s positive and nega- 
tive attributes. 

Neither perspective on attitude dimensionality explicitly 
considers implications of the fact that people can be made 
aware of many different exemplars of the attitude object, in 
addition to many attributes of each exemplar (Lord & Lepper, 
1999). For example, when thinking of their attitude toward 
cheese, people can imagine the most recent type of cheese 
that they ate (e.g., fresh brie vs. processed cheese slices). The 
reported attitude will depend on which exemplar is retrieved 
because different exemplars are often associated with differ- 
ent attributes and evaluations. Thus, it is likely that attitudes 
subsume many different exemplars of the attitude object in 
addition to the varied attributes of the exemplars. 


Alternative Attitude Measures 


Past researchers have most often measured attitudes using 
self-report scales. An important limitation of self-report 
scales is that they are affected by tendencies to respond in 
a socially desirable manner (Paulhus, 1991). For example, 
people might be reluctant to report prejudice against ethnic 
groups because of the social stigma attached to prejudicial 
attitudes. 


To overcome this problem, various techniques have been 
developed. For example, the bogus pipeline procedure 
(Jones & Sigall, 1971) deceives participants into believing 
that the researcher can detect their true feelings about an 
attitude object, after which participants are asked to report 
their attitude toward the attitude object. This technique has 
been shown to reduce social desirability in responses to sim- 
ple self-report attitude measures (Roese & Jamieson, 1993). 

Another approach involves assessing participants’ physio- 
logical responses to attitude objects. Unfortunately, many 
physiological measures are incapable of distinguishing posi- 
tive and negative affective reactions (e.g., skin conductance, 
papillary response; Petty & Cacioppo, 1983; Guglielmi, 
1999). Positive and negative evaluations can be distin- 
guished, however, using facial electromyography (EMG) 
recordings (Cacioppo, Petty, Losch, & Kim, 1986), which 
detect the relative amount of electrical activity in the muscles 
that control smiling and frowning. 

Two other psychophysiological techniques show consider- 
able promise. One technique detects a specific pattern of elec- 
trical activity in the centroparietal region of the brain 
(amplitude of the late positive potential: Cacioppo, Crites, & 
Gardner, 1996; Gardner, Cacioppo, Crites, & Berntson, 1994), 
whereas the other examines the frequency and latency of eye 
blinks for attitude objects (Ohira, Winton, & Oyama, 1998). 
Future research should test whether these techniques are more 
closely linked to one attitude component (e.g., affect) than to 
others and whether the techniques yield support for separate 
positive and negative dimensions in evaluations. 


Implicit Attitudes 


Another limitation of most self-report measures of attitudes 
is that they assess only explicit attitudes, which are con- 
sciously retrievable from memory. As discussed in Petty’s 
chapter on attitude change, explicit, conscious attitudes may 
differ in numerous ways from implicit, nonconscious atti- 
tudes (Greenwald & Banaji, 1995; Wilson, Lindsey, & 
Schooler, 2000). Thus, it is useful to measure directly the 
nonconscious attitudes. 

Several techniques are available to accomplish this 
goal. One approach involves extracting self-report, attitude- 
relevant information without relying directly on partici- 
pants’ conscious determination of their attitude. For example, 
researchers can calculate participants’ attitudes from their 
responses to open-ended measures, even though these mea- 
sures do not directly ask participants to report their at- 
titudes. Other measures circumvent respondents’ inferential 
processes more strongly by recording behavior that occurs 
outside of participants’ conscious control. For example, 
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researchers can unobtrusively measure participants’ non- 
verbal and verbal behaviors toward other people as an indica- 
tion of liking (e.g., Fazio, Jackson, Dunton, & Williams, 
1995; Word, Zanna, & Cooper, 1974). Because people have 
difficulty consciously monitoring such behaviors, their be- 
haviors may often reveal attitudes of which the participants 
are unaware (see Dovidio, Kawakami, Johnson, Johnson, & 
Howard, 1997). 

The most common measures of implicit attitudes use elab- 
orate priming techniques (e.g., Dovidio et al., 1997; Fazio 
et al., 1995; Greenwald, McGhee, & Schwartz, 1998). For 
example, Fazio et al.’s (1995) “bona fide pipeline” presents 
participants with a target attitude object and asks participants 
to classify subsequently presented adjectives as being good or 
bad. Theoretically, positive evaluations should be activated in 
memory after viewing an attitude object that evokes a positive 
attitude. This priming of positive affect should cause partici- 
pants to be faster at classifying positive adjectives (e.g., nice, 
pleasant) than at classifying negative adjectives (e.g., disgust- 
ing, repugnant). In contrast, after viewing an attitude object 
that evokes a negative attitude, participants should be slower 
at classifying positive adjectives than at classifying negative 
adjectives. Indeed, evidence from several studies suggests that 
the latency to classify positive versus negative adjectives is 
affected by the prior presentation of a liked or disliked attitude 
object, particularly when participants hold a strong attitude 
toward the attitude object (Fazio, 1993; Fazio, Sanbonmatsu, 
Powell, & Kardes, 1986; cf. Bargh, Chaiken, Govender, & 
Pratto, 1992). Moreover, attitude scores can be derived from 
the speed of responding to the positive versus negative adjec- 
tives following the positive versus negative primes, and these 
attitude scores predict attitude-relevant behavior toward the 
attitude object (Fazio et al., 1995). Greenwald et al.’s (1998) 
“implicit association test” similarly relies on facilitating ver- 
sus inhibiting effects of evaluation on task performance. An 
interesting issue is whether such measures of implicit attitudes 
can be adapted to test the models of attitude content and atti- 
tude dimensionality. 


ATTITUDE FUNCTIONS 


Although models of attitude structure are useful for describ- 
ing ways in which attitudes may be represented in memory, 
these models do not address attitude functions, which are the 
psychological motivations that attitudes fulfill (Olson & 
Zanna, 1993). Understanding the functions of attitudes 
should clarify why people bother to form and maintain atti- 
tudes, as well as how underlying motivations influence the 
valence and structure of attitudes. 
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Two early theoretical statements are the best-known mod- 
els of attitude function (D. Katz, 1960; Smith, Bruner, & 
White, 1956). Smith et al. (1956) suggested that attitudes serve 
three functions: object appraisal, social adjustment, and exter- 
nalization. Object appraisal refers to the ability of attitudes to 
summarize the positive and negative attributes of objects in 
our environment; social adjustment is served by attitudes that 
help us to identify with people whom we like and to dissociate 
from people whom we dislike; and externalization is fulfilled 
by attitudes that defend the self against internal conflict. 
D. Katz (1960) proposed four attitude functions, which over- 
lap with those proposed by Smith et al. (1956): knowledge, 
utility, value expression, and ego defense. The knowledge 
function represents the ability of attitudes to summarize infor- 
mation about attitude objects; the utilitarian function exists in 
attitudes that maximize rewards and minimize punishments 
obtained from attitude objects; the value-expressive function 
exists in attitudes that express the self-concept and central val- 
ues (e.g., equality, freedom; Maio & Olson, 1998; Rokeach, 
1973; Schwartz, 1992); and the ego-defensive function pro- 
tects self-esteem. 

The object-appraisal function (which combines aspects 
of the utilitarian and knowledge functions) perhaps best ex- 
plains why people form attitudes in the first place. This func- 
tion implies that attitudes classify objects in the environment 
for the purposes of action. Moreover, it can be argued that all 
strong attitudes simplify interaction with the environment in 
this way, regardless of whether the attitudes imply favorabity 
or unfavorability toward the attitude object. 

Two important themes have emerged in research on atti- 
tude functions since these early theoretical statements. First, 
as just noted, evidence suggests that strong attitudes fulfill 
an object-appraisal function. Second, a distinction between 
instrumental attitudes (serving a utilitarian function) and 
symbolic attitudes (serving a value-expressive function) ap- 
pears to be useful. In the following sections, we describe the 
evidence regarding these observations. 


Object Appraisal 


In their description of the object-appraisal function, Smith 
et al. (1956) hypothesized that attitudes are energy-saving 
devices, because attitudes make attitude-relevant judgments 
faster and easier to perform. Two programs of research have 
directly supported this reasoning while suggesting important 
caveats. First, Fazio (1995, 2000) argued that the object- 
appraisal function should be more strongly served by atti- 
tudes that are spontaneously activated from memory when the 
object is encountered than by attitudes that are not sponta- 


neously retrieved. This prediction is based on the assumption 
that activated attitudes guide relevant judgments and behav- 
ior, whereas dormant attitudes have little effect during judg- 
ment and behavior processes. Consistent with this hypothesis 
is that highly accessible attitudes (either measured via re- 
sponse latency or manipulated via repeated attitude expres- 
sion) have been shown to increase the ease with which people 
make attitude-relevant judgments. For example, people who 
have accessible attitudes toward an abstract painting have 
been shown to be subsequently faster at deciding whether 
they prefer the painting over another painting; they also ex- 
hibit less physiological arousal during these preference deci- 
sions than do people who have less accessible attitudes (see 
Fazio, 2000). 

Another program of research has revealed that the strength 
of the object-appraisal motivation is influenced by levels of 
the need for closure, which is a “desire for a definite answer 
on some topic, any answer as opposed to confusion and am- 
biguity” (Kruglanski, 1989, p. 14). Of course, the object- 
appraisal function reflects the notion that attitudes can 
provide such answers because attitudes help people to make 
decisions about attitude objects. Consequently, a high need 
for closure should increase the desire to form and maintain 
attitudes. Kruglanski (1996) has tested this hypothesis using 
an individual difference measure of need for closure and sit- 
uational manipulations of the need for closure (which involve 
imposing or withdrawing situational pressures to resolve un- 
certainty). As expected, the effects of need for closure on at- 
titude change depended on whether participants had already 
formed an attitude toward the assigned topic. If participants 
had already formed an attitude, those who were high in need 
for closure were less persuaded by new information than 
were participants who were low in need for closure. In con- 
trast, if participants had not yet formed an attitude, those who 
were high in need for closure were more persuaded by new 
information than were participants who were low in need for 
closure. Thus, the need for closure was associated with a ten- 
dency to form and maintain attitudes. 


Instrumental Versus Symbolic Attitudes 


Numerous researchers have argued for a distinction between 
instrumental (or utilitarian) and symbolic (or value- 
expressive) attitudes (e.g., Herek, 1986; Prentice, 1987; Sears, 
1988). Instrumental attitudes classify attitude objects accord- 
ing to their ability to promote self-interest, whereas symbolic 
attitudes express concerns about self-image and personal 
values (Herek, 1986; Sears, 1988). This distinction has been 
used to understand attitudes toward many social groups (e.g., 


homosexual persons, persons with HIV, African Americans; 
Herek, 2000; Reeder & Pryor, 2000; Sears, 1988), consumer 
objects (Ennis & Zanna, 2000; Prentice, 1987; Shavitt, 1990), 
altruistic behaviors (Maio & Olson, 1995; Snyder, Clary, & 
Stukas, 2000), and political issues (Kinder & Sears, 1985; 
Lavine & Snyder, 2000). 

At least three lines of research support this distinction. 
First, some attitude objects elicit attitudes that are associated 
primarily with one or the other of these functions. For exam- 
ple, Shavitt (1990) found that people’s thoughts about air 
conditioners and coffee focus on the utility of the objects, 
whereas thoughts about greeting cards and flags tend to focus 
on the objects’ capacity to symbolize the self and social 
values. 

Second, evidence indicates that people are more persuaded 
by messages containing arguments that match the instrumental 
or symbolic functions of their attitudes than by messages con- 
taining arguments that do not match the functions of their atti- 
tudes. For example, Shavitt (1990) found that instrumental ads 
for instrumental products (e.g., an air conditioner) were more 
persuasive than were symbolic ads for instrumental products. 
Similarly, Snyder and DeBono (1985) found that low self- 
monitors (who typically possess instrumental attitudes) were 
more persuaded by instrumental ads for various products (e.g., 
whiskey, cigarettes) than were high self-monitors (whose 
attitudes typically fulfill social-adjustive functions). Also, 
Prentice (1987) found that participants who attached high 
importance to symbolic values (e.g., mature love, self-respect) 
and symbolic possessions (e.g., family heirlooms) were less 
persuaded by messages that contained instrumental arguments 
than by messages that contained symbolic arguments. Presum- 
ably, these match effects occurred because people scrutinize 
arguments that match the function of their attitude more 
carefully than they scrutinize arguments that do not match 
the function of their attitude (Petty & Wegener, 1998). As a re- 
sult, match effects occur only when the persuasive arguments 
are strong, but not when the persuasive arguments are weak 
(Petty & Wegener, 1998). 

Finally, the distinction between instrumental and sym- 
bolic attitudes improves the measurement of attitudes and the 
prediction of behavior. Regarding attitude measurement, 
many studies have shown that attitudes toward ethnic groups 
are related to beliefs about the group members’ values, over 
and above beliefs about the group members’ implications for 
personal well-being (e.g., Esses et al., 1993; I. Katz & Hass, 
1988; see also Schwartz & Struch, 1989). Also, when an atti- 
tude serves a symbolic function, personal values enhance the 
prediction of attitude-relevant behavior over and above be- 
liefs about the positive or negative instrumental attributes of 


Attitudes and Higher-Order Constructs 307 


the behavior and perceptions of group norms (Beck & Ajzen, 
1991; Maio & Olson, 1995). Values exhibit weaker relations 
to attitudes and behaviors that serve utilitarian functions 
(Kristiansen & Zanna, 1988; Maio & Olson, 1994, 1995; cf. 
Maio & Olson, 2000). 


ATTITUDES AND HIGHER-ORDER CONSTRUCTS 


Attitudes do not, of course, exist in isolation from each other 
or from other constructs. For example, people who favor so- 
cial assistance payments to the poor may on average possess 
positive attitudes toward other social welfare programs such 
as national health care and subsidized housing. The positive 
attitudes toward all of these programs may in turn arise be- 
cause the person attaches high importance to the social value 
of helpfulness. Such relations among attitudes and values 
may have implications for stability and change in attitudes. In 
this section, we consider how attitudes are structurally and 
functionally related to each other and how sets of attitudes 
may be related to higher-order constructs such as values and 
ideologies. 


Interattitude Structure 


Heider’s (1958) balance theory is one of the earliest models 
of relations between attitudes. This theory examined a situa- 
tion in which a person (P) holds a positive or negative atti- 
tude toward another person (O), and both people (P and O) 
hold a positive or negative attitude toward a particular object 
(X). According to Heider, such P-O-X triads are balanced 
when P likes O and they hold the same (positive or negative) 
attitude toward X, or when P dislikes O and they hold differ- 
ent attitudes toward X. A state of imbalance occurs when P 
likes O and they hold different attitudes toward X, or when P 
dislikes O and they hold the same attitude toward X. In other 
words, balance exists when a person agrees with someone 
whom he or she likes, or when a person disagrees with some- 
one whom he or she dislikes. 

Heider (1958) predicted that unbalanced states create an 
unpleasant tension, which causes people to prefer balanced 
states. Subsequent research documented that participants 
report more discomfort with hypothetical unbalanced triads 
than with hypothetical balanced triads (e.g., Jordan, 1953). 
Individuals can convert unbalanced states to balanced states 
by using three strategies: Change the attitude toward O or X 
(attitude change), change the beliefs about O’s attitude 
(belief change), or focus on some aspect of O or X that bal- 
ances the triad (differentiation). In cases in which attitude 
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change is the selected route to imbalance reduction, Heider 
did not indicate whether the attitude toward O or the attitude 
toward X is more likely to change. 

Osgood and Tannenbaum’s (1955) congruity theory ad- 
dressed this latter issue by proposing that attitudes toward both 
O and X would change in the face of imbalance. In addition, 
these researchers predicted that the amount of attitude change 
would depend on the extremity of each attitude, such that the 
more extreme attitude would change the least. (Balance the- 
ory did not consider the role of attitude extremity.) These 
predictions have received some support (e.g., Tannenbaum, 
1966), with important exceptions (e.g., Tannenbaum & 
Gengel, 1966). 


Relations Between Attitudes, Values, and Ideologies 
Attitudes and Values 


Not only are different attitudes interconnected, but they may 
also be related to other, higher-order constructs such as val- 
ues. The capacity of attitudes to express values is highlighted 
by theories describing the value-expressive function of atti- 
tudes (e.g., Herek, 1986; D. Katz, 1960) and by measures that 
specifically include value-relevant beliefs in the assessment 
of attitude components (e.g., Esses et al., 1993). In addition, 
Rokeach’s (1973) seminal theory of values emphasized the 
role of values in driving attitudes. He suggested that a rela- 
tively small set of social values underlie most attitudes. Con- 
sistent with this reasoning, rankings of the importance of 
values have been shown to predict a large variety of attitudes 
and behavior (e.g., Maio, Roese, Seligman, & Katz, 1996). 
Moreover, priming a value makes accessible a variety of 
value-relevant attitudes, but priming value-relevant attitudes 
does not make accessible a variety of values (Gold & 
Robbins, 1979; Thomsen, Lavine, & Kounios, 1996), sug- 
gesting that values are above attitudes in the hierarchical 
network of attitudes, beliefs, and values. 

The potential centrality of values is also reflected in 
Rosenberg’s (1960, 1968) evaluative-cognitive consistency 
theory. According to this theory, people strive for consis- 
tency between their attitudes and social values. This pursuit 
of consistency is similar to the pursuit of balance in P-O-X 
triads. Specifically, people seek consistency across a series of 
person-value-object (P-V-X) triads or bands. Each band 
contains the person’s attitude toward the attitude object 
(e.g., censorship), the person’s belief in the importance of a 
particular value (e.g., freedom), and the perceived relation 
between the attitude object and the value (e.g., censorship 
threatens freedom). Moreover, for any given attitude object, 
the number of bands equals the number of relevant values, 
such that the bands differ only in their referent values (e.g., 


P-V,-X, P-V,-X, P-V-X). Rosenberg (1960) suggested that 
people are unlikely to restore consistency by changing per- 
sonal values because each value can be relevant to many 
attitudes. Thus, changing a value may balance triads for one 
attitude object, but could also create imbalance in other 
triads. Consistent with this reasoning, Rosenberg observed 
that people were more likely to change their beliefs about the 
relations between an attitude object and relevant values than 
to change the values themselves. 


Attitudes and Ideologies 


Attitudes may also express ideologies, which are clusters of 
thematically related values and attitudes (Converse, 1964; 
McGuire, 1985). Liberalism and conservatism are well- 
known ideologies. Liberal ideologies encompass attitudes 
and values that promote universal rights and benevolence, 
whereas conservative ideologies encompass attitudes and 
values that promote freedom and self-enhancement (e.g., 
Kerlinger, 1984). 

If the liberal-conservative dimension is a valid means for 
sorting political attitudes, then people should tend to endorse 
either conservative attitudes or liberal attitudes, but not both. 
Yet people’s actual endorsement of liberal and conservative 
attitudes does not follow this simple pattern (Converse, 1964; 
Fleishman, 1986). Multidimensionality is most evident 
among people who lack expertise in political issues (Lavine, 
Thomsen, & Gonzales, 1997; Lusk & Judd, 1988). Re- 
searchers have found at least two distinct ideological dimen- 
sions within political attitudes: attitudes toward moral 
regulation versus individual freedom, and attitudes toward 
compassion versus competition (e.g., Ashton, Esses, & Maio, 
2001; Boski, 1993). 

There has been recent interest in ideologies from re- 
searchers examining nonpolitical attitudes as well. For exam- 
ple, researchers in the area of intergroup attitudes have 
examined several ideological dimensions, including multi- 
culturalism versus color blindness (Wolsko et al., 2000) and 
individualism versus communalism (I. Katz & Hass, 1988). 
Diverse ideologies have also been examined in studies of at- 
titudes toward gender roles (Spence, 1993), body weight and 
obesity (Quinn & Crocker, 1999), ways of life (de St. Aubin, 
1996), and violence (Cohen & Nisbett, 1994). 

At present, there is little evidence documenting precisely 
how attitudes express broad values and ideologies. For exam- 
ple, values may occasionally function as post hoc justifica- 
tions for attitudes, rather than as their psychological basis 
(Kristiansen & Zanna, 1988). When causal influences of val- 
ues and ideologies do occur, the effects may be indirect or 
direct. In an indirect effect, values and ideologies influence a 
specific attitude indirectly through other attitudes, whereas a 


direct effect occurs when people perceive the value itself as 
relevant to their attitude (Maio & Olson, 1994, 1995). The 
latter, direct process may be more likely when the value and 
the reasons for its importance have been consciously articu- 
lated (Maio & Olson, 1998). 


CHARACTERISTICS OF ATTITUDES 


Attitudes vary along numerous dimensions, or characteris- 
tics, that have significant implications for information pro- 
cessing, persistence, and behavior. A continuing issue in the 
literature on attitude has been the relations among these 
dimensions; some researchers have argued that the various 
characteristics are distinct and should be treated as indepen- 
dent, but other researchers have argued that the characteristics 
are interdependent and should be treated as manifestations of 
a smaller set of constructs. In this section, we briefly describe 
these dimensions and address the controversy surrounding 
the interrelations among them. 


Extremity 


Attitude extremity is the oldest and most basic dimension of 
attitudes. Extremity refers to the extent to which the attitude 
deviates from a neutral midpoint—that is, the extent to which 
the individual’s evaluation is strongly favorable or strongly 
unfavorable. Extreme attitudes (compared to moderate atti- 
tudes) are more resistant to influence (e.g., Osgood & 
Tannenbaum, 1955), more likely to be projected onto others 
(e.g., Allison & Messick, 1988), and more likely to predict 
behavior (e.g., Fazio & Zanna, 1978). Attitude theorists have 
generally assumed that extreme attitudes develop over time, 
often resulting from actions that publicly commit the individ- 
ual to his or her position. 


Direct-Indirect Experience 


Attitudes can be based on direct, personal experience with 
the attitude object, or they can be based on indirect infor- 
mation from others about the object. For example, students’ 
attitudes toward chemistry courses can be based on their own 
experiences with previous chemistry courses or on things 
they have heard from others who have taken chemistry 
courses. Researchers have found that attitudes based on di- 
rect experience (compared to those based on indirect experi- 
ence) are more confidently held (e.g., Fazio & Zanna, 1978), 
more stable over time (e.g., Doll & Ajzen, 1992), more resis- 
tant to influence (e.g., Wu & Shaffer, 1987), and more likely 
to predict behavior (e.g., Fazio & Zanna, 1981). Presum- 
ably, these effects of direct experience reflect that we trust 
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our own senses more than we do others’ reports, which in- 
creases confidence in attitudes based on direct experience. 


Accessibility 


Accessibility refers to the ease of activation (activation 
potential) of a construct (Higgins, 1996). Highly accessible 
attitudes are evaluations that come to mind quickly and spon- 
taneously when the attitude object is encountered. Accessi- 
bility depends at least in part on the frequency with which the 
attitude has been activated in the recent past. Researchers 
have found that highly accessible attitudes (compared to 
less accessible attitudes) are more resistant to change (e.g., 
Bassili, 1996), more likely to influence perceptions of atti- 
tude-relevant events (e.g., Houston & Fazio, 1989), and more 
likely to predict behavior (e.g., Fazio & Williams, 1986). 
These effects of accessibility presumably reflect that highly 
accessible attitudes are always activated by the attitude ob- 
ject, so they exert an impact (compared to low accessibility 
attitudes, which are more likely to remain dormant). 


Embeddedness 


Attitude embeddedness (also called working knowledge) 
refers to the amount of attitude-relevant information, such as 
beliefs and experiences, that is linked to the attitude (Scott, 
1968; Wood, 1982). The more information that comes to 
mind when one encounters the attitude object, the more em- 
bedded is the attitude. Highly embedded attitudes are more 
resistant to change (e.g., Wood, Rhodes, & Biek, 1995), more 
likely to influence perceptions of attitude-relevant stimuli 
(e.g., Vallone, Ross, & Lepper, 1985), and more predictive of 
behavior (e.g., Kallgren & Wood, 1986) than are attitudes 
with low embeddedness. These effects of embeddedness pre- 
sumably reflect that attitudes based on a lot of information 
are held more confidently and provide the individual with 
many bits of knowledge to counteract the potential influence 
of new information. Also, embedded attitudes can be more 
accessible than are attitudes low in embeddedness (see Wood 
et al., 1995). 


Evaluative Consistency 


Evaluative consistency refers to the degree of consistency be- 
tween the overall attitude (the evaluation) and one of its compo- 
nents (cognitive, affective, or behavioral information). 
Evaluative consistency occurs when the favorability of the 
overall evaluation of the object is similar to (a) the favorability 
implied by the individual’s beliefs about the object (evaluative- 
cognitive consistency), (b) the favorability implied by the indi- 
vidual’s feelings toward the object (evaluative-affective 
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consistency), or (c) the favorability implied by the individual’s 
behavioral experience with the object (evaluative-behavioral 
consistency). Most past research has examined evaluative- 
cognitive consistency (see Eagly & Chaiken, 1993, 1998). 
Attitudes that are high in evaluative consistency are more 
stable (e.g., Rosenberg, 1968), more resistant to change (e.g., 
Chaiken & Baldwin, 1981), more likely to influence informa- 
tion processing (e.g., Chaiken & Yates, 1985), and more likely 
to predict behavior (e.g., Chaiken, Pomerantz, & Giner-Sorolla, 
1995) than are attitudes that are low in evaluative consistency. 
These effects of evaluative consistency probably reflect—at 
least in part—that consistent attitudes yield similar evaluative 
reactions to the object regardless of the situational salience of 
attitude components. Consistent attitudes might also be held 
more confidently and be more accessible than are inconsistent 
attitudes (see Chaiken et al., 1995). 


Ambivalence 


Ambivalence refers to the simultaneous presence of conflict- 
ing positive and negative elements within an attitude (Bell 
et al., 1996; Kaplan, 1972; I. Katz & Hass, 1988; M. M. 
Thompson et al., 1995). Ambivalence can occur between ele- 
ments of the same component of an attitude, such as when 
people possess both positive and negative feelings about a 
minority group (intracomponent ambivalence), or between 
two components of an attitude, such as when people possess 
negative beliefs but positive feelings about junk food (inter- 
component ambivalence). Attitudes that are ambivalent are 
likely also to be low in evaluative consistency, but the con- 
structs are distinct: Low consistency refers to discrepancies 
between the overall evaluation and one component, whereas 
ambivalence refers to discrepancies between elements of a 
component or between components (Maio et al., 2000). Am- 
bivalent attitudes have been shown (compared to nonambiva- 
lent attitudes) to be easier to change (e.g., Armitage & 
Conner, 2000) and to be less predictive of behavior (e.g., 
Lavine et al., 1998). Ambivalent attitudes have also been 
shown to polarize judgments when one of the conflicting 
elements is made more salient than another. For example, 
MacDonald and Zanna (1998) showed that individuals with 
ambivalent attitudes toward feminists made either more fa- 
vorable or more unfavorable judgments about a feminist job 
applicant, depending on whether positive or negative infor- 
mation was made salient, whereas individuals with nonam- 
bivalent but equally extreme attitudes were not affected by 
the salience of positive or negative information. Ambivalent 
attitudes are hypothesized to have these polarizing effects 
because such attitudes contain both positive and negative 
information; priming can make available one or the other 


category of information, which then influences judgments. 
There is also some evidence that ambivalent attitudes are less 
accessible than are nonambivalent attitudes (Bargh et al., 
1992), which might explain in part why the former are more 
pliable and less predictive of behavior (Armitage & Conner, 
2000). 


Strength: An Integrative Concept? 


The characteristics of attitudes discussed to this point overlap 
in several ways. First, they all tend to influence the degree to 
which attitudes are stable, resist change, affect the perception 
of attitude-relevant stimuli, and influence behavior. Also, the 
characteristics tend to be interrelated. For example, attitudes 
based on direct experience tend to be more extreme, less am- 
bivalent, and more accessible; evaluatively consistent atti- 
tudes tend to be more accessible and less ambivalent; 
ambivalent attitudes tend to be less extreme and less accessi- 
ble; and so on. 

Intuitively, all of these characteristics reflect the extent to 
which attitudes are important to individuals (Krosnick, 1989) 
or are held with conviction (Abelson, 1988). The term 
attitude strength has become a common label for this quality 
(e.g., Petty & Krosnick 1995). Theorists have linked many 
attitude characteristics with strength, including extremity, 
intensity, certainty, importance, embeddedness, direct experi- 
ence, accessibility, conviction, evaluative consistency, am- 
bivalence, and vested interest (see Bassili, 1996; Krosnick & 
Abelson, 1992; Raden, 1985). Each of these characteris- 
tics incorporates aspects of subjective certainty, personal 
importance, and significant psychological and behavioral 
consequences. 

Given the conceptual overlap among these various charac- 
teristics, theorists have wondered whether the variables rep- 
resent more-or-less-interchangeable terms for attitude 
strength—in other words, whether the characteristics form a 
single dimension ranging from weak to strong attitudes. The 
most common way to investigate this issue has been to mea- 
sure numerous characteristics and conduct a factor analysis 
of the data. If a single factor emerged, the unidimensional at- 
titude strength notion would be supported, whereas if multi- 
ple factors emerged, a more complex framework would be 
suggested. Such studies have generally supported the multi- 
dimensional view (e.g., Abelson, 1988; Krosnick, Boninger, 
Chuang, Berent, & Carnot, 1993), although the precise struc- 
tures of the factors emerging from the analyses have been in- 
consistent. Based on these data, the most common conclusion 
has been that the various characteristics should be viewed as 
distinct but related constructs (e.g., Krosnick et al., 1993; 
Raden, 1985). 


Bassili (1996) proposed a distinction between “operative” 
and “meta-attitudinal” measures of attitude strength. Opera- 
tive measures reflect ongoing processes that are related to 
attitude strength, whereas meta-attitudinal measures reflect 
the individual’s conscious judgments about qualities that 
are related to the strength of his or her attitude. For example, 
response latency is an operative measure—it unobtrusively 
reveals the accessibility of the evaluation (one feature of 
attitude strength). In contrast, a subjective judgment of the 
importance of the attitude is a meta-attitudinal measure—it 
reflects a conscious judgment about a strength-related fea- 
ture. In two studies, operative and meta-attitudinal measures 
of attitude strength were obtained from participants and used 
to predict resistance to influence and stability (two presumed 
consequences of attitude strength). Results showed that the 
operative measures predicted the criteria better than did 
the meta-attitudinal measures. Bassili concluded that opera- 
tive measures of attitude strength are more valid than meta- 
attitudinal measures. 


ATTITUDE FORMATION 


Where do attitudes come from? How do they develop? As de- 
scribed in the earlier section on attitude structure, attitudes 
can be based on cognitive, affective, and behavioral informa- 
tion. Each of these possible avenues of attitude formation is 
discussed in the following section; a biological perspective 
on attitude formation is also introduced. 

It is important to note that the psychological processes 
involved in attitude formation can also lead to attitude 
change Oe, the alteration of an existing attitude to a differ- 
ent evaluative position), and that theories of attitude forma- 
tion are also theories of attitude change. Because there is 
another chapter in this volume dedicated entirely to attitude 
change (see the chapter by Petty in this volume), we de- 
scribe the mechanisms involved in attitude formation only 
briefly here. 


Cognitive Processes 


One crucial source of attitudes is cognitive information about 
the target—that is, beliefs about the attributes of the target. 
Indeed, as discussed in the section of this chapter on attitude 
structure, beliefs play a prominent role in both major models 
of attitude content. Knowledge about an object can come 
either from direct experience with the object or from indirect 
sources such as parents, peers, and the media. As already 
noted, attitudes based on direct experience tend to be stronger 
than are attitudes derived from indirect information. 


Attitude Formation 311 


The best-known theory of attitude formation based on 
cognitive beliefs is the theory of reasoned action (Fishbein & 
Ajzen, 1975), which is an expectancy-value model in which 
salient (i.e., highly accessible) beliefs are hypothesized to 
combine additively to form the overall evaluation of the tar- 
get (attitude toward the target). As noted earlier in the chap- 
ter, many researchers have documented a strong relation 
between attitudes and expectancy-value products (e.g., 
Budd, 1986; van der Pligt & de Vries, 1998). This model of 
attitudes is based on a conception of humans as rational, de- 
liberate thinkers who base their attitudes and behavior on in- 
formation about the positive and negative consequences of 
various actions. 


Affective Processes 


Individuals’ evaluations of targets can also be based on how 
the target makes them feel—that is, on the emotions or affect 
aroused by the target. Indeed, as noted in this chapter’s sec- 
tion on attitude structure, affect sometimes predicts attitudes 
better than does cognition (e.g., see Esses et al., 1993). Of 
course, affect and cognition are often (or even usually) con- 
sistent with one another because these processes are mutually 
interdependent (e.g., knowledge can influence feelings, and 
feelings can guide thoughts). 

Although affect toward objects can spring from beliefs 
about those objects, there are a number of processes that can 
result in affect’s becoming associated with an object inde- 
pendently of cognition (i.e., independently of information 
about the characteristics of the object). These processes are 
discussed in detail in the section entitled “Low-Effort 
Attitude Change Processes” in the chapter on attitude change 
(see the chapter by Petty in this volume), so we only mention 
them here briefly. One process is classical conditioning, 
which occurs when a stimulus comes to evoke a response 
that it did not previously evoke, simply by being paired 
with another stimulus that already evokes that response. For 
example, the receptionist at a dental office might come to 
evoke negative affect for patients who are very fearful of den- 
tal work. Although a conditioning perspective on attitudes 
has been around for many years in social psychology (e.g., 
Staats & Staats, 1958), the past decade has continued to see 
very sophisticated studies documenting conditioning effects 
on attitudes (e.g., Cacioppo, Marshall-Goodell, Tassinary, & 
Petty, 1992). 

A second process through which affect can become linked 
to objects without necessary cognitive mediation is mere ex- 
posure. The mere exposure effect (Zajonc, 1968) occurs when 
repeated, simple exposure to an object (i.e., exposure without 
reinforcement feedback) leads to more favorable feelings 
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toward the object. For example, an abstract painting that ini- 
tially evokes confusion might come to be liked over time— 
simply because the painting is more familiar. The results of 
several fascinating studies have shown that conscious recog- 
nition that stimuli are familiar is not necessary for the mere 
exposure effect to occur (e.g., Moreland & Beach, 1992), nor, 
in fact, is conscious perception of the object—subliminal ex- 
posures can increase liking for a stimulus (e.g., Bornstein & 
D’ Agostino, 1992). 


Behavioral Processes 


A third potential source of attitudes is behavioral 
information—specifically, knowledge of one’s previous ac- 
tions toward a target. This knowledge can influence attitudes 
through a variety of processes, including dissonance arousal 
and self-perception processes. From the perspective of disso- 
nance theory (Festinger, 1957), knowing that one has acted 
favorably or unfavorably toward a target will motivate an indi- 
vidual to evaluate the target in a manner consistent with those 
actions (e.g., Cooper & Fazio, 1984). From the perspective of 
self-perception theory (Bem, 1972), individuals might logi- 
cally infer that their attitudes are consistent with their actions 
(e.g., Olson, 1992). Thus, an effect of past behavior on atti- 
tudes may reflect both cognitive and affective processes. 

In a recent paper, Albarracin and Wyer (2000) reported 
several studies in which they cleverly tested the effects of 
knowledge about past behavior by leading participants to be- 
lieve that they had expressed either support for or opposition 
to a particular position without being aware of it. Because 
participants had not actually engaged in such behavior, the 
research tested directly the effects of believing that one has 
behaved in a certain fashion. Results showed that participants 
reported attitudes that were consistent with the alleged past 
behavior and that subsequent behavior toward the target also 
tended to be consistent with the alleged prior action. Thus, 
behavioral information had a direct effect on attitudes and 
subsequent behavior. 


Biological Processes 


Social psychologists have directed little attention to biologi- 
cal processes in attitude formation. A few biological issues 
have been examined, including physiological concomitants 
of attitudes (e.g., Cacioppo & Petty, 1987), the impact of cer- 
tain drugs on attitudes and persuasion (e.g., MacDonald, 
Zanna, & Fong, 1996), and the role of physiological arousal 
in specific attitudinal phenomena (e.g., Zanna & Cooper, 
1974). In general, however, biological processes have been 
neglected by attitude researchers. 


A provocative biological perspective on attitudes concerns 
the role of genetic factors. The field of behavioral genetics 
has begun to influence social psychologists, including atti- 
tude researchers. It is extremely unlikely, of course, that 
there are direct, one-to-one connections between genes and 
attitudes (e.g., a gene that causes attitudes toward capital 
punishment). Nevertheless, genes could establish general 
predispositions that shape environmental experiences in 
ways that increase the likelihood of an individual’s develop- 
ing specific traits and attitudes. For example, children who 
are naturally small for their age might be picked on by other 
children more than their larger peers are, with the result that 
the smaller children might develop anxieties about social in- 
teraction, resulting in consequences for their attitudes toward 
social events. 

Arvey, Bouchard, Segal, and Abraham (1989) found that 
approximately 30% of the observed variance in job satisfac- 
tion in their sample of identical twins raised apart was 
attributable to genetic factors. Thus, respondents’ attitudes 
toward their jobs appeared to be partly inherited. In addition, 
Eaves, Eysenck, and Martin (1989) reported the results of 
two surveys involving almost 4,000 pairs of same-sex twins. 
A variety of social attitudes were assessed, including those 
toward crime, religion, race, and lifestyle. Heritability esti- 
mates for individual items ranged from 1% to 62%, with a 
median of 39%. 

But how do genes impact attitudes? What are some 
specific, genetically influenced characteristics that can sys- 
tematically bias environmental experience so as to induce 
particular attitudes? Tesser (1993) identified several possibil- 
ities, including intelligence, temperament, and sensory struc- 
tures. Olson, Vernon, Harris, and Jang (2001) measured some 
potential mediators of attitude heritability, including physical 
characteristics and personality factors, in a study of more 
than 300 pairs of same-sex twins. Most of these possible me- 
diators were themselves highly heritable in the sample of 
twins, and multivariate analyses showed that several of the 
variables correlated at a genetic level with attitudes that were 
heritable. For example, the personality trait of sociability 
yielded a significant heritability coefficient and significant 
genetic correlations with five of the six heritable attitude 
measures. These data suggest that the heritability of sociabil- 
ity (see Zuckerman, 1995) might account in part for the heri- 
table components of some attitudes. 

Tesser (1993) hypothesized that attitudes that are highly 
heritable might have a biological basis that makes attitude 
change difficult, which could lead individuals to develop psy- 
chological defenses to protect the attitudes. For example, 
niche building might occur (see Plomin, DeFries, & Loehlin, 
1977), such that individuals seek out environments that are 


compatible with their highly heritable attitudes. Tesser (1993; 
Tesser & Crelia, 1994) tested this idea in several ingenious 
ways. In all of his studies, attitudes that had been shown by 
Eaves et al. (1989) to have either high or low heritability co- 
efficients were studied. In one study, individuals were found 
to provide answers more quickly for high than for low heri- 
tability attitudes. In another study, individuals were found to 
be less affected by conformity pressure when reporting high 
than when reporting low heritability attitudes. In a third 
study, interpersonal similarity on high heritability attitudes 
was shown to affect liking for others more than did similarity 
on low heritability attitudes. Finally, in two studies, individu- 
als found agreement feedback more reinforcing when the 
agreement occurred for highly heritable attitudes than when 
it occurred for less heritable attitudes. These findings suggest 
that attitude strength is positively correlated with attitude 
heritability (see also Olson et al., 2001). 


ATTITUDES AND INFORMATION PROCESSING 


One of the fundamental functions of attitudes, as discussed 
earlier, is the object-appraisal function, which refers to the 
capacity of attitudes to facilitate both the identification of 
objects and the rapid appraisal of the objects’ implications 
for the self. This function underscores that attitudes influ- 
ence how objects are perceived and how information about 
those objects is processed. In this section we review research 
on the effects of attitudes on information processing. The 
theme of this section is selectivity—attitudes tend to facilitate 
the processing of information that is consistent with them and 
to inhibit the processing of inconsistent information. 


Selective Attention 


Festinger (1957) proposed in his dissonance theory that peo- 
ple want to believe that their decisions and attitudes are cor- 
rect. Whereas individuals attend in an unbiased way to 
information prior to making decisions or forming attitudes, 
Festinger argued that after attitudes are formed, they moti- 
vate people to pay attention to consistent information and 
avoid inconsistent information. Early tests of this selective 
exposure hypothesis yielded little support (see Freedman & 
Sears, 1965), but researchers gradually identified boundary 
conditions for the effect (see Frey, 1986). For example, the 
utility, novelty, and salience of consistent versus inconsis- 
tent information must be controlled so that the effects of 
attitudinal consistency can be tested clearly. Researchers 
have documented selective attention in the laboratory (e.g., 
Frey & Rosch, 1984) and in field settings (e.g., Sweeney & 
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Gruber, 1984), and there is evidence that individuals with 
repressing-avoidance defensive styles may exhibit selective 
attention to consistent information more than do individ- 
uals with ruminative-approach defensive styles (Olson & 
Zanna, 1979). 

There is also some evidence of a broader form of selective 
attention, which relates to the existence of strong attitudes 
per se. Specifically, Roskos-Ewoldsen and Fazio (1992) 
showed that objects toward which individuals have highly 
accessible attitudes (whether positive or negative) are more 
likely to attract attention than are objects toward which indi- 
viduals have less accessible attitudes. Presumably, this selec- 
tivity effect is not motivated by a desire to believe one’s 
attitudes to be correct, but rather by the functional value of 
quickly attending to objects that personal experience has 
shown to be potentially rewarding or punishing. 


Selective Perception 


Many researchers have shown that attitudes influence the per- 
ception or interpretation of attitude-relevant information, with 
the effect generally of interpreting information as more sup- 
portive of one’s attitudes than is actually the case. For ex- 
ample, Vidmar and Rokeach (1974) found that viewers’ 
perceptions of the television show All in the Family were re- 
lated to their racial attitudes: Low-prejudice viewers saw the 
bigoted character of Archie Bunker as the principal target of 
humor and sarcasm in the show, whereas high-prejudice view- 
ers saw Archie sympathetically and considered his liberal son- 
in-law Mike to be the principal target of humor and sarcasm. 
Similarly, Lord, Ross, and Lepper (1979) found that individ- 
uals’ attitudes toward capital punishment predicted their as- 
sessments of the quality of two alleged scientific studies, one 
supporting and one questioning the deterrence value of the 
death penalty: Participants evaluated the study that apparently 
supported their own view more favorably than they evaluated 
the study that apparently disconfirmed their view. Houston 
and Fazio (1989) replicated this study and showed that the bi- 
asing effect of attitudes on the interpretation of information 
was significant only when the attitudes were highly accessible 
(see also Fazio & Williams, 1986; Schuette & Fazio, 1995). 
In another domain, Vallone et al. (1985) found that indi- 
viduals’ evaluations of the media coverage of an event were 
biased by their relevant attitudes (see also Giner-Sorolla & 
Chaiken, 1994). 

If there is a general bias to perceive the world as consistent 
with one’s attitudes, then existing attitudes might reduce the 
ability of perceivers to detect that the attitude object has 
changed. Indeed, Fazio, Ledbetter, and Towles-Schwen (2000) 
have documented such an effect and related it to attitude 
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accessibility. Specifically, attitudes tended to interfere with 
participants’ ability to perceive change in an attitude target, 
and this effect was stronger for highly accessible attitudes than 
for less accessible attitudes. In another set of studies, Stewart, 
Vassar, Sanchez, and David (2000) showed that participants’ 
attitudes toward women’s and men’s societal roles influenced 
whether they individuated male or female targets more: Indi- 
viduals with traditional sex-role attitudes individuated male 
targets more than they did female targets, whereas individuals 
with nontraditional sex-role attitudes individuated female tar- 
gets more than they did male targets. 


Selective Memory 


Attitudes have long been thought to influence memory and 
learning of attitude-related information. A variety of processes 
could contribute to selective memory, including paying more 
attention to attitudinally consistent information (but see 
Roberts, 1985), finding it easier to store attitudinally consistent 
information, and finding it easier to retrieve attitudinally con- 
sistent information from memory. Early studies (e.g., Levine & 
Murphy, 1943) indicated that individuals learned and recalled 
information that was consistent with their attitudes better 
than they did information that was inconsistent with their atti- 
tudes. Subsequent researchers, however, had difficulty obtain- 
ing significant selective memory effects and questioned 
the reliability of the phenomenon (e.g., Greenwald & 
Sakumura, 1967). 

In a comprehensive and detailed review and meta-analysis 
of research on attitude-memory effects, Eagly, Chen, 
Chaiken, and Shaw-Barnes (1999) concluded that the hypoth- 
esized attitude congeniality effect (i.e., information congenial 
with one’s attitudes is more memorable than is uncongenial 
information) has been small in magnitude and inconsistent 
across studies. Especially worrisome was evidence that the 
effect has grown weaker in more recent experiments (com- 
pared to earlier experiments), because the recent studies have 
generally used more rigorous methods. It appears that selec- 
tive memory may be a phenomenon weaker than selective 
attention and selective perception. 

Perhaps the clearest evidence of selective memory has 
been obtained in studies testing whether individuals use their 
attitudes as clues for searching memory (i.e., studies specifi- 
cally testing selective search and retrieval effects, as opposed 
to selective learning and memory in general). Ross (1989) 
reviewed a number of studies showing that people used their 
attitudes as clues for searching memory, reconstructing past 
events, or both. For example, Ross, McFarland, and Fletcher 
(1981) exposed respondents to one of two messages that had 
previously been shown to have reliable persuasive effects in 


opposite directions. In an apparently separate study, respon- 
dents exposed to the persuasive message provided reports of 
the frequency with which they had performed a number of 
behaviors in the past month, including some behaviors 
related to the target of the persuasive message. Respondents 
reported more frequent behaviors consistent with the attitude 
promoted in their message than with the attitude promoted in 
the opposing message. Presumably, respondents used their 
newly formed attitudes to search their memories and to re- 
construct their behaviors in the previous month. 


Attitude Polarization 


Attitudes guide information processing in another way— 
namely, they guide spontaneous thinking about the attitude 
object. Tesser (1978) showed that simply thinking about an 
attitude object tended to polarize the evaluation even in the 
absence of any new information. For example, simply think- 
ing about a person who was either likable or unlikable led to 
stronger evaluations (positive for the likable target, negative 
for the unlikable partner) than did a control condition in which 
participants performed a distracting task. Presumably, the ex- 
isting attitude led participants to generate thoughts that were 
consistent with it. This interpretation is supported by findings 
that polarization effects are stronger when the individual is 
knowledgeable about the attitude object and when the exist- 
ing attitude is high in evaluative-cognitive consistency (see 
Chaiken & Yates, 1985). 


ATTITUDES AND BEHAVIOR 


We discussed earlier how attitudes fulfill various functions 
for individuals, including the rapid appraisal of attitude ob- 
jects (object-appraisal function), the approach of rewarding 
objects and the avoidance of punishing objects (utilitarian 
function), the expression of underlying values and identity 
(value-expressive function), and so on. All of these hypoth- 
esized functions are predicated in part on the assumption 
that individuals behave in ways that are consistent with their 
attitudes—in other words, on the assumption that attitudes 
influence action. In this final section, we review some of the 
literature on attitude-behavior consistency. 

The hypothesized strong relation between attitudes and be- 
havior has sometimes proven difficult to document. For exam- 
ple, Wicker (1969) reviewed 30 studies that examined 
attitude-behavior consistency and concluded that there was 
“little evidence to support the postulated existence of stable, 
underlying attitudes within the individual which influence both 
his verbal expressions and his actions” (p. 75). Fortunately, 


since that time, researchers have identified several factors that 
influence attitude-behavior consistency, and the appropriate 
conclusion seems to be that measures of attitudes and behavior 
are closely related in some circumstances but not in others. We 
outline these factors in the following sections. 


Compatibility of Attitude and Behavior Measures 


An important conceptual advance came from Fishbein and 
Ajzen’s (1975) theory of reasoned action. These theorists dis- 
tinguished between attitudes toward objects and attitudes to- 
ward behaviors—a distinction that can also be viewed as 
general attitudes versus specific attitudes. Hypothetically, 
attitudes toward objects should influence the favorability of 
the class of behaviors related to the object, whereas attitudes 
toward behaviors should influence the favorability of those 
specific behaviors. 

The point made by Fishbein and Ajzen was that for there 
to be a strong relation between measures of attitudes and be- 
havior, the measures must be compatible (or congruent) in 
terms of their specificity: Measures of general attitudes (to- 
ward objects) predict general or broad behavior measures 
(encompassing the class of relevant behaviors, also called 
multiple act behavioral criteria), whereas measures of spe- 
cific attitudes (toward behaviors) predict specific behavior 
measures (the single, focal behavior). Single behaviors can 
be specified along four dimensions: action (e.g., giving 
money), target (e.g., to a homeless person), context (e.g., on 
the street), and time (e.g., at lunchtime today). To predict 
single behaviors maximally, the measure of attitude should 
correspond on as many dimensions of specification as possi- 
ble. For example, a measure of the individual’s attitude to- 
ward giving money to a homeless person on the street at 
lunchtime today would be the best predictor of this specific 
behavior, whereas measures of attitudes that corresponded 
only on the action dimension (attitudes toward giving 
money) or only on the target dimension (attitudes toward 
homeless people) would rarely yield strong correlations. 
Many early researchers inappropriately used general attitude 
measures (e.g., participants’ attitudes toward an ethnic 
group) to try to predict specific behavior measures (e.g., how 
participants behaved toward a particular member of the eth- 
nic group in a particular setting at a particular time). When 
measures of attitudes and behavior have been highly com- 
patible in terms of their specificity, attitude-behavior correla- 
tions have been substantial (see Ajzen & Fishbein, 1977; 
Kraus, 1995). 

The impact of another kind of compatibility on attitude- 
behavior consistency was investigated by Lord, Lepper, and 
Mackie (1984). These researchers proposed a “typicality 
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effect,” such that attitudes toward a social group would pre- 
dict individuals’ behavior toward typical members of the 
group better than the same attitudes would predict behavior 
toward atypical members of the group. Results showed that 
individuals’ attitudes toward gay men predicted how they be- 
haved toward a gay man who closely matched the stereotype 
better than the same attitudes predicted how they behaved 
toward a gay man who differed substantially from the stereo- 
typical image of gay men. Thus, compatibility between group 
stereotypes and individual group members influences 
whether attitudes toward the group predict behavior toward 
those individual members (see also Blessum, Lord, & Sia, 
1998). 


Nature of the Behavior 


Certain kinds of behavior are more predictable from attitudes 
than are other kinds of behavior. In particular, attitudes are 
hypothesized to guide only volitional actions—behaviors that 
individuals are free to perform or to not perform. When 
strong external incentives or constraints exist regarding an 
action, attitudes may not play much role in determining be- 
havior. For example, politeness norms may cause people to 
say hello to coworkers whom they dislike. This conceptual 
point—that social pressures often guide behavior—was rec- 
ognized in the theory of reasoned action (Fishbein & Ajzen, 
1975) by including subjective norms as a determinant of be- 
havioral intentions that was distinct from attitudes. Subjective 
norms refer to individuals’ perceptions that other people who 
are important to them want them to act in certain ways. 

Researchers have identified several factors, including the 
nature of the behavior, that influence the degree of impact 
that attitudes and norms exert on behavior. For example, 
Ybarra and Trafimow (1998) showed that increasing the ac- 
cessibility of individuals’ private self cognitions (i.e., assess- 
ments of the self by the self) led participants to place more 
weight on attitudes than on perceived norms in behavioral 
choices, whereas increasing the accessibility of individuals’ 
collective self cognitions (assessments of the self by other 
people and reference groups) led participants to place more 
weight on perceived norms than on attitudes in behavioral 
choices. Presumably, these findings reflected that attitudes 
derive from personal preferences, whereas norms derive 
from other people. 

Ajzen (1985, 1991) proposed a revision to the theory of 
reasoned action, which he labeled the theory of planned 
behavior (for a review, see Conner & Armitage, 1998). This 
model includes perceived behavioral control as another 
determinant of intentions and behavior, distinct from both 
attitudes and subjective norms. The construct of perceived 
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behavioral control extends the model to behavior that is not 
fully under volitional control; for example, individuals who 
believe that they cannot easily perform a behavior might not 
do it even if they have a positive attitude toward the behavior, 
perceive that other people want them to perform it, or both 
(see also Gollwitzer & Moskowitz, 1996). Thus, the nature of 
the behavior—specifically, its controllability or difficulty— 
influences the strength of the attitude-behavior relation. 


Nature of the Attitude 


Characteristics of the attitude also influence the strength 
of the attitude-behavior relation, a point that we noted earlier 
in this chapter. One of the first attitude qualities to be studied 
in this regard was direct versus indirect experience: Attitudes 
that are based on direct experience with the attitude object 
predict behavior better than do attitudes that are based on in- 
direct experience (see Fazio & Zanna, 1978, 1981). Presum- 
ably, these findings reflected that attitudes based on direct 
experience are stronger—more confidently held, more stable, 
and so on—than are attitudes based on indirect experience. 
Indeed, in a meta-analysis of the attitude-behavior consis- 
tency literature, Kraus (1995) concluded that such attitudinal 
qualities as direct experience, certainty, and stability pre- 
dicted the strength of the attitude-behavior relation. 

Fazio (1990) has proposed that the effects of direct expe- 
rience operate through another manifestation of attitude 
strength—namely, the accessibility of the attitude. As noted 
earlier, accessible attitudes are more likely to be associated 
with biased perceptions of stimuli (e.g., Houston & Fazio, 
1989; Schuette & Fazio, 1995). If accessible attitudes are 
more likely to be evoked spontaneously in the presence of the 
attitude object (e.g., Fazio et al., 1986) and to guide individ- 
uals’ perceptions of situations, then they seem likely to serve 
as the basis for action as well. In line with this reasoning, 
Fazio and Williams (1986) found that voters who reported 
their evaluations of candidates quickly were subsequently 
more likely to vote for their preferred candidate than were 
voters who reported their evaluations more slowly. In his 
meta-analysis of past studies, Kraus (1995) also found that 
attitude accessibility predicted attitude-behavior consistency. 

Another attitude characteristic that has been related to 
attitude-behavior consistency is ambivalence, which (as 
described in earlier sections) refers to inconsistency within 
or between the components of an attitude (e.g., affective- 
cognitive ambivalence involves oppositely valenced affect 
versus cognition). Ambivalent attitudes are generally less 
predictive of behavior than are nonambivalent attitudes (eg. 
Armitage & Conner, 2000), presumably because the conflict- 
ing elements may become differentially salient at various 


times or in various settings, thus inducing inconsistent ac- 
tions. In an interesting twist on this reasoning, however, 
Jonas et al. (1997) showed that encountering a new attitude 
object that has both positive and negative aspects can arouse 
attitudinal ambivalence, which in turn can cause individuals 
to process information systematically (because of uncer- 
tainty; see also Maio, Bell, et al., 1996—which in turn can 
actually produce higher attitude-behavior consistency. Thus, 
although ambivalence in existing attitudes may serve to re- 
duce attitude-behavior consistency, ambivalence in newly 
forming attitudes may have the opposite effect. 

Lavine et al. (1998) showed that when attitudes were high 
in affective-cognitive ambivalence, the affective component 
predicted behavior better than did the cognitive component. 
In contrast, for nonambivalent participants, the affective and 
cognitive components of attitudes were equally predictive of 
behavior. MacDonald and Zanna (1998) found that manipu- 
lations of evaluative priming had a significant effect on the 
behavioral intentions of attitudinally ambivalent individuals 
but did not affect the intentions of nonambivalent individu- 
als. Consistent with the studies described in the preceding 
paragraph, these data indicate that ambivalent attitudes yield 
greater behavioral variability across time and settings than do 
nonambivalent attitudes. 


Personality Variables 


Finally, some people may behave in accordance with their at- 
titudes to a greater extent than do other people—that is, col- 
lapsing across attitude-behavior domains (hence, ignoring 
the nature of the attitude and the nature of the behavior), 
personality variables might predict the strength of attitude- 
behavior consistency. 

The variable that has received the most attention in this re- 
gard is self-monitoring (Snyder, 1987). Self-monitoring re- 
flects the extent to which people base their behavioral choices 
on internal versus external cues. Low self-monitors rely on 
internal cues to guide their behavior, whereas high self- 
monitors use external, situational cues as guides to action. 
Given that attitudes are an internal construct, low self- 
monitors should exhibit stronger attitude-behavior consis- 
tency than do high self-monitors. Several researchers have 
produced data in support of this prediction (e.g., Snyder & 
Kendzierski, 1982; Zanna, Olson, & Fazio, 1980). Ajzen, 
Timko, and White (1982) obtained evidence that the source 
of the heightened attitude-behavior consistency for low self- 
monitors was that these individuals were more likely to fol- 
low through on their behavioral intentions than were high 
self-monitors. Presumably, high self-monitors are easily di- 
verted from their intended courses of action by unanticipated 


situational demands. Kardes, Sanbonmatsu, Voss, and Fazio 
(1986) found that the attitudes of low self-monitors were 
more accessible than were the attitudes of high self-monitors, 
perhaps because low self-monitors think about their attitudes 
more often than do high self-monitors. As noted earlier, atti- 
tudes high in accessibility predict behavior better than do 
attitudes low in accessibility (e.g., Fazio & Williams, 1986). 
Thus, there are probably multiple determinants of the differ- 
ences in attitude-behavior consistency between low and high 
self-monitors. 

Private self-consciousness (Fenigstein, Scheier, & Buss, 
1975) is another variable that has been examined in the 
attitude-behavior consistency literature. Private self-con- 
sciousness reflects the extent to which individuals are aware of 
their internal states (e.g., moods, values, and attitudes); it 
corresponds to stable individual differences in the state of 
objective self-awareness or self-focused attention (Duval & 
Wicklund, 1972). If awareness of one’s attitudes increases the 
likelihood of attitude-consistent behavior (which seems 
likely), then individuals who are high on this dimension 
should exhibit stronger attitude-behavior correlations than 
should those who are low in private self-consciousness. This 
result has in fact been obtained (Scheier, Buss, & Buss, 1978; 
Wicklund, 1982). 


Composite Model of Attitude-Behavior Consistency 


As the preceding sections have indicated, social psycholo- 
gists have made significant advances in the understanding of 
when and how attitudes predict behavior. Distinctions be- 
tween types of attitudes, types of behaviors, and personality 
subgroups have all helped to clarify the relation between 
measures of attitudes and measures of behavior. 

Numerous models have been proposed to account for 
attitude-behavior consistency, such as the theory of rea- 
soned action (Fishbein & Ajzen, 1975), the theory of planned 
behavior (Ajzen, 1985), and the MODE (motivation and 
opportunity as determinants of how attitudes influence be- 
havior) model (Fazio, 1990). Based on a comprehensive 
review of the literature and building on these previously 
proposed models, Eagly and Chaiken (1993, 1998) devel- 
oped a composite model of attitude-behavior consistency. 
The model parallels the theory of reasoned action by propos- 
ing that attitudes toward behaviors predict intentions, which 
in turn predict behaviors. Working back from attitudes 
toward behaviors, however, the model identifies five factors 
that influence attitudes toward behaviors: habits (past behav- 
ior), attitudes toward targets (especially the target of the be- 
havior), utilitarian outcomes (rewards and punishments that 
are expected to follow from engaging in the behavior), 
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normative outcomes (approval and disapproval from others 
that are expected to follow from engaging in the behavior), 
and self-identity outcomes (implications of engaging in the 
behavior for the self-concept). Some of these factors are also 
hypothesized to influence either intentions or behavior di- 
rectly; for example, habits are proposed to affect behavior 
directly (i.e., not via attitudes toward the behavior). 

Eagly and Chaiken’s model is unique in its inclusion of 
both attitudes toward behaviors and attitudes toward targets 
as predictors of specific actions. The inclusion of habits is 
also noteworthy; many researchers have found that past 
behavior predicts future behavior even when attitudes and 
norms are held constant (see Ouellette & Wood, 1998). Fi- 
nally, the categorization of expected outcomes into utilitarian, 
normative, and self-identity classes extends previous models. 
Although this model has not yet been exposed to direct em- 
pirical tests, it provides a heuristically useful framework for 
future research. 


Applications to Social Behavior 


We have reviewed various theories of attitude-behavior con- 
sistency and outlined the conditions under which strong rela- 
tions between attitudes and behavior can be expected. The 
title of this chapter is “Attitudes in Social Behavior,” so we 
close with the consideration of some of the important social 
behaviors to which the concept of attitude can be applied. In 
each case, data support the hypothesis that attitudes facilitate 
attitude-consistent behaviors. 

For example, this Handbook contains several chapters that 
rely heavily on attitudes to understand social behavior. The 
chapter on prejudice, racism, and discrimination (see the 
chapter by Dion in this volume) explores interpersonal and 
intergroup settings in which negative attitudes toward an out- 
group (prejudice) can cause conflict and violence. Prejudice 
is one of the oldest topics in social psychology; it continues 
to be a vibrant research area today—trecent attention has ex- 
panded to incorporate the consequences of being a target of 
prejudice (e.g., impaired performance on intellectual tasks 
caused by fear of confirming a stereotype; Spencer, Steele, & 
Quinn, 1999; Steele & Aronson, 1995). Altruism (see the 
chapter by Batson & Powell in this volume) is another do- 
main in which attitudes are important. Positive evaluations of 
potential recipients of assistance serve to motivate prosocial 
actions (e.g., Goodstadt, 1971). Similarly, attitudes influence 
individuals’ responses to situations involving justice consid- 
erations (see the chapter by Montada in this volume). For 
example, individuals are more likely to tolerate a situation in 
which distributive or procedural justice was violated when 
they have positive attitudes toward the responsible authority 
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(see Tyler & Smith, 1998). Family and close relationships 
(see the chapter by Clark & Grote in this volume) are built 
upon positive attitudes, encompassing such concepts as love, 
trust, caring, and intimacy. Positive attitudes encourage good 
communication, which is the basis of effective relationships. 
Research on consumer behavior is one of the clearest ex- 
amples of the application of social psychological findings on 
attitudes (e.g., Cialdini, 1993; Reardon, 1991), with topics 
ranging from advertising to purchasing behavior. Finally, re- 
search on expectancies (see Olson, Roese, & Zanna, 1996) 
has shown that expectancies influence information process- 
ing and behavior. One of the principal sources of expectan- 
cies is attitudes—we expect good things from positively 
evaluated objects and bad things from negatively evaluated 
objects. These attitude-induced expectancies can lead to er- 
rors in information processing, biased hypothesis-testing, and 
self-fulfilling prophesies (see Olson et al., 1996). 


CONCLUSIONS 


Many issues and questions must be addressed in future re- 
search on attitudes in social behavior. One important issue is 
the internal structure of attitudes, including the dimensional- 
ity of attitudes and the conditions under which different com- 
ponents of attitudes are more influential than are other 
components. A related issue is the distinction between implicit 
attitudes and explicit attitudes (or between implicit and ex- 
plicit measures of attitudes), including the question of which 
sorts of behavior are best predicted by each type of attitude 
(measure). The connections between attitudes and broader 
constructs like values and ideologies also need to be clarified. 
Turning to a different domain, the role of biological factors in 
attitude formation and change seems likely to receive more at- 
tention over the next decade. Finally, the connection between 
attitudes and behavior will continue to interest social psychol- 
ogists, with models of attitude-behavior consistency becom- 
ing increasingly complex. For example, prediction may be 
improved by simultaneously taking into account attitudes to- 
ward all of the different behavioral options in a setting. 

In closing, the evidence described in this chapter supports 
the importance of the construct of attitude. Because of their 
broad evaluative nature, attitudes may potentially reflect di- 
verse beliefs, feelings, and behaviors. In addition, these eval- 
uations serve a number of attitude functions and vary on 
several characteristics (e.g., ambivalence, certainty). Most 
important is that attitudes influence a wide variety of impor- 
tant social behaviors. Indeed, no matter what the setting, per- 
sonal evaluations play a role in information processing and in 
behavior. The obsession of popular culture with the concept 


of attitude, noted at the outset of this chapter, is comprehen- 
sible when the ubiquity of attitudes is recognized. 
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It is difficult to think about the self without referring to other 
people. Although the very concept of the self seems to denote 
individualism, the self is nevertheless incomplete without 
acknowledging our interactions with others. People often 
describe themselves in terms of relationships (husband, son, 
mother) or as a member of a profession (and thus as a member 
of a social group). Even personality traits are usually concep- 
tualized in comparison to other people (one is not extraverted 
per se, but extraverted compared to others). Self-esteem re- 
flects what others think (Leary, Tambor, Terdal, & Downs, 
1995). Attempts at self-control can benefit or harm others 
(e.g., smoking and drinking; Baumeister, Heatherton, & Tice, 
1994). People’s behavior can be radically affected by social 
rejection or exclusion (Twenge, Baumeister, Tice, & Stucke, 
2001; K. D. Williams, Cheung, & Choi, 2000). Selves do 
not develop and flourish in isolation. People learn who and 
what they are from other people, and they always have identi- 
ties as members of social groups. By the same token, close 
personal relationships are potent and probably crucial to the 
development of selfhood. A human being who spent his or 
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her entire life in social isolation would have a stunted and 
deficient self. 

In addition, the self is inherently interpersonal because 
relating to others is part of what the self is for. The self is con- 
structed, used, altered, and maintained as a way of connect- 
ing the individual organism to other members of its species. 
By this we are not positing a mysterious homunculus that cre- 
ates the self to serve its own purposes. Instead, we begin by 
acknowledging that the need to belong is a fundamental 
human need that serves the innate biological goals of survival 
and reproduction (see Baumeister & Leary, 1995), and so 
psychological mechanisms such as the self are likely to be 
shaped to foster interpersonal connection. The biological 
evolution of the species presumably established the cognitive 
and motivational basis of self, and the experiences of the in- 
dividual within an immediate social context builds on these 
bases to shape the self in ways that lead to establishing and 
maintaining some important social bonds. If no one likes you, 
the odds are that you will start asking “What’s wrong with 
me?’—and making changes to the self when you reach some 
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answers. In this chapter, we will explore how individual 
selves affect others and how others affect individual selves. 

The interpersonal self is one of three major facets of the 
self (Baumeister, 1998). The other two main aspects are the 
experience of reflexive consciousness, which involves being 
aware of oneself and constructing knowledge structures (in- 
cluding self-concept and self-esteem) about the self, and the 
executive function, which controls the decisions and actions 
of the self. As argued previously, the social self provides a 
crucial piece of this puzzle. 


BELONGINGNESS, SOCIAL EXCLUSION, 
AND OSTRACISM 


Theoretical Background 


Meaningful human relationships are a crucial part of the self. 
Baumeister and Leary (1995) have proposed that the need to 
belong is one of the most fundamental human motivations, 
underlying many emotions, actions, and decisions through- 
out life. Belongingness theory predicts that people seek to 
have close and meaningful relationships with others, perhaps 
because such relationships increase the likelihood of sur- 
vival and reproduction (Shaver, Hazan, & Bradshaw, 1988). 
Social exclusion may have hampered reproductive success; 
it is difficult to find a mate when one is isolated from others 
or devalued by others. Likewise, social exclusion probably 
lowered chances of survival during hunter-gatherer times 
due to lack of food sharing, the difficulty of hunting alone, 
and lack of protection from animal and human enemies 
(e.g., Ainsworth, 1989; Hogan, Jones, & Cheek, 1985; 
Moreland, 1987). 

Several motivational and cognitive patterns support the 
view that people are innately oriented toward interpersonal 
belongingness (see Baumeister & Leary, 1995, for review). 
People form relationships readily and with minimal external 
impetus. They are reluctant to break off a relationship even 
when its practical purpose has ended. They also seem to cat- 
egorize others based on their relationships. In general, hu- 
mans are social animals, and people seek relationships with 
others as a fundamental need. What happens, however, when 
this need is not met—when people feel disconnected from 
social groups and lonely from a lack of close relationships? 
That is, how does the lack or loss of interpersonal relation- 
ships affect the self and behavior? 

Previous research suggests that social exclusion is corre- 
lated with a variety of negative circumstances, including 
poor physical and mental health (Bloom, White, & Asher, 
1979; D. R. Williams, Takeuchi, & Adair, 1992), crime and 


antisocial behavior (Sampson & Laub, 1993), alcohol and drug 
abuse (D. R. Williams et al., 1992), and even reckless driving 
(Harano, Peck, & McBride, 1975; Harrington & McBride, 
1970; Richman, 1985). People who are ostracized by others re- 
port negative emotions and a feeling of losing control (K. D. 
Williams et al., 2000). In general, social exclusion leads to 
negative emotional experiences such as anxiety, depression, 
loneliness, and feelings of isolation (Baumeister & Leary, 
1995; Baumeister & Tice, 1990; Gardner, Pickett, & Brewer, 
2000). Leary et al. (1995) showed that social rejection leads 
to considerable decreases in feelings of self-esteem. Their so- 
ciometer theory posits that self-esteem is primarily a measure 
of the health of social relationships. That is, high self-esteem 
comes from believing that other people will want to spend time 
with you and maintain long-term relationships with you. Low 
self-esteem arises when people experience rejection or fear 
that they will end up alone in life. 

Conversely, fulfilled belongingness needs seem to serve as 
an inoculation against negative outcomes and a predictor of 
positive ones. An influential review by Cohen and Wills 
(1985) concluded that high social support is correlated with 
lower self-reports of anxiety and depression. Baumeister 
(1991) and Myers (1992) both reviewed the empirical litera- 
ture on happiness and concluded that the strongest predictor 
of happiness was social connectedness. People who are rela- 
tively alone in the world are much less happy than people 
who have close connections with others. All other objective 
predictors of happiness, including money, education, health, 
and place of residence, are only weakly correlated with hap- 
piness. The importance of social ties for positive life out- 
comes suggests that social connection carries considerable 
explanatory power. Social exclusion may be connected to 
many of the personal and social problems that trouble mod- 
ern citizens, including aggression and lack of prosocial be- 
havior. In addition, it may be linked to many self-defeating 
behaviors (such as overeating and taking excessive risks). 
Last, social exclusion may cause cognitive impairment. 


Aggressive Behavior and Prosocial Behavior 


During the late 1990s, a series of shootings occurred at 
American schools, leading to the deaths of a number of 
young people and the serious injury of many others. In almost 
every case, the perpetrators were boys who felt rejected by 
their peers (Leary, 2000). Apparently these young men re- 
sponded to this rejection with violence, walking into their 
schools with guns and shooting their fellow students. These 
tragedies were consistent with several broader patterns of 
correlation between antisocial, violent behavior and lack of 
social connections. Garbarino’s (1999) studies confirmed that 


many perpetrators of violence are young men who feel 
rejected from family and peer groups (see also Leary, 2000; 
Walsh, Beyer, & Petee, 1987). 

Prior research provides partial support for a connection be- 
tween social exclusion and aggressive behavior. Rejected 
children are more physically aggressive and more disruptive, 
and issue more verbal threats than other children (Coie, 1990; 
Newcomb, Bukowski, & Pattee, 1993). Compared to married 
men, single men are more likely to speed and drive recklessly, 
two antisocial behaviors that can lead to injury and death 
(Harano et al., 1975; Harrington & McBride, 1970). Marital 
status also correlates with criminal behavior. Stable relation- 
ships in adulthood (especially good marriages) are connected 
to lower incidence of crime and delinquency (Sampson & 
Laub, 1990, 1993). On the other hand, Wright and Wright 
(1992) found no link between criminality and marital status in 
itself. Apparently only a happy (or reasonably happy) mar- 
riage is incompatible with criminal behavior. 

However, these findings are correlational, so the direction 
of causation is not clear. For example, men with criminal ten- 
dencies may be less likely to find someone to marry. Children 
who are aggressive are not likely to keep friends. Even third- 
variable causal explanations are plausible. For example, per- 
haps lack of money makes poor men both more prone to 
criminal activity and less desirable as potential husbands. 

In order to determine the direction of causation between 
social exclusion and aggressive behavior, we performed a 
series of experimental studies (Twenge, Baumeister, et al., 
2001). We manipulated social exclusion either by false feed- 
back on a personality test (in the crucial condition, partici- 
pants heard they would end up alone later in life) or by peer 
rejection (participants heard either that everyone or no one in 
a group of their peers chose them as a desirable partner for 
further interaction). Consistent across several studies, re- 
jected participants were more aggressive toward other peo- 
ple. First, rejected participants issued negative written 
evaluations of a target when the target had insulted them. Re- 
jected participants also chose to blast the target with higher 
levels of stressful, aversive noise during a reaction time game 
after a target issued an insult. In the last study, however, 
the participant had no interaction (positive or negative) with 
the target. Even under these conditions, rejected participants 
were more aggressive toward the target. Thus rejected partic- 
ipants were willing to aggress more even against an innocent 
third party. 

In another series of studies, we examined the effect of so- 
cial exclusion on prosocial behavior (Twenge, Ciarocco, & 
Baumeister, 2001). Across five studies, socially excluded 
people were less prosocial than others. They donated less 
money to a student fund, were less willing to volunteer for 
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more experiments, were less helpful to the experimenter after 
a mishap, and were less cooperative in a prisoner’s dilemma 
game. This effect held regardless of whether the prosocial be- 
havior involved a cost to the self, no cost or benefit to the self, 
or even a benefit to the self. Combined with the aggression 
studies, the implication of these findings is that social exclu- 
sion leads to a reduction in prosocial behavior and an in- 
crease in antisocial behavior. 

Self-reports of mood consistently failed to mediate the re- 
lationship between social exclusion and aggressive or proso- 
cial behavior. In addition, the effects were not due to simply 
hearing bad news. A misfortune control group heard that they 
would be accident prone in the future. This group demon- 
strated significantly less aggressive behavior and more proso- 
cial behavior compared to the social exclusion group. These 
manipulations of social exclusion are weak compared to real- 
life experiences such as romantic breakups or ostracism 
by friends. This makes it less surprising that rejections out- 
side the laboratory can sometimes lead to lethally violent 
reactions. 

These results linking exclusion to more antisocial behav- 
ior and less prosocial behavior are especially interesting 
given some previous studies. A recent paper (K. D. Williams 
et al., 2000) examined ostracism (being ignored by others) 
during an Internet ball-tossing game. Participants who were 
ostracized were subsequently more likely to conform to oth- 
ers’ judgments in a line-judging task. The authors suggest 
that the ostracized participants were thus more willing to 
make amends and conform in exchange for social accep- 
tance. A previous study also found that female participants 
who were ostracized socially compensated by working 
harder on a group task (K. D. Williams & Sommer, 1997). 
One interpretation of these results is that social exclusion 
leads to prosocial behavior—thus the opposite results to the 
Twenge et al. studies. However, there are several explana- 
tions for this discrepancy. First, the ostracized participants in 
the K. D. Williams et al. (2000) studies may have conformed 
out of passivity rather than out of a desire to rejoin the group. 
Another difference lies in motivation: the participants in the 
K. D. Williams et al. (2000) study and the Williams and 
Sommer (1997) study might have felt more confident that 
they could regain the favor of the group members in further 
interaction. In our studies, rejected participants were interact- 
ing with someone they did not expect to meet in person. This 
may have reduced their desire to act prosocially and encour- 
aged them to indulge their antisocial, aggressive impulses. In 
other words, they might have felt that there was no clear route 
back to social acceptance. 

Could it be that socially rejected people simply lose inter- 
est in connecting with others? There is some evidence against 
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this view. Gardner et al. (2000) presented participants with 
acceptance and rejection experiences. Rejected participants 
later demonstrated better memory for the social aspects and 
events in a diary they had read earlier. Thus the experience of 
rejection seems to make people focus on social events to a 
greater extent. 


Self-Defeating Behavior 


Psychologists have often been fascinated with self-defeating 
behavior because of its fundamental and paradoxical nature 
(for reviews, see Baumeister, 1997; Baumeister & Scher, 
1988). It seems irrational for people to act in ways that are ul- 
timately self-defeating. Why do people do things that bring 
them suffering, failure, and other misfortunes? A broad range 
of social problems (e.g., drug addiction, overeating, under- 
achievement, excessive risk-taking) can be regarded as self- 
defeating acts. Many of these problems are caused by failures 
of self-control or self-regulation (Baumeister, Heatherton, 
et al., 1994), which occur when people find it difficult to resist 
tempting impulses. In addition, a loss of self-control can lead 
to taking self-defeating risks (Leith & Baumeister, 1996), 
which in turn may cause undesirable outcomes such as poor 
health, drug and alcohol abuse, and harmful accidents. 
Self-control loss is also detrimental for relationships. 
Living together with other people requires some degree of 
accommodation and compromise, because the self-interest 
of the individual is sometimes in conflict with the best interests 
of the group. Sharing, showing humility, respecting the rights 
and property of others, and other socially desirable acts require 
some degree of self-control. Few people want to live with 
someone who continually exploits others, breaks promises, 
abuses drugs, lashes out in anger, and takes stupid risks. Hence 
people must use their self-control to curb these impulses, if 
they want to maintain good interpersonal relationships. 
Evidence from the sociological literature suggests that 
marriage (which is one important form of belongingness) in- 
oculates against many self-defeating behaviors. When com- 
pared to unmarried or divorced individuals, married people 
are less likely to abuse alcohol and drugs (D. R. Williams et al., 
1992). As mentioned earlier, married men are less likely to be 
arrested for speeding or reckless driving (Harrington & 
McBride, 1970) and are less likely to be involved in car acci- 
dents (Harano et al., 1975), especially in those related to 
alcohol (Richman, 1985). In one of the first works of modern 
sociology, Durkheim (1897/1951) found that suicide— 
perhaps the ultimate self-defeating behavior—was more com- 
mon among people who were unmarried or otherwise socially 
unconnected. These correlational studies suggest a relation- 
ship between belongingness and self-defeating behaviors, 


including loss of self-control and risk taking. As noted previ- 
ously, however, these studies are limited due to their correla- 
tional design and their exclusive focus on marriage. 

In addition, married people are often mentally and physi- 
cally healthier than single, divorced or widowed individuals. 
The correlation between marital status and health may have 
several causes. First, it is possible that spouses provide prac- 
tical support for health behaviors, such as by reminding their 
partners to keep physicians’ appointments, eat well, and ex- 
ercise regularly. The social interaction of a marital relation- 
ship may also directly increase mental health, which may 
increase physical health in turn. Third, and most relevant 
here, not being involved in a close relationship may encour- 
age risky, self-defeating behaviors. Just as single and di- 
vorced people are more likely to take risks while driving, 
they may also take more risks with their health. We have al- 
ready established that unmarried people are more likely to 
abuse alcohol and drugs. The same risk-taking, self-defeating 
tendency may also lead the unmarried to neglect their health 
by missing appointments, declining to seek health informa- 
tion, and taking a passive role toward health maintenance. It 
seems that many people feel that life is not worth living (or 
not as worth living) without close relationships. However, the 
causation may work the other way; it is certainly plausible 
that unhealthy people are not as likely to marry or have as 
many close social relationships. 

Like the previous evidence on antisocial behavior, the 
evidence on social exclusion and self-defeating acts is pri- 
marily correlational. We performed a series of experiments to 
determine the causal path between social exclusion and self- 
defeating behavior (Twenge, Catanese, & Baumeister, 2001). 
We manipulated social exclusion using the same methods 
employed in the research on aggressive and prosocial behav- 
iors (future prediction of a life devoid of social relationships, 
or rejection by peers). These experiments found that ex- 
cluded participants consistently displayed more self- 
defeating behavior. Compared to the other groups, excluded 
participants procrastinated longer, took irrational risks in a 
lottery choice, and made more unhealthy choices. These ef- 
fects were not mediated by mood, no matter how mood was 
measured (we used three different mood measures). The mis- 
fortune control group, who heard that they would be accident 
prone later in life, did not show significant increases in self- 
defeating behavior. Thus it appears to be specifically social 
exclusion that makes people self-destructive. 


Cognitive Impairment 


If mood does not mediate the relationship between social 
exclusion and negative outcomes, what does? One possibility 


is cognitive impairment. Social exclusion may impair the 
ability to reason effectively, and this in turn could lead to 
self-defeating behavior (which is usually a failure to ratio- 
nally consider the outcomes of one’s actions: Leith & 
Baumeister, 1996). Cognitive impairment could also lead to 
antisocial behavior, as socially excluded individuals may 
give in to aggressive impulses without considering the conse- 
quences. This decrease in the ability to reason may result 
from numbness or excessive rumination. 

Our research has found that social exclusion does reduce 
the ability to reason effectively (Baumeister, Twenge, & 
Nuss, 2001). Socially excluded participants obtained lower 
scores on a timed test of intelligence. In a reading compre- 
hension task, social exclusion led to impairments in the abil- 
ity to retrieve information. Participants read a passage under 
normal conditions, received the exclusion feedback, and 
were then asked to recall what they had read. Excluded par- 
ticipants did not answer as many questions correctly as com- 
pared with participants in the other conditions. However, 
their ability to store information was apparently intact. Be- 
cause the recall questions were difficult, the results could 
have been due to deficits in either recall or reasoning. We 
tested pure recall by asking participants to memorize a list of 
nonsense syllables. They then received the belongingness 
feedback and were asked to recall the syllables. Social exclu- 
sion did not affect the retrieval of simple information; how- 
ever, we found that it did affect reasoning. Participants were 
given a timed reasoning test (taken from a Graduate Record 
Exam analytical section). Those in the excluded condition 
answered fewer questions correctly than those in the other 
groups. Thus social exclusion does not affect the storage of 
information or the retrieval of simple information, but it does 
affect higher reasoning. 


Larger Social Trends in Belongingness 
and Negative Outcomes 


Social exclusion may be important for understanding recent 
changes in American society. Several authors have argued 
that the changes of the last 40 years have led to a society in 
which people lack stable relationships and feel disconnected 
from each other. Putnam (1995, 2000) found that Americans 
are now less likely to join community organizations and visit 
friends than they were in the 1950s and 1960s. The propor- 
tion of the population living alone has nearly doubled in re- 
cent decades, from 13% in 1960 to 25% in 1997 (U.S. Bureau 
of the Census, 1998). The substantially increased divorce 
rate, another indicator of unstable social relations, accounts 
for a large part of this change. The later age of first marriage 
has also contributed to the increase in living alone. At the 
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same time, violent crime has skyrocketed, property crime has 
increased, and people trust and help each other less than they 
once did (Fukuyama, 1999). 

This breakdown in relationships has occurred alongside 
several negative social trends. Depression rates (Klerman & 
Weissman, 1989; Lewinsohn, Rohde, Seeley, & Fischer, 
1993) and feelings of anxiety (Twenge, 2000) have increased 
markedly. The increase in anxiety is directly linked to 
decreases in social connectedness such as divorce rates, levels 
of trust, and the percentage of people living alone (Twenge). 
In addition, crime and antisocial behavior have increased; 
violent crime is more than 4 times as common as it was in 
1960 (6 times as common as in 1950). In fact, Lester (1994) 
found that statistics measuring social integration (divorce, 
marriage, and birth rates) were almost perfectly correlated 
with homicide rates when examined in a time-series analysis. 
Self-defeating behaviors have also escalated in the last few 
decades (see Baumeister, Heatherton, et al., 1994). Although 
it is notoriously difficult to prove which causal processes are 
operating at the macrosocial level in the complex world, we 
think that the declines in social integration and belongingness 
have contributed to the rise of negative social indicators and 
social problems. 


THE SELF AS AN INTERPERSONAL ACTOR 


Once people have social relationships, how do these relation- 
ships influence their selves, and vice versa? One reason peo- 
ple have selves is to facilitate interactions and relationships 
with others. For example, it is difficult to go out on a first date 
if one is in the middle of an identity crisis. Accordingly, Erik 
Erikson (1950, 1968) famously asserted that identity is a pre- 
requisite for intimacy. People must settle the problems of 
identity before they are developmentally ready for intimate 
relations. The sequence may not be that simple, because iden- 
tity and intimacy seem to develop together, but the link be- 
tween the two is hard to deny (Orlofksy, Marcia, & Lesser, 
1973; Tesch & Whitbourne, 1982). 

Identity is also constructed out of social roles. A series of 
cluster analyses by Deaux, Reid, Mizrahi, and Ethier (1995) re- 
vealed five main types of social identities: relationships (hus- 
band, sibling), vocational or avocational role (coin collector, 
teacher), political affiliation (Republican, feminist), stigma- 
tized identity (homeless person, fat person), and religion or 
ethnicity (Jewish, Hispanic). As products of the culture and 
society, roles again reveal the interpersonal dimension of 
selfhood. To fulfill a relationship-oriented role (such as mother 
or police officer), one must make the self fit a script that is 
collectively defined. Each person may interpret a given role 
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in a slightly different way, but the role is nonetheless under- 
stood by the social group and is a way of relating to others. 
Reflexive consciousness itself may depend partly on inter- 
personal contact. Sartre’s (1953) famous analysis of conscious- 
ness emphasized what he called “the look,” that is, the 
subjective experience of looking at someone else and knowing 
that that person is looking at you. The rise in adolescent self- 
consciousness and social awkwardness is in part a result of the 
increased cognitive ability to understand how one appears to 
others. Teenagers feel self-conscious because they are begin- 
ning to fully realize how they are being judged by other people. 
How do interpersonal interactions shape the self? The 
tabula rasa view of human nature holds that selves are the 
products of interpersonal relations. That is, people start off as 
blank slates, and experiences gradually produce the unique 
individuality of the complex adult self. Although such views 
are elegant and sometimes politically appealing, they may 
suggest too passive or simple a role of the self. The self plays 
an active role in how it is influenced by others. The broader 
issue is how selfhood is maintained in an interpersonal envi- 
ronment. For example, part of the self exists in other people’s 
minds; other people know about us and what we are like. 
Selfhood cannot be achieved or constructed in solitude. 


Self-Esteem and Interpersonal Relationships 


Self-esteem may be defined as a person’s evaluation of self. 
Thus, self-esteem is a value judgment based on self- 
knowledge. Because much self-knowledge concerns the 
person’s relations with others, it is not surprising that self- 
esteem is heavily influenced by interpersonal relationships. 


Sociometer Theory 


Leary et al. (1995) proposed that self-esteem is a sociometer: 
that is, an internal measure of how an individual is succeeding 
at social inclusion (see also Leary & Baumeister, 2000). In 
their experimental studies, participants are told that no one has 
chosen them as a partner for further interaction. This expe- 
rience causes a decline in state self-esteem. In contrast, being 
chosen by group members increases state self-esteem. Leary 
et al. (1995) compare self-esteem to a car’s gas gauge. The gas 
gauge itself does not affect the mechanical functioning of the 
car, but it serves a crucial function by showing the driver how 
much fuel is in the tank. Leary et al. (1995) suggest that human 
drivers are strongly motivated to keep their automobiles’ gas 
gauges from reading “empty,” because most people seek out 
relationships whenever they see the needle moving in that 
direction. Self-esteem lets people know when they need 
“refueling” in the form of human interaction. 


The sociometer theory is important for an interpersonal 
view of the self, because it takes one of the best-known and 
most prominent intrapsychic variables (self-esteem) and 
recasts it in interpersonal terms. Concern with self-esteem can 
easily seem like a private, inner matter. Itis easy to assume that 
self-esteem goes up and down in the person’s own inner world 
with only minimal connection to the environment, and that 
people accept or reject environmental input according their 
own choices (e.g., one can either be in denial about a problem, 
or acknowledge and deal with the problem). Yet the sociome- 
ter theory proposes that self-esteem is not purely personal but 
instead fundamentally relies on interpersonal connection. 

There is abundant evidence that people are consistently 
concerned with the need to form and maintain interpersonal 
connections (Baumeister & Leary, 1995), and so it seems 
quite likely that there would be a strong set of internal moni- 
tors (possibly including self-esteem) to help the person 
remain oriented toward that goal. The sociometer view can 
also readily explain why so much emotion is linked to self- 
esteem, because strong emotional responses are generally as- 
sociated with interpersonal relationships. In addition, people 
tend to derive their self-esteem from the same traits that lead 
to social acceptance (e.g., competence, likability, attractive- 
ness). When people feel socially anxious, however, self- 
esteem suffers. A review of multiple studies concluded that 
the average correlation between social anxiety and self- 
esteem is about —.50 (Leary & Kowalski, 1995). That is, 
there is a substantial and robust link between worrying about 
social rejection and having low self-esteem. 

Why, then, do people need self-esteem to register changes 
in social connection, when emotion seems to serve the same 
purpose? Leary and Baumeister (2000) argue that self-esteem 
registers long-term eligibility for relationships, rather than 
just responding to current events. Hence someone might have 
low self-esteem despite being socially connected—if, for ex- 
ample, she believed that she has managed to deceive people 
about her true self and personality. If people were to find out 
what she is really like, she thinks, they might abandon her. 
Conversely, someone might have high self-esteem despite 
having no close friends at the moment, because he might at- 
tribute this dearth of friendships to the situation or to the lack 
of suitable people. He might believe that he will have plenty 
of friends as soon as there are enough people around who can 
appreciate his good qualities. 

There are several possible objections to the sociome- 
ter view. It does seem that people can have high self-esteem 
without having any close relationship at that moment. There 
is also no direct and simple link between one’s immediate 
social status and self-esteem. Self-esteem seems more sta- 
ble than social-inclusion status. Shifting the emphasis from 


current relationships to perceived eligibility for such relation- 
ships is one way to address this problem, but more research is 
needed to verify whether that solution is correct. 


Social and Interpersonal Patterns 


Self-esteem is also associated with different patterns of social 
behavior. Indeed, such differences constituted one of the orig- 
inal sources of research interest in self-esteem. Janis (1954) 
hypothesized that people with low self-esteem are more easily 
persuaded than people with high self-esteem. One of the most 
influential and popular measures of self-esteem was devel- 
oped specifically for use in studies of attitude change (Janis & 
Field, 1959). This measure, usually known as the Janis-Field 
Feelings of Inadequacy scale, cemented the view that individ- 
uals with low self-esteem feel little self-confidence and are 
easily swayed by other people’s arguments. 

The view that low self-esteem is associated with greater 
persuasibility was supported in those early studies, and sub- 
sequent work built upon those studies to link low self- 
esteem to a broad range of susceptibility to influence and 
manipulation. A seminal review article by Brockner (1984) 
concluded that low self-esteem is marked by what he called 
“behavioral plasticity” —the idea that people with low self- 
esteem are broadly malleable and easily influenced by others. 
For example, anxiety-provoking stimuli produce stronger 
and more reliable effects in people with low self-esteem; 
their reactions are more influenced by the anxiety-provoking 
situation than are those of people with high self-esteem. 
People with low self-esteem also show stronger responses to 
expectancy effects and self-focus inductions. 

Self-esteem also effects choices between self-enhancement 
and self-protection. Many self-esteem differences occur more 
frequently (or only) in interpersonal situations, and self- 
esteem may be fundamentally tied toward self-presentational 
patterns (see Baumeister, Tice, & Hutton, 1989, for review). In 
general, people with high self-esteem are oriented toward self- 
enhancement, whereas people with low self-esteem tend to- 
ward self-protection. People with high self-esteem want to 
capitalize on their strengths and virtues and are willing to take 
chances in order to stand out in a positive way. On the other 
hand, people with low self-esteem want to remedy their defi- 
ciencies and seek to avoid standing out in a negative way. 

High self-esteem people’s tendency toward self- 
enhancement can sometimes make them less likable to others. 
After receiving a negative evaluation, people high in self- 
esteem emphasized their independence and separateness 
from others, whereas people with low self-esteem empha- 
sized their interdependence and connectedness with others 
(Vohs & Heatherton, 2001). These self-construals had direct 
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consequences for interpersonal perceptions. Interaction part- 
ners saw independent people as less likable and interdepen- 
dent people as more likable. Given the differences in behavior 
based on level of self-esteem, this meant that partners saw 
low self-esteem individuals as more likable than high self- 
esteem individuals. However, these differences occurred only 
after the individuals being perceived had received negative 
evaluations; presumably self-esteem moderates reactions to 
ego threat. 

The evidence reviewed thus far does not paint an entirely 
consistent picture of people with low self-esteem. On the one 
hand, people with low self-esteem seem to desire success, ac- 
ceptance, and approval, but on the other hand they seem 
skeptical about it and less willing to pursue it openly. Work 
by Brown (e.g., 1993) has addressed this conflict directly by 
proposing that people with low self-esteem suffer from a mo- 
tivational conflict. Brown and McGill (1989) found that pos- 
itive, pleasant life events had adverse effects on the physical 
health of people with low self-esteem; such people actually 
became ill when too many good things happened. In contrast, 
people with high self-esteem are healthier when life treats 
them well. It may be that positive events exceed the expecta- 
tions of people with low self-esteem. This may force them to 
revise their self-concepts in a positive direction, and these 
self-concept changes may be sufficiently stressful to make 
them sick. 


Social Identity Theory 


Another way that interpersonal relationships influence self- 
esteem is through group memberships. Social identity theory 
(e.g., Tajfel, 1982; Tajfel & Turner, 1979; Turner, 1982) 
argues that the self-concept contains both personal and social 
attributes. Self-esteem usually focuses on personal attributes, 
but group memberships are also important. A person will ex- 
perience higher self-esteem when his or her important social 
groups are valued and compare favorably to other groups (see 
also Rosenberg, 1979). Empirical research has confirmed this 
theory; collective self-esteem (feeling that one’s social groups 
are positive) is correlated with global personal self-esteem 
(Luhtanen & Crocker, 1992). This is particularly true for 
members of racial or ethnic minorities (Crocker, Luhtanen, 
Blaine, & Broadnax, 1994). This most likely occurs because 
minority group members identify more strongly with their 
ethnic groups, and these groups are obvious and salient to 
others. In addition, improving the status of the group tends to 
increase personal self-esteem. For example, favoring in- 
groups over out-groups in allocation of points or rewards can 
enhance self-esteem, even when the self does not personally 
benefit from those allocations (e.g., Lemyre & Smith, 1985; 
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Oakes & Turner, 1980). Thus self-esteem is not only per- 
sonal: It also includes a person’s evaluations of the groups to 
which he or she belongs. 


Is High Self-Esteem Always Good 


To place the findings about self-esteem in perspective, it is 
useful to ask how important and beneficial high self-esteem 
actually is. In America today, many people seem to believe 
that high self-esteem is extremely beneficial. The strong be- 
lief in the benefits of self-esteem is a major reason it remains 
a popular topic of discussion and research. By one count, 
there are almost 7,000 books and articles about self-esteem 
(Mruk, 1995). The belief that high self-esteem is a vital as- 
pect of mental health and good adjustment is strong and 
widespread (e.g., Bednar, Wells, & Peterson, 1989; Mruk, 
1995; Taylor & Brown, 1988). In many studies, in fact, self- 
esteem is measured as an index of good adjustment, so that 
even the operational definition of healthy functioning in- 
volves self-esteem (e.g., Kahle, Kulka, & Klingel, 1980; 
Whitley, 1983). 

However, there is a “dark side” to high self-esteem, espe- 
cially concerning interactions with others. In one study, peo- 
ple with high self-esteem were more likely than most people to 
aggress against others and to be interpersonally violent 
(Baumeister, Smart, & Boden, 1996). Aggression seems to be 
most common among people who think well of themselves but 
then interact with someone who disputes their favorable self- 
appraisal. In particular, inflated, unrealistic, or fluctuating 
forms of high self-esteem predict outbursts of violence and ag- 
gression. This most likely occurs because these types of self- 
esteem are the most vulnerable to ego threats (e.g., Blaine & 
Crocker, 1993; Kernis, Granneman, & Barclay, 1989). People 
appear to lash out at others who criticize them as a way of 
avoiding any decrease in their self-esteem and the accompa- 
nying negative emotion (especially shame; see Tangney, 
Wagner, Fletcher, & Gramzow, 1992). Normally, people with 
high self-esteem do not seem defensive, but that may be 
because they usually think highly of themselves and expect to 
succeed at most things. When they do fail or are rejected, they 
are very surprised and thus may respond dramatically. 

Inflated self-esteem also predicts social maladjustment. In 
one study, researchers compared people’s self-descriptions 
with the descriptions of their friends (Colvin, Block, & 
Funder, 1995). This identified a group of people who thought 
more highly of themselves than warranted by the opinions of 
their friends. When followed over time in a longitudinal de- 
sign, these self-enhancing people displayed poor social skills 
and decreased psychological adjustment. In a laboratory 


study, the people in this group tended to express hostility, 
interrupt others, be socially awkward, irritate others, talk at 
people instead of talking with them, and perform a variety of 
other negatively evaluated behaviors. The composite picture 
is one of a self-centered, conceited person who lacks genuine 
regard for others. This picture is quite consistent with the lit- 
eral meaning of high self-esteem, even though it does not fit 
the popular stereotype. 


Narcissism and Interpersonal Relationships 


Another individual difference likely to affect interpersonal 
relationships is narcissism, usually defined as an exaggerated 
view of one’s importance, influence, and entitlements. People 
high in self-esteem are more likely to be high in narcissism, 
although the correlation is low to moderate rather than high. 
The imperfect correlation probably reflects the fact that 
high self-esteem is a very heterogeneous category, including 
plenty of arrogant, narcissistic people as well as others who 
simply accept themselves without assuming they are superior 
to others. Put another way, narcissism is a subcategory of 
high self-esteem; very few people score high in narcissism 
but low in self-esteem. 

Generally, narcissists tend not only to feel good about 
themselves, but also to expect deference and recognition from 
others. Thus, in some ways narcissism is more interpersonally 
relevant than self-esteem. Campbell (1999) found that narcis- 
sists were more interpersonally attracted to highly positive 
and highly admiring individuals. Narcissists were less at- 
tracted to people who offered greater amounts of emotional 
intimacy. This occurred because narcissists preferred partners 
who were more self-oriented rather than other-oriented, 
as part of a strategy to enhance self-esteem. Thus, narcissists 
found it more important to be with someone who made them 
look good rather than to be with someone who truly cared for 
them. This overall strategy for self-enhancement is linked to 
a noncaring, nonintimate experience of interpersonal rela- 
tionships in general. Compared to non-narcissists, narcissists 
report lower levels of empathy (Watson, Grisham, Trotter, & 
Biderman, 1984), intimacy (Carroll, 1987), communion 
(Bradlee & Emmons, 1992), caring (Campbell, 1999), and 
selflessness (Campbell, Foster, & Finkel, 2000). It seems that 
a narcissist’s first question in a relationship is “What can you 
do for me?” 

Narcissists also tend to react badly when they are criti- 
cized or challenged by others. In laboratory experiments, 
Bushman and Baumeister (1998) found that narcissists were 
considerably more aggressive toward someone who had in- 
sulted them, as compared to non-narcissists. When the 


researchers controlled for narcissism statistically, self-esteem 
did not predict aggressive behavior. Thus it appears that 
narcissism is the better predictor of interpersonal hostility. 
This fits the view that aggression comes from only a subset of 
people with high self-esteem, while other people with high 
self-esteem are not aggressive. 

Other research has found that narcissists are willing to 
derogate others after receiving threatening feedback (e.g., 
Kernis & Sun, 1994). They react with hostility, denigration, 
and aggression when they feel threatened (Rhodewalt & 
Morf, 1998). In fact, a recent study found that men incarcer- 
ated in prisons scored significantly higher in narcissism than 
samples of male college students (Bushman, Baumeister, 
Phillips, & Gilligan, 2001). Levels of self-esteem, however, 
did not differ between the two groups. Thus narcissists tend 
to be more personally sensitive to criticism, but insensitive to 
how their behavior affects others. Like the research on self- 
esteem presented earlier, these results suggest that inflated 
self-views can often lead to poor consequences for interper- 
sonal relationships. 


Reflected Appraisals 


The reflected appraisals model suggests that people learn 
about themselves by interacting with others. People find out 
what other people think of them and then internalize these 
opinions into their self-views. In addition, information about 
the self often is meaningful only in comparison to others, as 
social comparison theory emphasizes. One is only fat or thin, 
intelligent or stupid, friendly or hostile in comparison to 
other people. In these cases and many others, self-knowledge 
can grow only when people make these implicit comparisons. 
Much of reflected appraisals theory stems from symbolic in- 
teractionism (e.g., Mead, 1934). Mead’s theory argues that 
most self-knowledge comes from social interactions. The 
process of reflected appraisals (i.e., how other people’s ap- 
praisals of you shape your self-understanding) is often de- 
scribed with Cooley’s (1902) term the looking-glass self 
Using an antiquated term for a mirror, the looking-glass self 
posits that other people provide the mirror through which in- 
dividuals see and understand themselves. 

Cooley (1902) argued that the self-concept consists of 
“the imagination of our appearance to the other person; the 
imagination of his judgment of that appearance, and some 
sort of self-feeling, such as pride or mortification” (p. 184). 
Thus our self-esteem is also heavily influenced by what 
others think of us. Mead (1934) elaborated on this notion by 
suggesting that the self is also shaped by our vision of how 
a generalized other perceives us. The generalized other is 
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basically the person’s whole sociocultural environment. If a 
society has a negative view of children at a given time, for 
example, children are likely to internalize this negative view 
of the generalized other. 

An influential literature review by Shrauger and 
Schoeneman (1979) concluded that symbolic interactionism 
was partially supported by data. The review gathered data 
comparing self-concepts with the views of others. Although 
these correlations were positive, they were rather small. Sub- 
sequent studies have confirmed that symbolic interaction ef- 
fects are significant but small (Edwards & Klockars, 1981; 
Malloy & Albright, 1990). Even some of these weak links 
can be questioned on methodological grounds, as noted by 
Felson (1989). 

On the other hand, Shrauger and Shoeneman (1979) found 
that self-concepts were highly correlated with how people be- 
lieved that others perceived them (and subsequent work has 
replicated this conclusion). Therefore, there is a meaningful 
link between self-perceptions and other-perceptions (although 
the causal direction is unclear and probably bidirectional). The 
discrepancy arises between how people actually perceive Bob 
and how Bob thinks other people perceive him—but Bob’s 
view of himself is quite similar to how he thinks others see 
him. Thus others do shape self-views, even though people are 
not always accurate about how others perceive them. 

There seem to be two major reasons for these inaccuracies 
(see Felson, 1989). First, people do not generally tell some- 
one precisely what they think of him or her. The exchange of 
interpersonal evaluations is highly distorted. People do not 
want to offend or distress someone by an honest, negative 
evaluation, and they are often afraid that the person they crit- 
icize will no longer like them. (This is a legitimate fear; most 
humans tend to like people who like them, and distrust those 
who criticize them.) When refusing a date, for example, 
people tend to give false and misleading explanations, often 
resulting in their being unable to discourage further invita- 
tions from the same person (e.g., Folkes, 1982—although 
some of these explanations have become so popular that they 
are now more easily understood as a genuine brush-off: “It’s 
not you, it’s me.” Translated: “It’s totally you. You are the big 
problem. I’m fine.”). Even when people are engaging in 
deliberate self-presentation, they are not very accurate at 
estimating the impression they actually make on others (e.g., 
DePaulo, Kenny, Hoover, Webb, & Oliver, 1987). Given the 
dearth of honest and precise negative feedback from others, it 
is not surprising that people’s self-views remain blissfully 
unaffected by those concealed opinions and appraisals. 

The other source of distortion is self-deception. Often, 
people do not accept information directly into their views of 
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themselves. Instead, they filter it, bias it, and adapt it to fit in 
with what they already believe and what they prefer to 
believe. Hence, even if others do tell Bob exactly what they 
think of him, he may discount or ignore the unwelcome 
parts of the message. Some authors have argued that a degree 
of optimistic self-deception is necessary for psychological 
adjustment (Taylor & Brown, 1988). 


Influence of Others’ Expectancies 


As discussed in the previous subsection, it seems that people 
do not directly internalize other’s opinions of them. How- 
ever, people might still change their behavior and beliefs 
according to other people’s expectations. For example, 
Rosenthal and Jacobson (1968) provided a demonstration of 
the effect of the self-fulfilling prophecy in a study that has be- 
come a classic. Teachers were told that certain students were 
about to experience a leap forward in intelligence and acade- 
mic success. Although this expectancy was not true (in fact, 
the supposedly newly intelligent students were chosen at ran- 
dom), the chosen students nevertheless showed increases in 
academic performance. With new faith in these students’ 
abilities, the teachers presumably provided more encourage- 
ment of the students and expected more of them. These ex- 
pectancies were enough to produce results, even though they 
originated from outside the students. 

Do self-concepts change in response to others’ expectan- 
cies? Darley and Fazio (1980) argued that a self-fulfilling 
prophecy can produce three different types of change: change 
in the perceiver’s final belief, in the target’s actual behavior, 
or in the target’s self-appraisal. Out of the three, the evidence 
for the last (the target’s self-appraisal) was the weakest. Thus, 
perceivers see that the target changes his or her behavior and 
believe that their expectancies are confirmed. However, tar- 
gets do not usually come to share the perceivers’ initially 
false belief about themselves. 

One of the most widely cited studies of self-fulfilling 
prophecies was performed by Snyder, Tanke, and Berscheid 
(1977). In their study, male subjects each saw a bogus 
photograph of a female interaction partner and then had a 
telephone conversation with a woman they believed was 
the woman in the photograph they saw; actually, the 
photographs were varied randomly. The men who saw a photo- 
graph of an attractive woman perceived their telephone part- 
ner as more attractive and socially charming than those who 
saw a photograph of an unattractive woman. These expectan- 
cies were confirmed when the women’s responses varied 
depending on how the men interacted with them. However, the 
women did not accept the way the men treated them when it 
was unfavorable. When the man thought the woman was unat- 


tractive and treated her accordingly, she tended to reject and 
to discount as inaccurate his view of her. 

In another study, Snyder and Swann (1978) created the ex- 
pectation that an interaction partner would be hostile. These 
expectancies were confirmed in the interaction that followed; 
the perceiver expected hostility and the target delivered by act- 
ing in a hostile way. The next question was, would the target 
(who had been perceived as hostile) go on to be hostile with a 
new interaction partner? That is, would the treatment the target 
experienced in one interaction carry over to another? Snyder 
and Swann found a carryover effect only when targets were 
encouraged to attribute their behavior during the first interac- 
tion to their own personalities. The hostile behavior did not 
carry over without this experimental manipulation; people did 
not naturally attribute their hostile behavior to themselves. 
These results again suggest that it is not easy to alter a person’s 
self-appraisal. 


SELF-PRESENTATION 


The most obvious and proactive way that the self parti- 
cipates in social life is through self-presentation. Self- 
presentation is defined as people’s attempts to convey 
information about themselves to others. Some authors have 
emphasized self-presentation to such an extent that they see 
life as an ongoing series of roles, played out as if by an actor 
on stage (Goffman, 1959). 

People seem to be inherently and pervasively concerned 
with self-presentation. Baumeister (1982) showed that many 
of social psychology’s effects occurred because of self- 
presentation. For example, subjects in the Asch line-judging 
study conformed to others’ judgments more strongly when 
these other people were watching (see especially Deutsch & 
Gerard, 1955). When judgments were anonymous, the con- 
formity effect was substantially weaker. Cognitive and in- 
trapsychic theories that explained many effects seemed to be 
missing something, because the effects depended on interper- 
sonal contexts. Thus, aggression, helping, attitude change, 
emotion, attributional patterns, and other responses seemed 
to change when the individual’s acts would be seen by others. 
Leary (1995) has furnished an even longer and more impres- 
sive list, showing effects of self-presentation in contexts 
ranging from sports teams to business meetings to the beach 
to mental hospitals. 


Favorability of Self-Presentation 


In general, people want to present themselves favor- 
ably. However, people are sometimes torn between self- 
enhancement and being seen as likable. The basic question is 


this: How favorably should one present oneself? People’s 
answers appear to depend on several factors. In Schlenker’s 
(1980, 1986) terms, self-presentation is often the result of 
a trade-off between the opposing forces of favorability and 
believability. People often make positive claims about 
themselves in order to make a good impression. However, 
excessively positive claims might not be believed, and they 
could even be discredited. (For example, you could try to 
make a good impression by saying you are a good basket- 
ball player. Someone might not believe you, or worse, you 
might later play basketball and perform poorly). Boasting 
about one’s abilities and being proven wrong leaves a bad 
impression. 

In one of the earliest and most often cited experiments on 
self-presentation, Schlenker (1975) gave participants moder- 
ately negative feedback about their abilities on a novel task 
prior to a session in which group members would perform 
the task. Participants were then asked to describe them- 
selves to the group members. Schlenker wanted to see if 
participants would self-present in positive terms or incorpo- 
rate the negative feedback they had just received. As it 
turned out, the favorability of self-presentation depended on 
whether the upcoming group performance was expected to 
be public or private. If it would be private, so that no one 
would know anyone else’s performance, then participants 
presented themselves in rather favorable terms. If they 
thought other people would be able to see how well they 
did, however, they refrained from boasting. Thus, people 
seemed to present themselves as favorably as they could get 
away with: They boasted when it was safe to do so but re- 
mained modest when it seemed likely that the truth would 
be found out. 

The possibility of future discreditation is not the only con- 
straint on the favorability of self-presentation. It is also lim- 
ited by past actions and other socially available information. 
After all, people do not simply form wholly new impressions 
of others with every single interaction. New information is 
added to old information. The self-presenter must anticipate 
this and know that whatever he or she does now will be 
combined, in the observer’s mind, with what the observer 
already knows. 

An early study of the effects of prior knowledge on self- 
presentation was conducted by Baumeister and Jones (1978). 
Subjects were told that their interaction partners would read 
their personality profiles. As in Schlenker’s (1975) study, 
people felt constrained to be consistent with independent in- 
formation. In this case, they altered their self-presentations to 
fit the randomly assigned feedback. This occurred even when 
the personality profiles were unfavorable. Yet they did not 
leave the matter at that: They sought to compensate for the 
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unfavorable image of themselves by presenting themselves 
extra-favorably on other, unrelated dimensions. Thus, people 
felt constrained to be consistent with what the observer al- 
ready knew about them, but they tried to compensate for a 
bad impression by balancing it with unrelated, highly favor- 
able information. 

The general trend toward favorable self-presentation may 
therefore have significant limits. An additional and quite im- 
portant limit was identified by Tice, Butler, Muraven, and 
Stillwell (1995). These authors pointed out that nearly all 
self-presentation research had been done on first meetings 
between strangers. However, the vast majority of actual so- 
cial interactions take place between people who already 
know each other. The studies they performed showed that 
people tend to be positive and self-enhancing when interact- 
ing with strangers, but they more modest and neutral when 
presenting themselves to friends. This occurs in part because 
of differences in the perceivers’ knowledge. Strangers know 
nothing about you, and so it is necessary to convey one’s 
good traits in order to make a favorable impression on them. 
In addition, a stranger will not be able to dispute an overly fa- 
vorable self-presentation. On the other hand, friends already 
have substantial information about you, and so it is not nec- 
essary to name all of your good traits. Meanwhile, friends 
will know when you are exaggerating. Even if you are being 
honest, friends will probably not respond well to bragging 
and self-aggrandizement. 


Cognition and Self-Presentation 


Do people know what impressions they convey to others? 
DePaulo et al. (1987) investigated that question by having 
subjects interact in a round-robin pattern. Each subject 
interacted with three others, one at a time, in interactions 
structured around different tasks (e.g., a teaching task vs. 
a competition). After each interaction, both subjects reported 
their impressions of the partner and the impressions they 
thought they had made on the partner. The researchers 
were then able to determine whether there were discrepancies 
in perceived versus actual impressions. The answers were 
mixed. There was indeed significant accuracy, although most 
of the correlations were rather low. People could tell in a gen- 
eral way how the other persons’ impressions of them changed 
over time. They were not, however, very effective at guessing 
which partner liked them the most or perceived them as most 
competent. In other words, people cannot often tell who likes 
them the most. In another analysis, the authors found that 
people believed that they had made similar impressions on 
everyone in the group; in fact, different partners reported 
very different impressions of the same person. People seem 
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to think that they come across the same way to everyone, but 
they do not. 

Baumeister, Hutton, and Tice (1989) studied the cognitive 
processes behind self-presentation. In this study, subjects 
were interviewed in pairs. An experimenter instructed one 
member of each pair to self-present in either a modest or a self- 
enhancing fashion. After the interview, subjects were given 
surprise recall tests for both their own and their partners’ self- 
presentations, as well as for their impressions of their part- 
ners. Subjects who had been instructed to be modest and 
self-effacing showed impaired memory for the interaction. 
Apparently, acting modestly (which is an unusual way to act 
with strangers) causes greater cognitive load and interferes 
with the memory storage process during the interaction. In ad- 
dition, subjects seemed unaware of the influence they had on 
others (see Gilbert & Jones, 1986). Thus, for example, if John 
presents himself by saying highly favorable things about 
himself, Bill may also begin to boast. This might lead John to 
conclude that Bill must be rather conceited (or at least very 
self-confident). In fact, Bill’s self-promotion was merely a 
response to John’s. 

The increase in cognitive load caused by effortful self- 
presentation may explain some of the findings of DePaulo 
et al. (1987). When one is concentrating on trying to make a 
certain impression, he or she may not be fully able to attend 
to how the other person is responding. After a series of inter- 
actions, people may remember merely that they tried to make 
roughly the same good impression on each interaction part- 
ner. However, they might not remember that the partners 
responded to them differently. Thus, self-presentation is not 
always successful because it is difficult cognitive work. Mak- 
ing a good impression consumes so many resources that 
people find it hard to attend to other people’s responses and 
adjust that impression. 


Harmful Aspects of Self-Presentation 


Through various means, self-presentation can lead to health 
risks (Leary, Tchividjian, & Kraxberger, 1994). For exam- 
ple, concern about the impression one is making can lead to 
risky and harmful behaviors; at times, the drive to impress 
others can outweigh self-preservation. How does this occur? 
Appearance concerns are a relevant example. On the one 
hand, people believe that having a suntan is attractive; on 
the other, most people have heard the warnings about skin 
cancer. Leary and Jones (1993) showed that the risky behav- 
iors of sunbathing were mainly linked to concern over phys- 
ical appearance and to the lack of concern about health. 
People sunbathe to make themselves attractive, often ignor- 
ing the physical danger involved. High-heeled and platform 


shoes are another example: many women wear them because 
they think it makes them look attractive despite the pain, 
back problems, and lack of coordination that such shoes 
often cause. Risky sexual behavior is also influenced by self- 
presentation. Condoms are generally regarded as the safest 
method for having intercourse outside of stable, monoga- 
mous relationships, but many people do not use them. 
People often cite self-presentational concerns when explain- 
ing their lack of protection, such as embarrassment when 
buying them and the fear of making a bad impression on an 
anticipated sexual partner (Leary, 1995). Other risks re- 
viewed by Leary et al. (1994) include hazardous dieting and 
eating patterns, use of alcohol and illegal drugs, cigarette 
smoking, steroid use, not wearing safety equipment and 
other behaviors that may cause accidental injury and even 
death, and submitting to cosmetic surgery and risk of its sub- 
sequent complications. Taken together, these provide strong 
evidence that self-presentational concerns often take prece- 
dence over concerns with maintaining health and even pro- 
tecting life. 


INTERPERSONAL CONSEQUENCES 
OF SELF-VIEWS 


Clearly, characteristics of the self exert an influence on inter- 
personal relations. One of the best-known findings in social 
psychology is the link between similarity and attraction 
(Byrne, 1971; Smeaton, Byrne, & Murnen, 1989); that is, 
people like those who resemble them (or at least, they avoid 
and dislike people who are different from them; Rosenbaum, 
1986). Similarities on important, heritable traits are espe- 
cially potent bases for liking and disliking others (Crelia & 
Tesser, 1996; Tesser, 1993). 


Self-Views Alter Person Perception 


Evidence suggests that self-views affect how people under- 
stand others. Markus, Smith, and Moreland (1985; see also 
Fong & Markus, 1982) examined the role of self-schemas in 
person perception. They proposed that someone who has a 
self-schema in a particular domain will behave like an expert 
in that domain. For example, schematic people will spot 
domain-relevant information faster, integrate it into exist- 
ing information better, and fill in gaps in information more 
thoroughly. In Markus et al.’s research, people who were 
schematic for masculinity tended to group more items together 
when judging the masculinity-relevant behavior of a stimulus 
person. They also saw the stimulus person as more masculine 
and more like themselves than did aschematic individuals. 


Thus, aspects of self-concept can influence the perception 
of others (however, it is also possible that greater interest in 
the area relevant to the self leads to the expertise). The key 
point appears to be that a particularly well developed aspect 
of self-knowledge makes one act like an expert in that sphere. 
If your view of yourself emphasizes loyalty, for example, you 
will probably be more sensitive to loyalty or disloyalty in 
others. 

One mechanism driving the link between self-views and 
person perception is the self-image bias (Lewicki, 1983, 
1984). According to this bias, people tend to judge others on 
the basis of traits in their own areas of strength. Thus there is 
a correlation between the favorability and the centrality of 
self-ratings (Lewicki, 1983). That is, people’s most favorable 
traits are also those that are most central and important 
for their judgments of others—people judge others by a stan- 
dard that favors them (the perceiver). For example, students 
who did well in a computer science course tended to place 
more emphasis on computer skills when judging others than 
did students who did not perform well in the computer course 
(Hill, Smith, & Lewicki, 1989). Lewicki (1984) showed that 
the self-image bias serves a defensive function: When people 
receive negative feedback, the effect of self-image bias on 
perception of others is increased. Along these lines, Dunning, 
Perie, and Story (1991) found that people construct proto- 
types of social categories such as intelligence, creativity, and 
leadership in ways that emphasize their own traits. Thus, in- 
quisitive people think inquisitiveness is a valuable aid to cre- 
ativity, but noninquisitive people do not believe that 
inquisitiveness has any far-reaching implications for other 
outcomes. These prototypes thus influence how people eval- 
uate others. 

Rejecting a view of self through a defensive process also 
affects person perception. Newman, Duff, and Baumeister 
(1997) proposed a new model of the Freudian defense 
mechanism of projection (basically, seeing one’s faults in 
other people rather than in oneself). This model builds on 
evidence that suggests that when people try not to think 
about something, it instead becomes highly accessible in 
memory (Wegner & Erber, 1992). Newman et al. showed 
that when people tried to suppress thoughts about a bad trait 
that had been attributed to them, they then interpreted other 
people’s behavior in terms of that bad trait. Thus, person 
perception can be shaped by the traits you are trying to deny 
in yourself, just as much as by the traits that you do see in 
yourself. 

All of these effects can be explained by accessibility. The 
attributes the self emphasizes, and those the self seeks to deny, 
operate as highly accessible categories for interpreting others’ 
behavior (Higgins, King, & Mavin, 1982). Social perception 
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thus tends to be self-centered and self-biased. Still, these ef- 
fects appear to be specific and limited; not all interpersonal 
perception is wildly distorted by self-appraisals. In particular, 
these effects seem to be limited to situations in which infor- 
mation about the target person is ambiguous (Lambert & 
Wedell, 1991; Sedikides & Skowronski, 1993). 


Self-Evaluation Maintenance 


Several important links between self-esteem and inter- 
personal relations have been elaborated in Tesser’s (1988) 
self-evaluation maintenance theory. Among other conse- 
quences, this theory explains how people may become closer 
to or more distant from relationship partners as a result of 
pressures to maintain self-esteem. According to Tesser, two 
main processes link self-views to interpersonal outcomes. 
First is the process of reflection; one can gain esteem when a 
close other achieves something. One’s self-esteem may get a 
boost simply from having an uncle who is a Congressman or 
a child who is quarterback of the football team; from sleeping 
with a movie star, or from learning one’s college basketball 
team has won a championship. Cialdini and his colleagues 
have shown how people bask in reflected glory of institu- 
tions, for example, by wearing school colors more frequently 
following a team victory than following a defeat (Cialdini & 
Richardson, 1980). 

The other process is one of comparison (see Festinger, 
1954; Wills, 1981); this process can instead lead to a decrease 
in self-esteem. People may compare themselves with others 
close to them and feel bad if the other person is outperform- 
ing them. If your sibling gets better grades than you, if your 
dimwit brother-in-law earns double your salary, or if your 
friend wins a scholarship or a job you wanted, you may lose 
esteem. 

Thus, the processes of reflection and comparison with 
close others produce opposite effects on self-esteem. Tesser’s 
work has therefore gone on to look for factors that determine 
which process will operate in a given situation. One factor is 
the relevance of the accomplishment to one’s self-concept. 
Thus, a friend’s football victory may bring you esteem, as the 
reflection process predicts—but only if your own football- 
playing ability is not highly relevant to your own self-esteem. 
If you played in the same football game and performed terri- 
bly, your friend’s success would make you look that much 
worse by comparison. For this reason, people sometimes pre- 
fer to see strangers succeed rather than close friends, because 
the stranger’s success does not invite comparison and is less 
humiliating. Tesser and Smith (1980) showed that people will 
do more to help a stranger than a friend to succeed at a task 
that is relevant to the person’s own self-esteem. 
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Meanwhile, the closer the relationship, the greater the 
effect. You gain (or lose) more esteem if your spouse wins a 
major award than if your hairdresser wins it. Thus the com- 
parison process may be especially disruptive to close relation- 
ships. If a romantic partner succeeds on something irrelevant 
to your self-esteem, you may feel closer to that partner. If he 
or she succeeds at something highly relevant to your own 
goals, then you may feel jealous or threatened, and the inti- 
mate relationship may be damaged (Beach, 1992). When the 
comparison process makes you look bad, the only way to 
limit the damage may be to reduce closeness. Research con- 
firms that people distance themselves from someone who per- 
forms too well on something that is highly relevant to their 
own self-concepts (Pleban & Tesser, 1981). 


Self-Monitoring 


Snyder (1974, 1987) proposed an early and influential theory 
about individual differences in how the self structures inter- 
personal processes. He was first interested in cross-situational 
consistency, stimulated by Bem and Allen’s (1974) suggestion 
that some people are more consistent in their traits than others. 
Snyder introduced the concept of self-monitoring as an indi- 
vidual difference, distinguishing between high self-monitors 
and low self-monitors. A high self-monitor looks to others for 
cues, modifying his or her behavior to fit the situation and the 
people in it. A low self-monitor, on the other hand, is more 
consistent and does not try to alter behavior very much across 
situations. Subsequent research (Snyder & Swann, 1976) 
showed that low self-monitors had high attitude-behavior 
consistency: Their attitudes predicted their verdicts in a simu- 
lated jury case. In contrast, high self-monitors’ attitudes did 
not predict their behavior very well, probably because they 
modified their statements on the jury case to fit the immediate 
situational demands and cues. It seems that high self-monitors 
do not see any necessary relation between their private be- 
liefs and their public actions, and so discrepancies do not 
bother them (Snyder, 1987). Thus, there is a basic difference in 
how these two types of people regard themselves. Low self- 
monitors believe that they have strong principles, and they 
consistently strive to uphold them. High self-monitors see 
themselves as pragmatic and flexible rather than principled. 
They respond to the situation and do what they regard as 
appropriate, which often includes altering their own self- 
presentations. 

Further research addressed the interaction patterns associ- 
ated with the different levels of self-monitoring. Low self- 
monitors base friendships on emotional bonds, and they 
prefer to spend most of their time with the people they like 
best. In contrast, high self-monitors base friendships on 


shared activities. Thus they spend time with the people who 
are best suited to the relevant activity. For example, the low 
self-monitor would prefer to play tennis with his or her best 
friend, regardless of how well the friend plays. The high self- 
monitor would rather play tennis with the best tennis player 
among his or her acquaintances (or the one best matched to 
his or her own abilities). Consequently, the social worlds of 
high self-monitors are very compartmentalized, with differ- 
ent friends and partners linked to specific activities. On the 
other hand, the social worlds of low self-monitors are rela- 
tively uncategorized by activities, with friends chosen 
instead on the basis of emotional bonds. 

These interpersonal patterns carry over into romantic rela- 
tionships (Snyder & Simpson, 1984; Snyder, 1987). For ex- 
ample, high self-monitoring males choose dating patterns 
based mainly on physical appearance, whereas low self- 
monitors place more emphasis on personality and other inner 
qualities. High self-monitors tend to have more romantic and 
sexual partners than lows. When it comes to marriage, high 
self-monitors again look for shared activities and interests, 
whereas low self-monitors emphasize mainly the pleasures 
and satisfactions of simply being together. 


Partner Views of Self 


Swann (1996) advanced a simpler theory of how interper- 
sonal relationships are shaped by self-views. Extending self- 
verification theory, Swann argued that people prefer romantic 
partners who see them as they see themselves. People are 
sometimes torn between a desire to see themselves favorably 
and a desire to confirm what they already think of themselves 
(as we discussed earlier). If love is truly blind, a person in 
love would see the beloved partner in an idealized way. 
Would that be helpful or harmful? 

Swann and his colleagues (Swann, Hixon, & De La 
Ronde, 1992) have examined such dilemmas in various rela- 
tionships, ranging from roommates to spouses. On a variety 
of measures, they found that people would rather be with 
someone who confirms their self-views (as opposed to some- 
one who saw them favorably). People choose, like, and retain 
partners who see them accurately. This research might ex- 
plain why some people have partner after partner who treats 
them badly: They somehow feel that they do not deserve to 
be treated well. In this view, the idealizing effects of love are 
dangerous and harmful to the relationship. Apparently people 
want their friends and lovers to see all their faults. 

However, a large independent investigation found that fa- 
vorability is more important than consistency with self- 
views. Murray, Holmes, and Griffin (1996a) found that 
favorable views of one’s partner were associated with better 


relationships. Idealization was associated with greater satis- 
faction and happiness about the relationship. A follow-up 
study (Murray, Holmes, & Griffin, 1996b) found that favor- 
able views of one’s partner predicted greater stability and 
durability of the relationship. This research suggests that per- 
haps love should be blind (or at least nearsighted enough to 
wear rose-colored glasses when looking at the loved one). 
The authors argue that idealized love is not blind, but instead 
farsighted; partners who idealized each other created the re- 
lationships they wanted. Idealization and positive illusions 
about one’s partner seem to strengthen the relationship, mak- 
ing it more pleasant and more likely to last. Seeing the real 
person beneath the facade is not always the beginning of real 
intimacy: Sometimes it is the beginning of the end. 

These somewhat discrepant results do at least agree that it 
is quite important for people to believe that their friends and 
lovers appreciate their good points. It is less clear whether 
people want their partners to also see their faults and flaws. 
One possible explanation for the discrepant results of the two 
authors is that most of Swann’s self-consistency work has 
emphasized traits that the person is highly certain of and 
committed to having. On the other hand, Murray’s favorabil- 
ity effects tend to emphasize a broader spectrum of less cer- 
tain traits. People might want their close relationship partners 
to recognize one or two favorite faults but otherwise maintain 
a highly favorable view of them. 

There is also intriguing but preliminary evidence that rela- 
tionship partners can help sustain consistency. Swann and 
Predmore (1985) gave people feedback that was discrepant 
from their self-views and watched how they and their roman- 
tic partners responded. When the subject and his or her part- 
ner agreed that the feedback was wrong, the pair then joined 
forces to reject it: They discussed its flaws and decided how 
best to refute or dismiss it. In contrast, when the partner’s 
view of the subject differed from the subject’s self-view, 
the discrepant feedback led to further disagreements between 
the subject and partner. It may be that one vital function of 
close relationship partners is to help maintain and defend 
one’s self-concept against the attacks of the outer world (see 
also De La Ronde & Swann, 1998). 

Sedikides, Campbell, Reeder, and Elliott (1998) explored 
another important link between self-deception and the inter- 
personal self. They examined the self-serving bias, a classic 
pattern of self-deception that occurs when people take credit 
for success but deny blame for failure. When people work in 
groups, the self-serving bias produces the tendency to claim 
all the credit for success at joint tasks but to dump the blame 
for failure on the other group members. However, the authors 
found that self-serving bias is mitigated when the group 
members feel a close interpersonal bond with each other. 
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Thus, people will flatter themselves at their partner’s 
expense—but only when they do not care much about the part- 
ner. The interpersonal context dictates whether people will 
display the self-serving bias. 


Self-Handicapping 


When someone self-handicaps, he or she tries to explain 
away failure (or even possible failure) by attributing it to 
external causes (often external causes of his or her own 
making). Self-handicapping is usually studied within the 
context of individual performance, but it has a strong inter- 
personal aspect as well. One study manipulated whether 
several crucial aspects of the situation were public (known 
to others) or private (known only to the subject; Kolditz & 
Arkin, 1982). Self-handicapping emerged mainly in the 
public conditions, when the subject’s handicap and subse- 
quent performance would be seen by others. In contrast, 
subjects did not self-handicap when the experimenter was 
unaware of the handicap. Apparently, self-handicapping is 
primarily a self-presentational strategy used to control the 
impression one makes on other people. Self-handicapping 
rarely occurs when people are concerned only with their 
private self-views. 


EMOTIONS AND THE INTERPERSONAL SELF 


Emotions often reflect value judgments relevant to the self. 
Recent work has increasingly emphasized interpersonal 
determinants and processes of emotion (Tangney & Fischer, 
1995). 


Shame and Guilt 


Both shame and guilt have strong interpersonal components. 
The two terms were used in varying ways for decades, but in 
recent decades a consensus has emerged about how to define 
the two. The distinction was first proposed in the theoretical 
work by Lewis (1971), based on clinical observations, and it 
received considerable elaboration and further support from 
factor analytic and scale-development studies by researchers 
such as Tangney (1992, 1995). The difference between the 
two lies in how much of the self is affected: Guilt denounces 
a specific action by the self, whereas shame condemns the 
entire self. 

Shame is usually the more destructive of the two emo- 
tions. Because shame signifies that the entire self is bad, sim- 
ple reparations or constructive responses seem pointless. This 
absence of constructive solutions probably leads to many of 
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the pathological outcomes connected with shame, such as 
suicide and major depression (Tangney, Burggraf, & Wagner, 
1995). Shame also seems to produce socially undesirable out- 
comes such as, for some people, a complete withdrawal from 
others. Other people, however, respond to shame with anger 
(Tangney et al., 1992). The shift from shame into anger may 
be a defensive effort to negate the global negative evaluation. 
There is some evidence that this shift in emotions can lead 
to violent outbursts (Baumeister et al., 1996). Kitayama, 
Markus, and Matsumoto (1995) have proposed that the 
movement from shame to anger reflects the independent self- 
hood model common to Western cultures and may not occur 
in cultures that emphasize more interdependent selves. 

In contrast, guilt is more reparable and less socially disrup- 
tive than shame. Guilt has a strong basis in relationships even 
when no transgression is involved. For example, some people 
feel survivor guilt because they have survived when others 
have died or suffered. The term originated in studies of sur- 
vivors of the Holocaust and the Hiroshima bombing (Lifton, 
1967). More recently, survivor guilt emerged during episodes 
of corporate downsizing, when people who kept their jobs felt 
guilty while others were fired (Brockner, Davy, & Carter, 
1985). In general, people may feel guilty when they outper- 
form others (Exline & Lobel, 1999). 

According to Baumeister, Stillwell, and Heatherton 
(1994), guilt is mainly interpersonal and seems designed to 
strengthen relationships. People may try to avoid hurting 
close others because it makes them feel guilty. After a trans- 
gression, guilt makes people seek to make amends or rectify 
the situation in an attempt to repair the damage to the relation- 
ship. It makes people change their behavior so that they will 
not repeat the damaging behavior. It makes them try to live up 
to the expectations of others. Feeling guilty is sometimes ben- 
eficial to the relationship in and of itself, because guilty feel- 
ings confirm that the person cares about the relationship (even 
if the transgression made it appear that he or she did not care). 
In addition, people sometimes exaggerate how hurt or upset 
they are by another person’s actions, in order to make that per- 
son feel guilty. The guilt makes the other person more willing 
to comply with the wishes of the person who felt hurt. This 
tactic can be used to redistribute power in a relationship: Guilt 
enables otherwise powerless people to sometimes get their 
way. Usually, the person who is hurt makes his or her feelings 
and disappointment clear. If the other person cares about your 
welfare, he or she will want to avoid hurting you, because 
hurting you will make him or her feel guilty. Hence the person 
will do what you want. 

Baumeister, Reis, and Delespaul (1995) confirmed that 
guilt plays an important role in close relationships. The 
authors asked participants to describe their most recent 
experiences of six different emotions, including guilt. These 


were then coded for the level of interpersonal connection. 
Guilt scored the highest of the six major emotions on interper- 
sonal connection. That is, hardly any guilt stories referred to 
solitary experiences or interactions with strangers; the over- 
whelming majority of guilt stories involved partners in close 
relationships, such as family members or romantic partners. 


Embarrassment 


Similar to shame and guilt, embarrassment seems to be a mix- 
ture of self and interpersonal concerns. Modigliani (1971) 
linked embarrassment to the public self by showing that the 
best predictor of embarrassment was a situational, perceived 
loss of others’ good opinion. In addition, embarrassability 
correlates more highly with public self-consciousness than 
with private self-consciousness (Edelmann, 1985). 

In an influential review, Miller (1995) argued that two 
theoretical perspectives on embarrassment are predominant. 
The first theory emphasizes concern over being evaluated by 
others; to be embarrassed, you must first be concerned about 
others’ evaluations. The alternative view invokes the un- 
pleasant nature of awkward social interactions. In one study, 
Parrott, Sabini, and Silver (1988) presented participants with 
a hypothetical scenario in which someone refused a date. 
People reported they would feel less embarrassed if the rejec- 
tor used an obvious excuse than if the rejector bluntly 
rejected them, even if the person’s rejection was equally neg- 
ative. However, making an excuse may itself convey a posi- 
tive evaluation, such as concern for the rejected person’s 
feelings (Miller, 1995). Miller concluded that both perspec- 
tives are valid; nevertheless, the concern over social evalua- 
tion is the more common cause of embarrassment. 

Blushing is one common sign of embarrassment, but peo- 
ple sometimes blush even when there is no obvious social 
evaluation. Leary, Britt, Cutlip, and Templeton (1992) con- 
cluded that unwanted social attention is the most common 
cause of blushing. In general, people blush as an appease- 
ment to others after violating social norms. People hope that 
looking embarrassed after a transgression will inform other 
people that they feel remorseful. Apparently, embarrassment 
is effective in minimizing negative evaluations. Semin and 
Manstead (1982) found that subjects expressed greater liking 
toward someone who was embarrassed after an accidental 
transgression. When the target person was not embarrassed, 
subjects did not like the person as much. 


Social Anxiety 


Schlenker and Leary (1982) argued that social anxiety is di- 
rectly linked to self-presentation. In their view, social anxiety 
arises when someone wants to make a particular, desired 


impression but fears that he or she will fail to do so. As Leary 
and Kowalski (1995) describe it, social anxiety is essentially 
a concern about controlling public impressions. Making a 
particular impression is important for gaining the acceptance 
of others and for achieving status (two important interper- 
sonal goals). Given the importance of being perceived 
positively by others, it is hardly surprising that some people 
become extremely concerned and anxious during social 
situations. 


Disclosing Emotion and Personal Information 


So far, we have discussed the interpersonal roots of emotion. 
In what way do interpersonal situations, however, affect the 
expression of these emotions? Clark, Pataki, and Carver 
(1995) found that people are careful about how much happi- 
ness they express when they are concerned about the impres- 
sion they are making on others. As an influential review 
showed, people are concerned that their success will create 
feelings of jealousy and dislike (Exline & Lobel, 1999). 
Clark et al. (1995) also found that people express anger in an 
attempt to get their own way. Sadness, too, can be used as an 
interpersonal lever; people show sadness when they want 
others to see them as dependent in order to gain their help. 
These strategies correspond to the self-presentational tactics 
of ingratiation, intimidation, and supplication (E. E. Jones & 
Pittman, 1982). A more general statement was provided by 
DePaulo (1992): People can either exaggerate or downplay 
their emotional reactions in order to meet their self-presenta- 
tional goals. That is, sometimes it is best to pretend to be hav- 
ing a strong emotional reaction, and other times it is 
advantageous to conceal one’s emotions. 

Levels of self-disclosure are also affected by self-control. 
When one’s self-control is depleted by a self-regulatory task, 
one is less able to maintain an appropriate level of self- 
disclosure. People with an avoidant attachment style with- 
draw too much during interactions after being depleted, 
whereas those with an anxious attachment style disclose too 
much (Vohs, Ciarocco, & Baumeister, 2002). Because a mod- 
erate amount of self-disclosure is best for smooth interaction, 
self-regulatory depletion affects the quality of interactions 
through disrupting self-disclosure. 


CULTURAL AND HISTORICAL VARIATIONS 
IN SELFHOOD 


Most of the research presented so far has studied North 
American college students at specific points in time (usually 
between 1975 and 2000). Although this research is informa- 
tive, it does not capture the variations in selfhood across the 
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cultures of the world and the decades of the century. Given 
that the self is an inherently social construct, there should be 
considerable cultural and historical variation. 


Culture and Society 


The past 15 years have brought much interest in the cultural 
determinants of selfhood. By way of summary, it is useful 
to draw from an influential review article by Triandis (1989). 
This review identified several key features of selfhood that 
vary across different cultures. First, cultures vary in concep- 
tions of the private self, or how people understand them- 
selves (e.g., self-regard, self-esteem, introspection, and 
individual decision making). Second, the public self refers to 
how the individual is perceived by other people, thus includ- 
ing issues such as reputation, specific expectations of others, 
and impression management. Third, the collective self 
involves memberships in various social groups, from the 
family to an employing organization or ethnic group. Trian- 
dis argues that individualistic societies such as that in much 
of the United States emphasize the private and public selves 
and downplay the collective self, whereas other (e.g., Asian) 
societies tend to emphasize the collective self while down- 
playing the private self. Variation in these conceptions may 
also occur within a society. For example, some authors have 
argued that African-Americans show more collectivistic 
tendencies compared to White Americans (e.g., Baldwin & 
Hopkins, 1990). 

Triandis (1989) also proposed several important cultural 
dimensions that have important implications for the self. 
One dimension is individualism versus collectivism. Individ- 
ualistic societies support diversity, self-expression, and the 
rights of individuals, whereas collectivistic societies promote 
conformity and a sense of obligation to the group. As a 
general rule, Western societies such as the United States are 
more individualistic, whereas Asian and African societies are 
more collectivistic. In general, relationships are closer in col- 
lectivistic societies. The concept of an independent, individ- 
ual self is not as common; rather, a person sees his or her self 
as overlapping with the selves of close others. 

Another dimension that varies between societies is tight- 
ness, or the amount of social pressure on individuals. Tight 
societies demand that individuals conform to the group’s 
values, role definitions, and norms. In contrast, loose soci- 
eties allow people more freedom to do what they want. (For 
that reason, tight societies tend to promote the public and col- 
lective selves, whereas loose ones allow more scope for the 
private self to flourish.) 

A third dimension of cultural variation proposed by Trian- 
dis (1989) is societal complexity. In a complex society, an 
individual tends to belong to many different groups; thus it is 
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less imperative to stay on good terms with any one of these 
groups. The collective self is therefore not so crucially 
important. In addition, complex societies allow greater devel- 
opment of the private self (because of the greater availability 
of many social relationships). The public self is also quite 
important because it is the common feature of all one’s social 
relations. In contrast, in a simple society people belong to 
relatively few groups, each of which is then quite important 
in defining the self. The collective self flourishes in adapting 
to these memberships, and the need to conform to the group 
tends to stifle the private self. 

Triandis (1989) illustrated some of his central ideas by 
contrasting American and Japanese societies. Japan tends to 
be tighter and more collectivistic than the United States, and 
as a result there is much greater homogeneity: Japanese 
citizens tend to eat the same foods, whereas Americans use 
the prerogative of the private self and choose from a broad 
assortment. Certain Asian traditions, such as having the 
oldest male order the same food for the entire table, would be 
unthinkable in the United States, where each individual’s 
special preferences are honored. 

Furthermore, Americans place a premium on sincerity. At 
its base, sincerity is the congruency between public and 
private selves: You are supposed to say what you mean and 
mean what you say. In Japan, however, public actions are more 
important than private sentiments. For example, Americans 
object to hypothetical dilemmas in which people think one 
thing and say another, whereas Japanese respondents approve 
of these options. 

Markus and Kitayama (1991) proposed that Asian and 
Western cultures primarily vary in independence versus in- 
terdependence. Western cultures, they argue, emphasize the 
independent self: People are supposed to attend to them- 
selves, to discover and express their unique attributes, and to 
try to stand out in important ways. In the West, as they say, 
the squeaky wheel gets the grease. In contrast, Asian cultures 
emphasize interdependence. Asians are expected to attend to 
others, to conform to group demands and role obligations, 
and to try to fit into the group. In Asia, “the nail that stands 
out gets pounded down.” To the Western mind, the self is 
an autonomous unit that is essentially separate and unique, 
whereas the Asian view begins with an assumption of the 
basic and pervasive connectedness of people. 

Multiple consequences flow from this idea. As might be 
expected in an interdependent culture, relationship harmony 
was more important to self-esteem for students in Hong Kong 
compared to students in the United States (Kwan, Bond, & 
Singelis, 1997). Because relationships are more intertwined 
with the self in these cultures, they are more important to self- 
esteem and life satisfaction. Self-enhancing biases also differ 


between the two types of cultures. In general, Americans tend 
to self-enhance, whereas the Japanese tend to self-criticize 
(Kitayama, Markus, Matsumoto, & Norasakkunkit, 1997). 

People from independent cultures also tend to describe oth- 
ers in terms of cross-situational, person-centered traits (e.g., 
“He is stingy”). In contrast, people from interdependent cul- 
tures tend to describe others more in terms of specific contexts 
(e.g., “He behaves properly with guests but feels sorry if 
money is spent on them”; Markus & Kitayama, 1991, p. 232). 
Self-descriptions also vary between cultures (Bond & 
Cheung, 1983; Cousins, 1989). Japanese college students 
asked to finish a sentence beginning with “I am...’ were more 
likely to respond with social roles (“brother,” “student at 
Tokyo University”), whereas American college students were 
more likely to respond in terms of personal attributes (“outgo- 
ing,” “blonde”). Thus, members of independent societies see 
themselves and others in terms of relatively constant personal- 
ity traits, whereas members of interdependent societies see 
personality and behavior as more dependent on the situation. 

In addition, interdependent societies do not emphasize 
consistency among private thoughts and feelings as much as 
independent societies do. In an interdependent society, it is 
more important to be accommodating and kind than to be in- 
ternally consistent. Among independent selves, politeness 
means giving the other person the maximum freedom to ex- 
press unique, special, and changing wants. Among interde- 
pendent selves, however, politeness means anticipating what 
the other might want and showing appreciation for their ac- 
tions. There are also emotional consequences, as Markus and 
Kitayama (1991) explain. In the West, the expression versus 
suppression of anger has long been a point of controversy; 
anger is socially disruptive, but it also expresses the needs of 
the individual. In Asian cultures, however, there is no contro- 
versy: Anger is to be avoided at all costs. 

Thus it is important to consider culture when studying the 
self. Most research on the self, like that on most psychologi- 
cal topics, has involved participants from Western countries. 
As a result, it may exaggerate the fundamental nature and 
pervasiveness of the independent self. Although cultures 
share many conceptions of selfhood, many others show strik- 
ing differences. 


Historical Evolution of Self 


It is not necessary to visit multiple cultures to find variations 
in selfhood. There is often ample variation within a single 
culture, because cultural change over time modifies the soci- 
ety. This is the root of research on birth cohort differences 
(e.g., Caspi, 1987; Stewart & Healy, 1989; Twenge, 2000, 
2001a, 2001b): Your generation influences the culture you are 


exposed to and thus your individual characteristics. Western 
culture’s dominant ideas about selfhood have changed and 
evolved dramatically over the past few centuries (see 
Baumeister, 1987; Twenge & Campbell, 2001). Thus, the 
special nature of the modern Western form of selfhood can 
be understood in a historical context as well as in the context 
of cross-cultural comparisons. These changes are important 
for the interpersonal self because many of these trends have 
affected personal relationships and the independent- 
interdependent nature of the self. Just as some cultures (such 
as the West) are more independent, so are some time periods 
(such as 1970-2000). In addition, shifts in self-views due to 
societal trends demonstrate the inherently social nature of the 
self: It changes in response to the larger society and one’s 
generational peers. 


Medieval Times to the Twentieth Century 


During medieval times, people did not have identity crises the 
way we do today (see Baumeister, 1987, for a review). In ear- 
lier times, age, gender, and family were the decisive determi- 
nants of life outcomes and thus of identity. There were set 
patterns for life, depending on the constraints of these ascribed 
attributes; if you were born a peasant worker, you remained a 
peasant worker. Upward mobility was almost nonexistent, and 
most men entered their father’s professions or were appren- 
ticed to professions chosen by their parents. Religion dictated 
strict standards for behavior and worship. Many marriages 
were arranged. To put it crudely, a rigid society told our an- 
cestors who they were, and there was not much they could do 
about it. In general, these societies were tighter and more col- 
lectivistic than Western societies are today. 

Over the course of several centuries, Western societies be- 
came looser and more individualistic. For example, modern 
selves are based on changing rather than stable attributes. 
Gender and family background slowly became less important 
than more changeable attributes such as ability, diligence, 
and personality. The modern Western self can be defined and 
redefined much more than the self of earlier eras. This greater 
freedom has also shifted the burden of defining the self onto 
the individual; today everyone can choose from a wide spec- 
trum of possible identities. This freedom can cause anxiety, 
however, because these choices can be overwhelming in their 
scope and direction. It also requires great self-knowledge, 
because decisions about careers and romantic partners are 
based on suitability (What is the best job for me? Is this per- 
son the one I’m supposed to marry?) The burden falls most 
heavily on adolescents, because adolescence ends with the 
formation of adult identity (e.g., Erikson, 1968). Hence, in 
the twentieth century adolescence has become a period of in- 
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decision, uncertainty, experimentation, and identity crisis 
(see Baumeister & Tice, 1986). 


The 1960s to the Present 


The trend toward greater focus on the self has accelerated in 
recent decades. Over the last 30 years, the self has become 
increasingly more individualized and autonomous. During 
the late 1960s and 1970s, popular culture promoted self- 
fulfillment, self-love, and “being your own best friend” 
(Ehrenreich & English, 1978). Pollsters noted that “the rage 
for self-fulfillment” had spread everywhere (Yankelovich, 
1981). At one time, duty and modesty were the most favorable 
traits; during the 1970s, however, self-help books advised 
“a philosophy of ruthless self-centeredness” that informed 
people that “selfishness is not a dirty word” (Ehrenreich & 
English, 1978, p. 303). The preoccupation with self so perme- 
ated the society that Lasch (1978) called it “The Culture of 
Narcissism”; L. Y. Jones (1980, p. 260) spoke of the decade’s 
“orgy of self-gratification”; and the young adults of the 1970s 
acquired the label “The ‘Me’ Generation.” Increasingly, pro- 
claiming that you loved, cherished, and valued yourself was 
no longer an immodest proposition (L. Y. Jones, 1980; Rosen, 
1998; Swann, 1996). By the 1980s, Whitney Houston could 
sing (without irony) that “the greatest love of all” was for 
oneself. 

This emphasis on individualism had specific consequences 
for many interpersonal relationships. Because spouses and 
children necessarily hindered the expression of unfettered 
individualism, writers and commentators increasingly por- 
trayed marriage and children as “a drag” (Ehrenreich & 
English, 1978, p. 295). For example, if there was a conflict be- 
tween what is best for the marriage and what is best for the 
self, earlier generations often placed the obligation to mar- 
riage as the supreme duty, but more recent generations placed 
the self higher (Zube, 1972). “From now on, Americans 
would live for themselves,” notes David Frum in his cultural 
history of the 1970s (2000; p. 58). “If anyone or anything else 
got in the way—well, so much the worse for them.” It is prob- 
ably not a coincidence that divorce rates began to rise sub- 
stantially during the late 1960s and early 1970s, just as this 
new individualism was taking hold (Frum). 

In addition, many authors have argued that the 1970s 
promoted negative attitudes toward children—what the 
Germans call Kinderfeindlichkeit, or hostility toward 
children (see, e.g., Holtz, 1995; Strauss & Howe, 1991). 
According to some authors, the growing emphasis on indi- 
vidualism tended to decrease the priority parents placed on 
children’s needs as opposed to their own (Ehrenreich & 
English, 1978). At the same time, the birth rate declined 
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during the 1970s, reaching historic lows that have not been 
equaled since. Children did not fit into the picture of individ- 
ual self-fulfillment—after all, what could they really do for 
their parents? 

Not only did the general societal ethos promote the self, but 
a self-esteem movement (an offshoot of the human-potential 
and self-growth movements) gained prevalence, arguing that 
“the basis for everything we do is self-esteem” (MacDonald, 
1986, p. 27; quoted in Seligman, 1995). During the early 
1980s, educators began to actively promote self-esteem in 
school children. This was partially accomplished by affirma- 
tion (children were given T-shirts that said “I’m lovable and 
capable” or sang songs about self-love; e.g., Swann, 1996). In 
addition, many schools discouraged criticism, telling teachers 
not to correct misspellings or grammar mistakes, so as not to 
harm a child’s self-esteem (Sykes, 1995). Thus the culture in- 
creasingly promoted self-esteem as an end unto itself, rather 
than as an outcome of accomplishment or meaningful per- 
sonal relationships. 

This popular interest in the self also meant that young peo- 
ple became increasingly exposed to self-esteem as a desirable 
goal. Gergen (1973) argued that the popularization of 
psychological concepts often creates changes in the responses 
of the subject populations. Self-esteem is a prime candidate 
for changes based on popularization. Not only has self-esteem 
been directly trumpeted by social movements and promoters, 
but the concept has received wide media attention in newspa- 
pers, magazines, television programs, and popular music 
(Whitney Houston sings about it, and a popular song in the 
mid-1990s explained the singer’s misguided actions as result- 
ing from “low self-esteem’). If anything, this attention in- 
creased during the 1980s; while the self-esteem and human 
potential movements reached only some people in the 1970s, 
the 1980s and 1990s saw talk about self-esteem enter the 
mainstream. 

Empirical searches show that coverage of self-esteem has 
increased substantially in the popular press (these searches 
were originally performed for Twenge & Campbell, 2001). In 
1965, the Reader’s Guide to Periodical Literature did not even 
include a listing for self-esteem (nor did it list any articles 
under self-respect or self-love). In 1995, the Reader e Guide 
listed 27 magazine articles devoted solely to the topic of self- 
esteem. In addition, a search of the Lexis-Nexus database for 
1995 articles mentioning self-esteem exceeded the search 
limit of 1,000 articles; the 1,000-article limit was still ex- 
ceeded even when the search was limited to a single month 
(June 1995). In the academic literature, PsycLit also shows a 
steady increase in articles mentioning self-esteem. From 1970 
to 1974, .6% of all articles in the database mentioned self- 
esteem. This number increased steadily, reaching .10% from 


1975 to 1979 and .12% from 1980 to 1984; the number has 
since leveled off at .12% to .13%. Thus, over the time period in 
question, academic publications examining self-esteem have 
doubled. 

One consequence of these cultural changes has been in- 
creases in self-esteem as measured by popular question- 
naires. Twenge and Campbell (2001) found that college 
students’ scores on the Rosenberg Self-Esteem Scale rose 
more than a half a standard deviation between the late 1960s 
and the early 1990s. Children’s scores on the Coopersmith 
Self-Esteem Inventory also increased from the early 1980s to 
the early 1990s. The authors argued that much of this change 
can be traced to the self-esteem movement and the general 
emphasis on the individual self in the larger society. In- 
creases in assertiveness (Twenge, 2001b) and extraversion 
(Twenge, 2001a) complete the picture of a generation in- 
creasingly concerned with the self, individual rights, and 
self-expression. 

To sum up: The self cannot be fully understood without 
reference to culture, whether that culture differs with respect 
to region or with respect to time. Research on cultural differ- 
ences has blossomed into an extensive and growing subfield, 
while research on birth cohort and change over time is just 
beginning to be conducted. As Caspi (1987) argued, many 
aspects of development and personality must be understood 
within the context of time, because the larger sociocultural 
environment changes so much from decade to decade (also 
see Gergen, 1973). When we are born, grow up, and discover 
our adolescent and adult identities has a substantial effect on 
how we see the self as an entity. 
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Attitudes refer to the general and relatively enduring evalua- 
tions people have of other people, objects, or ideas. These 
overall evaluations can be positive, negative, or neutral, and 
they can vary in their extremity. For example, one individual 
might view jazz music in a mildly positive way, whereas an- 
other might be wildly positive and another might be some- 
what negative. Individuals can hold attitudes about very 
broad or hypothetical constructs (e.g., anarchy) as well as 
about very concrete and specific things (e.g., a particular 
brand of chewing gum). Before turning to our primary focus 
on the processes involved in changing attitudes, we address 
some important background issues on the nature and struc- 
ture of attitudes. Following this background discussion, we 
describe ways to change attitudes that involve relatively high 
versus low amounts of cognitive effort and the consequences 
of these different strategies. 


BACKGROUND ISSUES 


Bases of Attitudes 


Attitudes can be based on different types of information. 
One popular conceptualization of the attitude construct, the 
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tripartite theory, holds that there are three primary types 
of information on which attitudes can be based (Breckler, 
1984; Rosenberg & Hovland, 1960; Zanna & Rempel, 1988): 
cognitions or beliefs (e.g., This car gets 10 miles per gallon), 
affect or feelings (e.g., Owning this car makes me happy), 
and actions or behavior (e.g., / have always driven this brand 
of car). The basis of the attitude object can have important 
implications for attitude change (see also the chapter by 
Olson & Maio in this volume). For example, it may generally 
be more effective to change attitudes that are based on emo- 
tion with emotional strategies rather than with more cogni- 
tive or rational ones (Edwards, 1990; Fabrigar & Petty, 
1999). 


Attitude Storage Versus Construction 


Implied in our definition of attitudes is the notion that atti- 
tudes are stored memorial constructs. Some researchers have 
argued that attitudes may in fact not be stored in memory and 
instead be newly constructed, based upon salient beliefs, 
emotions, and behaviors each time the individual is asked to 
report his or her attitude (Schwarz & Bohner, 2001; Wilson & 
Hodges, 1992). This perspective seems rooted primarily in 
the finding that attitude reports are susceptible to a variety of 
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contextual biases that can contaminate attitude reports (see 
Schwarz, 1999). 

Although attitude reports are clearly influenced by the im- 
mediate context, a strict constructivist view of attitudes seems 
implausible for a variety of reasons. First, as we review later in 
this chapter, research has demonstrated that individuals expe- 
rience aversive arousal when they violate their existing atti- 
tudes (e.g., Elliot & Devine, 1994; Elkin & Leippe, 1986; 
Losch & Cacioppo, 1990), and individuals are often motivated 
to defend their attitudes in the face of counterattitudinal 
appeals (e.g., Ditto & Lopez, 1992; Ditto, Scepansky, Munro, 
Apanovitch, & Lockhart, 1998; Edwards & Smith, 1996; 
Kunda, 1990; Petty & Cacioppo, 1979a). These findings are 
consistent with the view that some attitudinal representation 
exists in memory. Furthermore, research has delineated the 
conditions under which motivated defense versus attitude con- 
struction processes will operate (e.g., Fazio, Zanna, & Cooper, 
1977). Second, attitudes can be automatically activated under 
response conditions that would make spontaneous construc- 
tion seem unlikely (Bargh, Chaiken, Govender, & Pratto, 
1992; Bargh, Chaiken, Raymond, & Hymes, 1996; Fazio, 
Sanbonmatsu, Powell, & Kardes, 1986). Third, it would seem 
to be functionally maladaptive for individuals to store a lot of 
attitude-relevant beliefs for attitude reconstruction in the ab- 
sence of summary evaluative representations (see also Lingle 
& Ostrom, 1981). Fourth, research has uncovered structural 
properties of attitudes that can influence their persistence 
across a variety of contexts (see Petty & Krosnick, 1995). 

If there were no stored attitudes, and evaluations were 
simply constructed anew each time the attitude object was en- 
countered, many of the processes described in this chapter 
would have little theoretical utility. Instead, attitude change 
researchers would better spend their time focusing solely on 
context effects rather than procedures aimed at changing 
memorial evaluative representations. In our view, the strict 
constructivist approach does not seem prudent. In this 
chapter, attitudes are conceptualized as stored memorial con- 
structs that may or may not be retrieved upon encountering 
the attitude object (see Fazio, 1990). 

In using this conceptualization, we do not mean to imply 
that attitudes are not susceptible to context effects or are 
never constructed from scratch. Most obviously, when indi- 
viduals do not have attitudes about a particular attitude ob- 
ject, they may simply construct an attitude when asked for 
one (Converse, 1970). Also, when individuals are instructed 
to think about their attitude before reporting it, they may 
sometimes selectively focus on a subset of attitude-relevant 
information and this salient information would influence 
the attitude reported (e.g., Wilson & Kraft, 1993). Similarly, 


individuals may report different attitudes when contextual 
variables like conversational norms or social desirability 
concerns operate (e.g., Fazio, Jackson, Dunton, & Williams, 
1995; Schwarz, 1999). However, the fact that contextual vari- 
ables can sometimes influence attitude reports is not tanta- 
mount to establishing that there are no stored evaluations for 
any attitude objects. Rather, attitude construction processes 
probably occur mostly when no stored evaluation is readily 
accessible or when contextual factors contribute to current at- 
titude reports by modifying or shading a retrieved global 
evaluation (Petty, Priester, & Wegener, 1994). 


Attitude Strength 


Although we define attitudes as relatively enduring con- 
structs (i.e., stored representations), attitudes can certainly 
change over time. Attitudes can change from being nonexis- 
tent to having some valence, or they can change from one 
valence to another. Most of this chapter focuses on the 
processes responsible for changes in attitudes. Polarization 
refers to instances in which an existing attitude maintains the 
same valence but becomes more extreme. Moderation refers 
to those instances in which an individual’s existing attitude 
becomes less extreme and moves toward the point of neutral- 
ity. One’s attitude can also cross the neutral point and change 
valence. 

Attitudes may be fruitfully conceptualized as falling along 
a continuum ranging from nonattitudes to strong attitudes 
(see Converse, 1970). Strong attitudes are those that influ- 
ence thought and behavior, are persistent over time, and are 
resistant to change (Krosnick & Petty, 1995). A large variety 
of strength indicators have been identified and studied empir- 
ically, including attitude accessibility (e.g., Bassili, 1995; 
Fazio, 1995), certainty (e.g., Gross, Holtz, & Miller, 1995), 
importance (Krosnick, 1988), and elaboration (Petty, 
Haugtvedt, & Smith, 1995; see Petty & Krosnick, 1995, for a 
review of attitude strength variables). Although it is intu- 
itively appealing to assume that attitude strength variables 
are manifestations of a single latent construct, intercorrela- 
tions among the various attitude strength variables are often 
somewhat low (e.g., Krosnick, Boninger, Chuang, Berent, & 
Carnot, 1993; Raden, 1985). Furthermore, the search for a 
limited number of underlying attitude strength factors has 
yielded inconclusive results so far (see Eagly & Chaiken, 
1998, for a review). Nevertheless, it seems reasonable that 
the many strength variables ultimately boil down to a rela- 
tively few critical dimensions that are most important for pro- 
ducing the major strength consequences (e.g., making the 
attitude resistant to change). 


Implicit Versus Explicit Attitudes 


Although most research on attitudes concerns people’s explicit 
likes and dislikes, in recent years a good deal of research inter- 
est has been generated by the idea of implicit attitudes. In an 
influential review of implicit attitude effects, Greenwald and 
Banaji (1995) referred to implicit attitudes as “introspectively 
unidentified (or inaccurately identified) traces of past experi- 
ence that mediate favorable or unfavorable feeling, thought, or 
action toward social objects” (p. 8). This definition suggests 
that people are unaware of some past experiences (implicit at- 
titudes) that mediate current responses. Wilson, Lindsey, and 
Schooler (2000) expanded this definition by suggesting that 
implicit attitudes are “evaluations that (a) have an unknown 
origin . . . (b) are activated automatically; and (c) influence im- 
plicit responses . . .” (p. 104). This definition suggests that 
people may be unaware of the origin of their past attitudes, al- 
though they may be aware of the attitudes themselves. Green- 
wald, McGhee, and Schwartz (1998) stated that “implicit 
attitudes are manifest as actions or judgments that are under 
the control of automatically activated evaluation without the 
performer’s awareness of that causation” (p. 1464). This defi- 
nition suggests that people are unaware of the effects of im- 
plicit attitudes. The implicit attitudes construct has been 
applied to a growing body of research and can have important 
implications for how researchers conceptualize attitude 
change. Although the various definitions of implicit attitudes 
have significant overlap, their application in practice is some- 
times characterized by substantial ambiguity. 

As the above definitions imply, one dimension on which 
implicit attitudes are thought to differ from explicit attitudes 
is awareness. That is, implicit attitudes are viewed as ones 
for which people are unaware of what the attitude is, where 
it comes from, or what effects it has. It is perhaps important 
to note that these types of awareness are not mutually exclu- 
sive. Any attitude can be characterized by all or none of 
these types of awareness. We discuss each of these features 
next. 


Awareness of the Attitude Itself 


The first type of awareness concerns an awareness of the 
attitude itself—that is, does the person consciously acknowl- 
edge that he or she holds an evaluative predisposition toward 
some person, object, or issue? If so, the attitude is said to be 
explicit. On the other hand, individuals sometimes have 
stored evaluative associations of which they are unaware. 
This type of awareness corresponds to the meaning of im- 
plicit as employed in other psychological research domains. 
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For example, in many demonstrations of implicit memory, an 
individual shows evidence of having memorized a piece of 
information, yet is unable to consciously retrieve the infor- 
mation when desired (see Schacter, 1987, for a review). Sim- 
ilarly, evidence for implicit learning is found when an 
individual acquires some knowledge or skill that is evidenced 
on task performance, but the individual is unable to verbalize 
the underlying rule or basis for the skill (see Seger, 1994, for 
a review). Thus, according to this criterion, to the extent that 
people have evaluative predispositions of which they are not 
consciously aware and are unable to consciously report when 
asked, these attitudes are said to be implicit. 


Awareness of the Basis of the Attitude 


Another type of awareness mentioned in some discussions of 
implicit attitudes concerns awareness of the basis of the atti- 
tude. If people are not aware of the attitude itself, it is un- 
likely that they would be aware of its basis Oe, where it 
comes from). However, people are often unaware of the basis 
of their explicit attitudes as well. For example, repeated sub- 
liminal exposures to a stimulus can increase liking of the 
stimulus (Bornstein & D’ Agostino, 1992) without aware- 
ness. Although the individual can explicitly report his or her 
preference for the previously seen stimulus, he or she has no 
access to the source of the liking (i.e., the previous subliminal 
exposures). Similarly, a consciously reported attitude (e.g., 
one’s life satisfaction) may be unknowingly biased by extra- 
neous inputs (e.g., the good weather; Schwarz & Clore, 
1983). Even if the source of an attitude seems quite explicit 
(e.g., exposure to a persuasive message), people may be un- 
aware that the message has influenced their attitudes. People 
sometimes recall having had their new attitude all along 
(Ross & McFarland, 1988). People can also think that their 
attitudes have changed when they have not. 

Thus, using awareness of an attitude’s basis or source as a 
defining criterion for implicit attitudes is problematic in part 
because individuals rarely (if ever) have complete access 
to all of the influences on their judgments (see Nisbett & 
Wilson, 1977; Wilson & Hodges, 1992). Therefore we do not 
think that this criterion is a useful one for distinguishing im- 
plicit from explicit attitudes. Stated simply, if an attitude is 
implicit, the basis may be unknown—but not knowing the 
basis of an attitude does not make it implicit. 


Awareness of the Attitude’s Influence 


A third type of awareness concerns awareness of the extent of 
an attitude’s influence on other judgments and behaviors. For 


356 Persuasion and Attitude Change 


example, Greenwald and Banaji (1995) indicated that halo 
effects are one example of the operation of implicit attitudes. 
Halo effects refer to instances in which information about 
one attribute influences judgments about other unrelated at- 
tributes. For example, Johnny may judge Sue to be intelligent 
because he believes her to be attractive. To the extent that 
Johnny is unaware that his conscious beliefs concerning her 
attractiveness influence his judgments of her intelligence, 
his attitude toward her attractiveness may be labeled implicit 
by this criterion (Greenwald & Banaji, 1995). This view is 
problematic, however. Individuals are unlikely to be aware 
of all of the consequences of their attitudes for judgment and 
behavior, and thus this criterion would render nearly every 
attitude implicit. Furthermore, whether the attitude was con- 
sidered implicit could vary from context to context (i.e., the 
person could be aware that a negative attitude was influenc- 
ing him or her in one situation but not in another). Conse- 
quently, this feature does not appear to be an optimal criterion 
for defining implicit attitudes. As with the previous criterion, 
if people are unaware of the attitude itself (i.e., the attitude is 
implicit) they are unlikely to be aware of the effects of the 
attitude. But not knowing the effects of an attitude does not 
make it implicit. 


Summary 


In considering the three types of awareness, it is awareness or 
acknowledgement of holding the attitude itself that is the dis- 
tinguishing feature of implicit versus explicit attitudes. Peo- 
ple are aware of holding their explicit attitudes; they are not 
aware of holding their implicit attitudes. Our use of the 
phrase acknowledging one’s attitude is not meant to imply 
that people like or are comfortable with their attitudes—only 
that they recognize that they have these attitudes. For 
example, a person might acknowledge some prejudice for or 
liking of cigarettes, but the same person might also wish that 
these attitudes could change. People tend to be happy with 
and want to defend their attitudes, but this is not always the 
case. In addition, an implicit attitude may enter conscious- 
ness in a variety of ways. For example, therapy may reveal 
hidden attitudes, or an experimenter may reveal such atti- 
tudes to participants in a study. The person’s own behavior 
(e.g., a slip of the tongue) may also provide a clue to an im- 
plicit attitude. When presented with such information, a per- 
son can acknowledge the implicit attitude, thereby making it 
explicit—or the person can deny having this reaction (i.e., the 
therapist is wrong), keeping it implicit. Regarding the other 
dimensions, we note that implicit attitudes generally have an 
implicit basis and have implicit effects, but these attributes 
per se do not make the attitudes implicit because explicit 


attitudes can also have an implicit basis and have implicit 
effects (see also Wegener & Petty, 1998). 


Measurement of Attitudes 


Researchers have developed a multitude of attitude measure- 
ment instruments (see Eagly & Chaiken, 1993; also see the 
chapter by Olson & Maio in this volume). Measurement of at- 
titudes is important for determining whether attitude change 
has occurred. A long-standing distinction between attitude 
measures has been drawn concerning whether the measure is 
a direct or an indirect one (Petty & Cacioppo, 1981). Direct 
attitude measures are those that simply ask the respondent to 
report his or her attitude. Because these measures are trans- 
parent and make it obvious that attitudes are being assessed, 
they can be considered explicit measures of attitudes. In- 
cluded in this category are attitude measurement devices such 
as the semantic differential (Osgood, Suci, & Tannenbaum, 
1957), the one-item rating scale, the Likert scale (Likert, 
1932), and the Thurstone scale (Thurstone, 1928). Indirect at- 
titude measures on the other hand are those that do not di- 
rectly ask the individual to report his or her attitude. Instead, 
the individual’s attitude is inferred from his or her judgments, 
reactions, or behaviors. Because these measures do not make 
it obvious that attitudes are being assessed, they can be con- 
sidered implicit measures of attitudes. A person completing 
an implicit measure is presumably unaware that the measure 
is assessing attitudes. Included in this category are a wide 
variety of methods such as the Thematic Apperception Test 
(Proshansky, 1943), the Information Error Test (Hammond, 
1948), the Implicit Association Test (IAT; Greenwald et al., 
1998), the automatic evaluation task (e.g., Fazio et al., 1995), 
physiological measures such as the facial electromyograph 
(EMG; e.g., Cacioppo & Petty, 1979a) or electroencephalo- 
gram (EEG; e.g., Cacioppo, Crites, Bernston, & Coles, 1993), 
and physical behaviors like nonverbal gestures, eye contact, 
or seating distance (e.g., Argyle & Dean, 1965; Dovidio, 
Kawakami, Johnson, Johnson, & Howard, 1997; Macrae, 
Bodenhausen, Milne, & Jetten, 1994; Word, Zanna, & 
Cooper, 1974). Direct and indirect measurement methods 
typically exhibit modest positive correlations (Dovidio, 
Kawakami, & Beach, 2000). 

It is important to note that direct and indirect measurement 
methods can differ in the extent to which they permit deliber- 
ative responding (Vargas, von Hippel, & Petty, 2001). For ex- 
ample, experimenters could require individuals to report their 
attitudes on a direct one-item rating scale very quickly with no 
time for deliberation, or they could permit individuals to make 
the judgment after some minimal or extensive reflection. Sim- 
ilarly, some indirect attitude measures permit relatively slow 


and deliberate responding (e.g., the Thematic Apperception 
Test; Information Error Test), whereas others require very fast 
responses (e.g., the IAT or automatic evaluation task). 

Researchers make two common assumptions about direct 
(explicit) and indirect (implicit) measures of attitudes, and 
we discuss each assumption in turn. 


What Do Implicit and Explicit Measures Assess? 


One assumption is that explicit attitude measures tap explicit 
attitudes, whereas implicit measures tap implicit attitudes 
(e.g., Dovidio et al., 2000; Greenwald & Banaji, 1995). This 
assumption is tidy but seems ill-advised for a number of rea- 
sons. First, even if it were the case that implicit attitudes 
could be assessed only with implicit measures, this would not 
mean that implicit measures assessed only implicit attitudes. 
In fact, implicit measuring devices have long been used to tap 
explicit attitudes that people were simply unwilling to report 
due to social desirability concerns, and such measures do tap 
explicit attitudes if there is no competing implicit attitude. 
For example, an attitude measure like eye contact or seating 
distance could tap primarily implicit attitudes to the extent 
that the individual is not aware that he or she holds that atti- 
tude. Hence, an individual may sit farther away from mem- 
bers of a stigmatized social category despite professing (and 
believing) that he or she harbors no animosity or dislike to- 
wards the group. However, behaviors like eye contact or seat- 
ing distance can often also be manifestations of quite explicit 
attitudes. One may sit closer to one’s spouse than to a com- 
plete stranger and also be quite aware that one prefers the 
company of one’s spouse. Contemporary measures of auto- 
matic responding (e.g., Fazio, 1995) also assess primarily ex- 
plicit attitudes if there is no competing implicit one. 

On the other hand, if there is a competing implicit attitude, 
measures of automatic evaluation might be used to assess it. 
Thus, discrepancies between nondeliberative implicit mea- 
sures and deliberative explicit measures can sometimes be 
attributed to social desirability contaminants (e.g., Fazio et al., 
1995; Greenwald et al., 1998), but they can also be due to 
competing implicit and explicit attitudes (Wilson, Lindsey, & 
Schooler, 2000). 

Second, it does not appear to be the case that implicit atti- 
tudes can only be assessed with implicit measures. This is be- 
cause implicit measures, like explicit ones, vary in the extent 
to which they allow controlled versus automatic responding 
(Vargas et al., 2001). For example, if a direct measure is 
administered quickly with little time for reflection, implicit 
attitudes might well influence responses (see also Wilson 
et al., 2000). Thus, if time pressure is high, a fast direct scale 
might assess a prior and now-rejected attitude because it was 
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more accessible than was the new attitude (Petty & Jarvis, 
1998). Perhaps a simple generalization that can be made is 
that explicit attitudes are most confidently assessed with 
deliberative direct attitude assessments. Of course, this state- 
ment rests on the assumption that self-presentational concerns 
or other biasing factors are not contaminating the attitude re- 
port. To the extent that such biasing factors (e.g., an unusually 
positive mood) are at work, the measure may tap the influence 
of the biasing agents rather than solely the underlying atti- 
tude. When direct attitude reports do not permit deliberative 
responding, however, the direct measure could tap either ex- 
plicit or implicit attitudes. 

Most of the time explicit and implicit measures should as- 
sess the same underlying attitude. It is in the interesting case 
in which the two types of assessments produce different out- 
comes that one might conclude that the implicit measure has 
tapped an implicit attitude. Of course, before one reaches this 
conclusion, it is important to rule out the possibility that the 
person is actually aware of the conflicting attitude but simply 
does not report it for purposes of self-presentation. 


What Do Implicit and Explicit Measures Predict? 


A second assumption is that explicit attitudes predict deliber- 
ative behaviors (e.g., jury voting), whereas implicit attitudes 
predict spontaneous behavior (e.g., seating distances; 
Dovidio et al., 1997). If implicit attitudes are always more 
accessible than are explicit attitudes, one might expect this to 
be the case (Dovidio, et al, 2000; Wilson et al., 2000). For 
example, Fazio (1990) suggested that highly accessible atti- 
tudes influence behavior when motivation and opportunity to 
evaluate the consequences of one’s actions are low, but that 
less accessible or newly constructed attitudes can influence 
behavior when motivation and opportunity are high. How- 
ever, the conclusion that implicit attitudes predict sponta- 
neous behavior whereas explicit attitudes predict deliberative 
behavior may be premature. Vargas et al. (2001) argued that 
this conclusion was reached because the prominent contem- 
porary implicit measures have relied on quick and sponta- 
neous reactions (e.g., speeded response task; Wilson et al., 
2000; automatic evaluation task; Fazio, 1995), whereas 
explicit measures have relied on deliberative responses. That 
is, the information-processing conditions of attitude mea- 
surement (spontaneous or deliberate) matched the informa- 
tion-processing conditions of behavioral assessment, and this 
assessment compatibility fostered higher correlations (Ajzen 
& Fishbein, 1977). To test this notion, Vargas et al. developed 
a deliberative implicit measure of attitudes and demonstrated 
that it could predict deliberative behavior over and above a 
series of deliberative explicit attitude measures. Although not 
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demonstrated yet, it presumably would be the case that a 
spontaneous explicit measure could predict spontaneous be- 
havior above and beyond that predicted by a spontaneous im- 
plicit measure. To the extent that these effects hold, it 
suggests that both dimensions of attitudes (implicit-explicit, 
spontaneous-deliberate) are important to consider in predict- 
ing behavior. After discussing the major approaches to 
attitude change in the next section, we return to the implicit- 
explicit attitude distinction and discuss some implications of 
this distinction for understanding attitude change. 


ATTITUDE CHANGE: AN OVERVIEW 


Now that we have examined some important conceptual 
issues surrounding the attitude concept, we turn to a discus- 
sion of attitude change processes. In the remainder of this 
chapter we describe the fundamental processes of attitude 
change that have been proposed by social psychologists in 
the modern era. The study of attitude change is one of the old- 
est in social psychology, and so many different theories and 
effects have been uncovered over the past 50 years that it can 
be challenging to understand them all. 

The focus of theories of attitude change to date has been 
on producing and changing explicit attitudes. That is, an atti- 
tude change technique is deemed effective to the extent that it 
modifies a person’s self-report of attitudes. For example, if a 
person is neutral toward an abstract symbol prior to the 
change treatment but is explicitly more favorable afterward, 
attitude change was successful. Although some recent re- 
search has demonstrated that attitude change can be produced 
on implicit attitude measures (Dasgupta & Greenwald, 
2001), these change techniques probably also introduced 
change that could have been measured with explicit measures 
(see also Olson & Fazio, 2001). To date, there are no persua- 
sion techniques that have proven to be effective in changing 
implicit but not explicit attitudes; thus, our review focuses 
on changing explicit attitudes. The topic of implicit attitude 
change will likely occupy considerable research attention 
in the coming decade (e.g., Kawakami, Dovidio, Moll, 
Hermsen, & Russin, 2000). 

To organize the different theories of attitude change, we rely 
on the key ideas from contemporary dual process models of so- 
cial judgment (Chaiken & Trope, 1999). The two such models 
that are most popular for understanding attitude change are the 
elaboration likelihood model (ELM; Petty & Cacioppo, 1986) 
and the heuristic-systematic model (HSM; Chaiken, Liberman, 
& Eagly, 1989). These models provide a metaframework from 
which to understand the moderation and mediation of attitude 
change effects, and they explain how the same variable (e.g., 
source credibility, mood) can have different effects on attitude 


change in different situations (e.g., increasing attitude change 
in one situation but decreasing it in another) and produce the 
same effect by different processes in different situations. Per- 
haps the key idea in the dual process models is that some 
processes of attitude change require relatively high amounts of 
mental effort, whereas other processes of attitude change re- 
quire relatively little mental effort. Thus, Petty and Cacioppo 
(1981) reasoned that most of the major theories of attitude 
change were not necessarily competitive or contradictory, but 
rather operative in different circumstances. Later in this chap- 
ter we use this notion to organize the major processes of per- 
suasion. Although the ELM and HSM stem from somewhat 
different traditions, today the models have many similarities 
and can generally accommodate the same empirical results, al- 
though the explanatory language and sometimes the assumed 
mediating processes vary (Eagly & Chaiken, 1993; Petty & 
Wegener, 1998). 

Contemporary persuasion theorists endorse the funda- 
mental dual process notion that different processes lead to 
attitude change in different circumstances (cf., Kruglanski & 
Thompson, 1999). Some of these processes require diligent 
and effortful information-processing activity, whereas others 
proceed with relatively little mental effort. In this section, we 
first describe the elaboration likelihood model of persuasion 
and review some prominent factors that determine whether 
people exert high or low amounts of mental effort in a per- 
suasion situation (the HSM points to similar factors). Next, 
we describe in more detail the persuasion processes that tend 
to require relatively low amounts of mental effort. Following 
this, we describe the persuasion processes that tend to require 
relatively high amounts of mental effort. 


The Elaboration Likelihood Model of Persuasion 


The elaboration likelihood model of persuasion (ELM; 
Petty & Cacioppo, 1981, 1986; Petty & Wegener, 1999) is a 
theory about the processes responsible for attitude change 
and the strength of the attitudes that result from those 
processes. A key construct in the ELM is the elaboration like- 
lihood continuum. This continuum is defined by how moti- 
vated and able people are to assess the central merits of an 
issue or a position. The more motivated and able people are 
to assess the central merits of an issue or position, the more 
likely they are to effortfully scrutinize all available issue-rel- 
evant information. Thus, when the elaboration likelihood is 
high, people assess issue-relevant information in relation to 
knowledge that they already possess, and they arrive at a rea- 
soned (although not necessarily unbiased) attitude that is well 
articulated and bolstered by supporting information (central 
route). When the elaboration likelihood is low, however, then 
information scrutiny is reduced and attitude change can result 


from a number of less resource-demanding processes that do 
not require as much effortful evaluation of the issue-relevant 
information (peripheral route). Attitudes that are changed by 
low-effort processes are postulated to be weaker than are at- 
titudes that are changed the same amount by high-effort 
processes (see prior discussion of attitude strength). 

The elaboration likelihood continuum incorporates both a 
quantitative and a qualitative distinction (see Petty, 1997; 
Petty, Wheeler, & Bizer, 1999). That is, as one goes higher on 
the elaboration continuum, central route processes increase 
in magnitude (cognitive effort increases), and as one goes 
down the continuum, central route processes diminish in 
magnitude (cognitive effort decreases). This quantitative 
variation suggests that at high levels of elaboration, people’s 
attitudes are determined by their effortful examination of all 
relevant information, but at lower levels of elaboration, atti- 
tudes can be determined by effortful examination of less in- 
formation (e.g., the person critically examines only the first 
argument in a message but not the remaining arguments), or 
less effortful examination of all of the information. In 
addition, however, the ELM incorporates a qualitative 
distinction—that is, the ELM holds that not all change 
processes are the same. For example, consider a person who 
is exposed to a message with 10 arguments. The high elabo- 
ration (central route) processor tends to think carefully about 
much or all of the information. If motivation or ability to 
think were reduced, the recipient might think about each 
argument less carefully or think about fewer arguments 
(quantitative difference). However, the ELM holds that when 
the elaboration likelihood is low, people might also process 
the arguments in a qualitatively different way. For example, 
rather than assessing the substantive merits of the arguments, 
they might simply count them and reason, “there are so many 
arguments, it must be good” (Petty & Cacioppo, 1984). In 
the section of this chapter entitled “Relatively Low-Effort 
Processes of Attitude Change,” we describe a variety of rela- 
tively low-effort processes that can modify attitudes. 

In addition to the elaboration continuum and the various 
processes that operate along it, two other ELM notions are 
worth explaining. The first is that the ELM postulates a trade- 
off between the impact of high- and low-effort processes on 
judgments along the elaboration continuum—that is, as the 
impact of high-effort processes on judgments increases, the 
impact of low-effort processes on judgments decreases. This 
trade-off hypothesis implies a number of things. First is 
that at most points along the continuum, various change 
processes can co-occur and jointly influence judgments. 
Second, however, is that movement in either direction along 
the continuum tends to enhance the relative impact of one 
or the other family of processes (e.g., effortful scrutiny for 
merit vs. reliance on a counting heuristic) on judgments. 
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Another important ELM notion is called the multiple roles 
hypothesis; this is the idea that any given variable can influ- 
ence attitudes by different processes at different points along 
the elaboration continuum. For example, if a pleasant televi- 
sion show makes you feel happy, this happiness might make 
you develop a positive attitude toward the products featured 
in the commercials shown during the show. The mechanism 
by which this happens can vary, however, depending on the 
overall elaboration likelihood. When the elaboration likeli- 
hood is low (e.g., high distraction), happiness could affect 
judgments by serving as a simple associative cue (e.g., if 
I feel good, I must like it). On the other hand, if the elabora- 
tion likelihood is high, happiness could affect judgments by 
biasing the thoughts that come to mind (Petty, Schumann, 
Richman, & Strathman, 1993). If the elaboration likelihood is 
not constrained to be high or low, being happy can affect the 
extent of processing of the message arguments. In particular, 
if the message is counterattitudinal or unpleasant in some 
way, being happy reduces message processing (Bless, 
Bohner, Schwarz, & Strack, 1990). If the message is uplifting 
and pleasant, however, happiness can increase message pro- 
cessing over neutrality (Wegener, Petty, & Smith, 1995). 
Other variables can also play different roles depending on the 
overall elaboration likelihood. 


Determinants and Dimensions of Elaboration 


According to the ELM, in order for high-effort processes to 
influence attitudes, people must be both motivated to think 
(i.e., have the desire to exert a high level of mental effort) and 
have the ability to think (i.e., have the necessary skills and 
opportunity to engage in thought). There are many variables 
capable of affecting the elaboration likelihood and thereby 
influencing whether attitude change is likely to occur by the 
high- or low-effort processes we describe in more detail 
shortly. Some of these motivational and ability variables are 
part of the persuasion situation, whereas others are part of the 
individual. Some variables affect mostly the amount of infor- 
mation processing activity, whereas others tend to influence 
the direction or valence of the thinking. 

One of the most important variables influencing a person’s 
motivation to think is the perceived personal relevance or 
importance of the communication (Johnson & Eagly, 1989; 
Petty & Cacioppo, 1979b, 1990; Petty, Cacioppo, & 
Haugtvedt, 1992; Thomsen, Borgida, & Lavine, 1995). When 
personal relevance is high, people are more influenced by 
the substantive arguments in a message and are less affected by 
peripheral processes (e.g., Petty, Cacioppo, & Goldman, 
1981). There are many ways to render a message self-relevant, 
such as including many first-person pronouns (Burnkrant & 
Unnava, 1989) or matching the message in some way to a 
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person’s self-conceptions (Petty, Wheeler, & Bizer, 2000). In 
addition, people are more motivated to scrutinize information 
when they believe that they are solely responsible for message 
evaluation (Petty, Harkins, & Williams, 1980), when they are 
individually accountable (Tetlock, 1983), when they recently 
have been deprived of control (Pittman, 1994), and when they 
expect to discuss the issue with a partner (Chaiken, 1980). 
Increasing the number of message sources can enhance 
information-processing activity (e.g., Harkins & Petty, 1981; 
Moore & Reardon, 1987), especially when the sources are 
viewed as providing independent assessments of the issue 
(Harkins & Petty, 1987). Various incongruities can increase 
information-processing activity, such as when an expert source 
presents surprisingly weak arguments (Maheshwaran & 
Chaiken, 1991), when the message does not present the 
information in a form that was expected (S. M. Smith & Petty, 
1996), and when people feel ambivalent rather than certain 
about some issue (Maio, Bell, & Esses, 1996). 

In addition to factors associated with the persuasive mes- 
sage or the persuasion context, there are individual differ- 
ences in people’s motivation to think about persuasive 
communications. For example, people who enjoy thinking 
(i.e., those high in need for cognition; Cacioppo & Petty, 
1982) tend to form attitudes on the basis of the quality of the 
arguments in a message rather than on peripheral cues (see 
Cacioppo, Petty, & Morris, 1983). Factors associated with the 
attitude itself can also influence the extent of information pro- 
cessing. For example, people tend to think more about mes- 
sages relevant to their accessible attitudes rather than to their 
relatively inaccessible attitudes (Fabrigar, Priester, Petty, & 
Wegener, 1998). 

Among the important variables influencing a person’s abil- 
ity to process issue-relevant arguments is message repetition. 
Moderate message repetition provides more opportunities for 
argument scrutiny (e.g., Cacioppo & Petty, 1979b; Gorn & 
Goldberg, 1980), which is beneficial for processing as long as 
tedium is not induced (Cacioppo & Petty, 1989; Cox & Cox, 
1988). External distractions (e.g., Petty, Wells, & Brock, 
1976), fast presentations (S. M. Smith & Shaffer, 1991) exter- 
nal pacing of messages (such as those on radio or TV rather 
than in print; Chaiken & Eagly, 1976; Wright, 1981), time 
pressures on processing (e.g., Kruglanski & Freund, 1983), 
enhancing recipients’ physiological arousal via exercise (e.g., 
Sanbonmatsu & Kardes, 1988), placing recipients in an 
uncomfortable posture (Petty, Wells, Heesacker, Brock, & 
Cacioppo, 1983), and rendering the message difficult to 
understand (e.g., Ratneshwar & Chaiken, 1991) all decrease 
substantive message processing and should increase the im- 
pact of peripheral processes. Interestingly, even though a 
number of studies have examined differences in the actual 
ability of recipients to process a persuasion message, little 


work has examined differences in perceived ability to 
process. For example, a message that appears technical or 
overly quantitative (Yalch & Elmore-Yalch, 1984) may re- 
duce processing not because it interferes with actual ability, 
but rather because it interferes with a person’s perceived abil- 
ity to process (e.g., it’s probably too complicated for me, so 
why bother). 

Individual differences also exist in the ability of people to 
think about a persuasive communication. For example, as 
general knowledge about a topic increases, people can be- 
come more able (and perhaps more motivated) to think about 
issue-relevant information (Wood, Rhodes, & Biek, 1995). 
Knowledge is only effective to the extent that it is accessible, 
however (e.g., Brucks, Armstrong, & Goldberg, 1988). When 
knowledge is low or inaccessible, people are more reliant on 
simple cues (e.g., Wood & Kallgren, 1988). 

Of course, in most communication settings, a confluence 
of factors rather than one variable acting in isolation deter- 
mines the nature of information processing. Although the ef- 
fects of single variables on information processing have been 
studied extensively, there is relatively little work examining 
possible interactions among variables (cf. Petty, Cacioppo, & 
Heesacker, 1981). 


Relatively Objective Versus Biased 
Information Processing 


The variables we have discussed, such as distraction or need 
for cognition, tend to influence information-processing activ- 
ity in a relatively objective manner—that is, all else being 
equal, distraction tends to disrupt whatever thoughts a person 
is thinking (Petty et al., 1976). The distraction per se does not 
specifically target one type of thought (e.g., favorable or un- 
favorable) to impede. Similarly, individuals with high need 
for cognition are more motivated to think in general than are 
people low in need for cognition (Cacioppo, Petty, Feinstein, 
& Jarvis, 1996). They are not more motivated to think certain 
kinds of thoughts over others. Some variables, however, are 
selective in their effects on thinking. For example, when peo- 
ple are highly motivated to think, a positive mood tends to 
encourage positive thoughts, discourage negative thoughts, 
or both (Petty et al., 1993), and expert sources tend to 
encourage favorable rather than unfavorable interpretations 
of message arguments (Chaiken & Maheswaran, 1994). 

The ELM accommodates both relatively objective and rel- 
atively biased information processing by pointing to the mo- 
tivational and ability factors involved. The ELM assumes 
that motivation is relatively objective when no a priori judg- 
ment is preferred and a person’s implicit or explicit goal is to 
seek the truth, wherever it might lead (Petty & Cacioppo, 
1986). In contrast, a motivated bias can occur whenever 


people implicitly or explicitly prefer one judgment over an- 
other (see also Kruglanski, 1990). A wide variety of motiva- 
tions can determine which particular judgment is preferred in 
any given situation. For example, if the reactance motive 
(Brehm, 1966) is aroused, people prefer to hold whatever 
judgment is forbidden. If balance motives (Heider, 1958) are 
operating, people prefer to adopt the position of a liked 
source but distance themselves from a disliked source. If 
impression management motives (Tedeschi, Schlenker, & 
Bonoma, 1971) are operating, people prefer to hold whatever 
position they think would be ingratiating. Importantly, many 
of these biasing motives could have an impact on judgments 
by either the central or the peripheral route. For example, in- 
vocation of reactance could lead to simple rejection of the 
forbidden position without much thought or through active 
counterarguing of the position. 

The ELM holds that biased processing can occur even if no 
specific judgment is preferred (i.e., if based on motivational 
factors alone, processing would be relatively objective)—this 
is because ability factors can also introduce bias. For exam- 
ple, some people might simply possess a biased store of 
knowledge compared to other people. If so, their ability to 
process the message objectively can be compromised. That is, 
recipients with a biased store of knowledge might be better 
able to see the flaws in opposition arguments and the merits in 
their own side compared to recipients with a more balanced 
store of knowledge (cf. Lord, Ross, & Lepper, 1979). In addi- 
tion, variables in the persuasion situation can bias retrieval of 
information even if what is stored is completely balanced and 
no motivational biases are operating. For example, a positive 
mood can increase access to positive material in memory 
(e.g., Bower, 1981). In general, biases in processing a persua- 
sive message are fostered when the message contains infor- 
mation that is ambiguous or mixed rather than clearly strong 
or weak (Chaiken & Maheswaran, 1994). 

Finally, just because some motivational or ability factor 
results in biased information processing, this does not mean 
that a biased judgment will result because people sometimes 
attempt to correct for factors they believe might have unduly 
biased their evaluations (e.g., Martin, Seta, & Crelia, 1990; 
Petty & Wegener, 1993; Wilson & Brekke, 1994). The avail- 
able research suggests that corrections can proceed in differ- 
ent directions depending on recipients’ theories of how the 
biasing event or stimulus (e.g., an attractive source) was 
likely to have influenced their views. According to the flexi- 
ble correction model (Petty & Wegener, 1993; Wegener & 
Petty, 1997), in order for corrections to occur, people should 
(a) be motivated and able to identify potentially biasing fac- 
tors, (b) possess or generate a naive theory about the magni- 
tude and direction of the bias, and (c) be motivated and able 
to make the theory-based correction. 
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Assessing Information Processing 


Persuasion researchers have identified a number of ways to 
assess the extent to which persuasion is based on effortful 
consideration of information. Perhaps the most popular 
procedure to assess the amount of objective information pro- 
cessing that takes place has been to vary the quality of the 
arguments contained in a message and examine the size of the 
argument quality effect on attitudes and valenced thoughts 
(e.g., Petty et al., 1976). Greater argument quality effects sug- 
gest greater objective scrutiny. Because strong arguments 
elicit more favorable thoughts and become more persuasive 
with thought, but weak arguments elicit more unfavorable 
thoughts and become less persuasive with thought, thinking 
enhances the argument quality effect on attitudes and va- 
lenced thoughts. If the message processing is biased, how- 
ever, the size of the argument quality effect on these variables 
can be attenuated over what it is with objective processing 
(Nienhuis, Manstead, & Spears, 2001; Petty & Cacioppo, 
1986); this is because when engaged in biased processing, 
people may fail to appreciate the merits or demerits of the ar- 
guments (e.g., seeing strengths in even weak arguments and 
finding some flaws in strong ones). 

When biased processing is an issue, there are other means 
to gauge the extent of thinking. In particular, one can assess 
the mere number of issue-relevant thoughts generated (Petty, 
Ostrom, & Brock, 1981). High elaboration conditions are 
associated with more thoughts (e.g., Burnkrant & Howard, 
1984). Also, correlations between message-relevant thoughts 
and postmessage attitudes tend to be greater when argument 
scrutiny is high (e.g., Chaiken, 1980; Petty & Cacioppo, 
1979b), although other variables, such as the confidence peo- 
ple have in their thoughts, can affect this correlation (Petty, 
Brifiol, & Tormala, 2002). Finally, high message elaboration 
can produce reading or exposure times longer than more cur- 
sory analyses (Mackie & Worth, 1989), although longer read- 
ing times might also reflect daydreaming rather than careful 
message scrutiny (see Wegener, Downing, Krosnick, & Petty, 
1995, for a discussion of these measures). 


RELATIVELY LOW-EFFORT PROCESSES 
OF ATTITUDE CHANGE 


We have now seen that a multitude of variables can determine 
whether the attitude change context is likely to be one of 
relatively high or low cognitive effort. First we focus on the spe- 
cific low-effort processes that can determine whether attitudes 
will change, and then we turn to high-effort processes. 

The low-effort mechanisms of attitude change vary in the 
extent to which they require conscious processing, ranging 
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from those relying on automatic associations to those posit- 
ing simple inferences. Thus, some peripheral processes re- 
quire somewhat more cognitive effort than do others (Petty & 
Cacioppo, 1986). Nevertheless, these processes have in com- 
mon the fact that none of them requires extensive and effort- 
ful scrutiny of the central merits of the attitudinal advocacy 
or position. 


Associative Processes 


Some low-effort attitude change processes are associative in 
nature—that is, attitudes are often impacted by associations 
that develop between attitude objects and positive or negative 
stimuli (i.e., objects and feelings), or even by observations of 
those associations. Examples of these processes include clas- 
sical conditioning, affective priming, mere exposure, and 
balance. 


Classical Conditioning 


One way to produce attitude change in the absence of effortful 
scrutiny is to associate an attitude object that is initially neutral 
(e.g., anew product) with stimuli that already have positive or 
negative meaning. Considerable research has demonstrated 
that when an initially neutral stimulus immediately precedes 
another stimulus that already has positive or negative associa- 
tions, the neutral stimulus can come to be positively or nega- 
tively evaluated itself. For example, attitudes toward words 
(e.g., Staats & Staats, 1958), people (e.g., Griffitt, 1970), and 
products (e.g., Gresham & Shimp, 1985) have been influenced 
by their association with pleasant or unpleasant odors, tem- 
peratures, sounds, shock, photographs, and so on (e.g., 
Gouaux, 1971; Staats, Staats, & Crawford, 1962; Zanna, 
Kiesler, & Pilkonis, 1970). Furthermore, attitudes have been 
shown to be influenced by the contraction of certain muscles 
associated with positive and negative experiences (e.g., 
Cacioppo, Priester, & Bernston, 1993; Priester, Cacioppo, & 
Petty, 1996; Strack, Martin, & Stepper, 1988). Consistent with 
the classification of classical conditioning as a low-effort 
process, conditioning effects have been found to be particu- 
larly likely when effortful processing is at a minimum (Field, 
2000). Specifically, these effects are enhanced when the stim- 
uli are presented subliminally (DeHouwer, Baeyens, & Eelen, 
1994) and when the stimuli have no a priori meaning attached 
to them (Cacioppo, Marshall-Goodell, Tassinary, & Petty, 
1992; Priester et al., 1996; Shimp, Stuart, & Engle, 1991). 


Affective Priming 


Another process that relies on associations between stimuli is 
affective priming. In this method, also known as backward 


conditioning, presentation of positively or negatively va- 
lenced stimuli immediately precedes rather than follows pre- 
sentation of target stimuli. These presentations have been 
found to influence evaluations of the target stimuli. For 
example, Krosnick, Betz, Jussim, and Lynn (1992) found that 
subliminal presentation of positive or negative pictures 
(e.g., smiling people vs. snakes) made subsequent evalua- 
tions of target individuals more favorable or less favorable, 
respectively. Consistent with classification of this change 
mechanism as a low effort process, these effects have been 
found to be unaffected by cognitive load (e.g., Hermans, 
Crombez, & Eelen, 2000) and more likely to occur when 
the initial affective stimuli can be processed only mini- 
mally (DeHouwer, Hermans, & Eelen, 1998) or not at all 
(e.g., when they have been presented subliminally; Murphy, 
Monahan, & Zajonc, 1995; Murphy & Zajonc, 1993). 


Mere Exposure 


Research has also shown that the mere repeated exposure of 
an object can make one’s attitude toward that object more 
favorable even if one does not recognize the object as having 
been encountered previously (Zajonc, 1968). Kunst-Wilson 
and Zajonc (1980), for instance, repeatedly presented partic- 
ipants with a series of polygon images and found that even 
though participants could not recognize which images they 
had seen before and which they had not, they expressed 
significantly greater preferences for those they had seen. 
Additionally, mere exposure effects also occur in patients 
suffering from Alzheimer’s disease (Winograd, Goldstein, 
Monarch, Peluso, & Goldman, 1999). Some researchers have 
argued that even when a stimulus cannot be consciously iden- 
tified as having been encountered, its previous exposure 
might make it easier to process. This could create a kind of 
perceptual fluency (Bornstein, 1989; Jacoby, Kelley, Brown, 
& Jasechko, 1989) that becomes attached to the stimulus or 
confused with a positive evaluation of the stimulus. This 
process only occurs, however, to the extent that the feeling of 
familiarity is not directly attributed to the repeated exposure. 
If people attribute the experience of familiarity to the re- 
peated exposure of a stimulus, the mere exposure effect is 
attenuated (Bornstein & D’ Agostino, 1994). Moreover, as 
with other low-effort processes, the influence of mere expo- 
sure on attitudes appears to be increased when the repeated 
object is low in meaning (see Bornstein, 1989, for a review) 
or presented subliminally (Bornstein & D’ Agostino, 1992), 
thus reducing or eliminating conscious processing. Similarly, 
the effect appears to be decreased as conscious processing in- 
creases, such as when evaluation apprehension is induced 
(Kruglanski, Freund, & Bar-Tal, 1996). When meaningful 


stimuli are presented (e.g., familiar words or persuasive mes- 
sages), repeated exposure has been found to accentuate the 
dominant reaction (e.g., Cacioppo & Petty, 1989; Brickman, 
Redfield, Harrison, & Crandall, 1972) regardless of whether 
the reaction is positive or negative. With meaningful stimuli, 
deliberative analyses can enhance the dominant response, at 
least until tedium sets in. 


Balance 


According to balance theory (Heider, 1958), certain cognitive 
states are associated with pleasantness, whereas other states 
are associated with unpleasantness. More specifically, bal- 
ance (harmony) within the elements of an attitudinal system 
exists when people agree with others they like (or with whom 
they are closely associated) and disagree with others they dis- 
like (or with whom they are dissociated). Because imbalance 
is an uncomfortable state (Heider, 1958), people should seek 
to eliminate it as quickly and easily as possible. In many 
cases, the easiest way to restore balance is to alter one’s eval- 
uation of one of the elements in the attitude system 
(Rosenberg & Abelson, 1960; see also Visser, 1994). Unlike 
the effortful restoration of cognitive consistency associated 
with dissonance reduction (Festinger, 1957; see subsequent 
discussion), the alteration of evaluations need not be effortful 
according to balance theory. In addition to the general prefer- 
ence for balanced relationships among people, objects, and 
attitudes, research has also shown that people prefer positiv- 
ity in these relationships (Miller & Norman, 1976). Impor- 
tantly, the changes people make to ensure balance and 
positivity do not require thoughtful consideration of the cen- 
tral merits of the attitude objects in the system (see Insko, 
1984; Newcomb, 1968, for further discussion). 


Inference-Based Processes 


Low-effort attitude change processes can also be more infer- 
ential in nature rather than a result of the operation of affec- 
tive or associative processes. In other words, people 
sometimes base attitudes on simple inferences that do not re- 
quire considerable cognitive processing. The use of balance 
principles can be considered inferential if people reason that 
they will feel better if they adopt the attitude of a liked other. 
Two other inferential rules are to infer one’s attitude from 
one’s own behavior and to rely on simple heuristics, or deci- 
sion rules, that circumvent effortful scrutiny of information. 


Attribution 


At a general level, attribution theory addresses the inferences 
people make about themselves and others after witnessing 
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behaviors and the situational constraints surrounding those 
behaviors (e.g., Bem, 1965; Jones & Davis, 1965). In some 
cases, these inferences involve attitudes, such as when indi- 
viduals infer their own or someone else’s attitudes on the 
basis of their behavior with respect to some attitude object 
(e.g., if a person donates money to a candidate, it is reason- 
able to infer that that person favors the candidate). Although 
some attributional processes require effortful cognitive activ- 
ity (see Gilbert, 1998, for a review), others result in relatively 
quick and simple inferences (e.g., inferring that you like a 
certain TV program because you smile when you watch it). 

According to Bem’s (1965, 1972) self-perception theory, 
when people are not attuned to their internal states, they can 
infer their own attitudes from their behaviors just as they might 
do when inferring the attitudes of others. Self-perception may 
be more likely to operate under relatively low-effort condi- 
tions. For example, Taylor (1975) conducted a study in which 
women evaluated the photographs of men under high or low 
personal relevance conditions. Participants also received false 
physiological feedback about their responses toward some of 
the men (see Valins, 1966). Taylor found that the women in- 
ferred attitudes from their ostensible physiological reactions to 
a greater extent when personal relevance was low than when it 
was high (see also Chaiken & Baldwin, 1981; Wood, 1982). 
This implies that self-perception processes are more likely to 
operate when the likelihood of thinking about the attitude ob- 
ject is relatively low rather than high. 

Attribution theory has also contributed to attitude change 
research in other ways. In one application called the overjus- 
tification effect, people come to devalue previously enjoyed 
activities (e.g., running) when they are given overly sufficient 
rewards for engaging in them (e.g., Lepper, Greene, & 
Nisbett, 1973). If someone is given an extrinsic reward for 
promoting a proattitudinal advocacy, for instance, their 
attitude may become less favorable to the extent that they 
view their behavior as stemming from the reward rather than 
from the merits of the position they are endorsing (e.g., Scott 
& Yalch, 1978). Furthermore, attribution theory has shed 
light on the processes by which inferences about a message 
source impact attitudes. For example, Eagly, Chaiken, and 
Wood (1981) argued that when people are exposed to a per- 
suasive communication, their expectancies regarding the 
source of the communication have an important impact on 
their acceptance of that source’s position. If the communica- 
tor advocates a position that violates his or her own self-in- 
terest, he or she is perceived as more trustworthy and the 
position as more valid. If the communicator takes a position 
consistent with self-interest, however, he or she is perceived 
as less trustworthy and the position as less valid. When the 
position is viewed as valid, it can be accepted with relatively 
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little scrutiny. However, when the position is seen as possibly 
invalid, effortful scrutiny of the information is increased 
(Priester & Petty, 1995). We find it interesting that when a 
source takes a position that violates his or her group’s interest 
(rather than individual self-interest) the source is not seen as 
more trustworthy—perhaps because violating group interest 
is seen as disloyal. As a result, when an individual advocates 
a position that violates group interest, people are surprised 
and this leads to enhanced message scrutiny (Petty, Fleming, 
Priester, & Feinstein, 2001). 


Heuristics 


The heuristic-systematic model of persuasion (HSM; 
Chaiken et al., 1989) suggests that when people are engaged 
in relatively little information-processing activity, they typi- 
cally evaluate persuasive information in terms of stored 
heuristics, or simple decision rules based on prior experiences 
or observations. One such heuristic might be that length im- 
plies strength. In several studies (e.g., Chaiken, 1987; Petty & 
Cacioppo, 1984) it has been found that people are more per- 
suaded by messages containing large numbers of examples or 
arguments, but only when recipients of such messages are rel- 
atively unmotivated to engage in extensive thought (e.g., low 
need for cognition or low personal relevance). Similarly, 
some people might have stored heuristics pertaining to source 
credibility, such as experts are usually correct, and use of 
these heuristics is especially potent when personal relevance 
is relatively low (e.g., Petty, Cacioppo, & Goldman, 1981), 
distraction is high (e.g., Kiesler & Mathog, 1968), or elabora- 
tion likelihood is low for some other reason (see Andreoli & 
Worchel, 1978; Wood & Kallgren, 1988). 

As noted earlier, according to dual process models, source 
expertise, like other variables, can serve in other roles when 
the elaboration likelihood is moderate or high—such as af- 
fecting the extent of processing or biasing argument process- 
ing. If source expertise takes on other roles, its impact under 
high elaboration conditions can be equivalent to or even ex- 
ceed its impact under low elaboration conditions (Chaiken & 
Maheswaran, 1994; Kruglanski & Thompson, 1999; Petty, 
1994). 

A variety of additional variables have been shown to oper- 
ate as cues when the elaboration likelihood is low—such as 
source attractiveness (e.g., Chaiken, 1980) and speed of 
speech (e.g., S. M. Smith & Shaffer, 1995). These variables 
also can serve in other roles as the likelihood of elaboration is 
increased (see Petty & Wegener, 1998, for a review). Chaiken 
et al. (1989) propose that the use of heuristics depends on 
their availability (i.e., the heuristic must be stored in mem- 
ory), accessibility (i.e., it must be activated from memory), 


and applicability to the judgment at hand (see Chaiken, 
Wood, & Eagly, 1996). Although this proposition is intrigu- 
ing, little research has been conducted examining these as- 
pects of heuristics (but see Chaiken & Eagly, 1983). Thus, the 
operation of some variables that have been attributed to 
heuristics under low elaboration conditions (e.g., a person’s 
mood state; Schwarz, 1990; Schwarz & Clore, 1983) might 
instead have impact on attitudes through some other periph- 
eral process (e.g., classical conditioning). Nevertheless, the 
heuristic concept has been very useful and has sparked a 
great deal of persuasion research. 


RELATIVELY HIGH-EFFORT PROCESSES 
OF ATTITUDE CHANGE 


In addition to the low-effort attitude change mechanisms de- 
scribed previously, attitudes can also be formed and changed 
through relatively high-effort processes. According to dual- 
process formulations, these high-effort processes tend to in- 
fluence persuasive outcomes when motivation and ability to 
think are relatively high, such as when the issue is of high 
personal relevance, when people are accountable for their 
judgments, when they have high knowledge on the topic, 
when few distractions are present, and so forth. 


Message Learning and Reception 


Early information-processing theories of attitude change held 
that persuasion was contingent upon a sequence of stages, in- 
cluding attention, comprehension, learning, acceptance, and 
retention of the information in a persuasive communication 
(Hovland, Janis, & Kelley, 1953). Thus, a given persuasive 
appeal would be successful to the extent that the message and 
its conclusion were processed, understood, accepted, and 
later recalled. McGuire (1968) later modified this model 
and focused on two core processes—reception and yielding. 
According to McGuire, variables could influence persuasive 
outcomes by affecting either of these processes, and variables 
might affect each process in different ways. For example, in- 
creasing intelligence might increase the likelihood of recep- 
tion but decrease the likelihood of yielding. Although some 
research has examined the role of literal comprehension or 
reception of a message in attitude change (Eagly, 1974), a 
majority of the research in this domain has addressed the 
reception-yielding hypothesis by assessing the relationship 
between attitude change and message recall. Despite the intu- 
itive appeal of the model, considerable research has demon- 
strated that attitudes and message recall are often weakly 
related at best (e.g., Anderson & Hubert, 1963; Watts & 
McGuire, 1964; see Eagly & Chaiken, 1993, for a review). 


A number of factors have been proposed to account for the 
relatively low correlation between attitude change and infor- 
mation recall. One argument, for example, has been that 
simple learning theories do not account for the fact that dif- 
ferent people form different evaluations of information con- 
tained in persuasive messages—that is, although one person 
may be convinced by an argument, someone else might find it 
to be ludicrous (see Petty, Ostrom, et al., 1981). Yet both 
might be able to recall the argument. Attitude change has been 
found to correspond more closely with information recall 
when individuals’ unique assessments of the information 
recalled is accounted for (Chattopadhyay & Alba, 1988). 
Furthermore, attitudes have been found to correlate more 
strongly with learning and recall when people are not evaluat- 
ing information on-line at the time of exposure. For example, 
when processing is made difficult (e.g., Bargh & Thein, 1985; 
Bodenhausen & Lichtenstein, 1987), when people are given 
nonevaluative processing goals (e.g., Chartrand & Bargh, 
1996; Hastie & Park, 1986; Lichtenstein & Srull, 1987; 
Mackie & Asuncion, 1990), or when they are the type of 
people who do not spontaneously engage in evaluation (low 
in their need to evaluate; Tormala & Petty, 2001), the attitude- 
recall correlation is higher. Under these conditions, when peo- 
ple are asked to report their attitudes, they are forced to first 
retrieve what they can from memory, and then base their atti- 
tudes on the evaluative implications of this information. 


Cognitive Response Approach 


Following a series of early findings that attitude change and 
information recall were not consistently related, researchers 
developed the cognitive response approach to persuasion 
(e.g., Brock, 1967; Greenwald, 1968; Petty, Ostrom, et al., 
1981). According to this approach, attitudes and message 
argument recall are not always related because persuasion 
typically depends largely on an individual’s idiosyncratic 
thoughts in response to a persuasive message (i.e., thoughts 
about message arguments or other factors such as the tone, 
source, or context of the message). According to the cognitive 
response view, when exposed to a persuasive message, peo- 
ple reflect on the message with respect to their preexisting 
knowledge and prior attitude (if they have one), considering 
information not contained in the message itself. Three aspects 
of people’s cognitive responses have proven important. 


Extent of Thought 


First, investigators have explored determinants of the extent 
of issue-relevant thinking. As noted earlier in our discussion 
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of the elaboration likelihood model, a number of variables 
have been found to affect how much people are motivated 
(e.g., personal relevance) or able (e.g., distraction) to think 
about a persuasive communication. 


Content of Thought 


Another aspect of thinking that has garnered considerable re- 
search attention is the content of thought. Perhaps the most 
important dimension in this regard is the overall valence of 
the thinking that occurs. Researchers typically categorize 
thoughts as favorable, unfavorable, or neutral, and then com- 
pute an overall valence index (e.g., positive thoughts minus 
negative thoughts; see Mackie, 1987). According to the cog- 
nitive response approach, persuasion can be increased to the 
extent that the message elicits mostly favorable thoughts 
(e.g., If we raise taxes, the roads will improve and reduce my 
commute time) and few unfavorable thoughts (e.g., If we 
raise taxes, I'll have less money to go out to dinner). On the 
other hand, people can resist messages to the extent that they 
generate mostly unfavorable thoughts and few favorable 
thoughts. 

As noted earlier, people can be motivated to generate par- 
ticular thoughts by external variables such as their mood 
(Petty et al., 1993) or the message source (Chaiken & 
Maheswaran, 1994). In a series of studies on resistance to 
change, McGuire (1964) demonstrated that counterarguing 
of persuasive messages could be increased by giving people 
weak attacking messages prior to a stronger attack. The un- 
derlying logic of this inoculation approach to resistance is 
that a small dose of an attacking virus De, a weak challenge 
to the person’s attitude that is refuted) motivates the person to 
build up antibodies (i.e., counterarguments) that can be used 
against subsequent attacks. Even if an exact defense is not 
anticipated, people are presumably motivated by the inocula- 
tion treatment (1.e., the initial attack and refutation) to defend 
their attitudes by counterarguing opposition messages in the 
future. Recent research has shown that having people experi- 
ence a successful defense of their attitudes can produce 
greater confidence in the initial attitude. This enhanced confi- 
dence renders the attitude not only more resistant to change 
in the future, but also more predictive of future behavior 
(Tormala & Petty, in press). 


Confidence in Thoughts 


In addition to extent and content of thinking, recent research 
has uncovered a third aspect of thought that influences 
persuasion—the confidence people have in their own cognitive 
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responses. According to the self-validation hypothesis (Petty, 
Briñol, & Tormala, in press), people vary in the extent to which 
they have confidence in or doubt the validity of the thoughts 
that they generate to a persuasive message. Although thoughts 
in which people have confidence have a large impact on atti- 
tude change, thoughts in which people have low confidence do 
not. Thus, this research suggests that favorable thoughts in- 
crease persuasion primarily when people have confidence in 
them. Similarly, unfavorable thoughts decrease persuasion 
mostly when people have confidence in them. When confi- 
dence in thoughts is low, thoughts do not predict attitudes very 
well even under high elaboration conditions. 

Several variables have been shown to affect the confi- 
dence people have in their thoughts and thereby to influence 
the extent of attitude change. For example, in one study 
(Brifiol & Petty, 2001), people who were nodding their head 
in a yes (vertical) fashion while listening to a message re- 
ported more confidence in their thoughts than did people who 
were nodding their heads in a no (horizontal) fashion. As a re- 
sult, when processing a compelling message that elicited 
mostly favorable thoughts, people nodding yes were more 
persuaded than were people nodding no (see also Wells & 
Petty, 1980). However, when processing a specious message 
that elicited mostly unfavorable thoughts, people nodding 
yes were less persuaded than were people nodding no. In ad- 
dition to head nodding, the ease of generating thoughts 
affects the confidence people have in them (Tormala, Petty, & 
Brifiol, in press). When people are asked to generate a small 
and easy number of cognitive responses (e.g., counterargu- 
ments or favorable thoughts), they have more confidence in 
the responses and rely on them to a greater extent in deter- 
mining their attitudes than when they are asked to generate a 
higher and more difficult number of thoughts. 


Self-Persuasion with No Message 


The importance of one’s own thoughts in producing persua- 
sion outcomes is highlighted in research showing that self- 
persuasion can occur even in the absence of an external 
message. For example, research has demonstrated that per- 
suasion in the absence of a message can occur when individ- 
uals are asked to actively present or generate their own 
messages or even when individuals are simply permitted to 
engage in thought about an attitude object. 


Role Playing 


Early research on role playing in persuasion found it to be an 
effective tool to increase persuasion as well as the resistance 


and persistence of the resulting attitudes. In one of the earliest 
role-playing demonstrations, Janis and King (1954) examined 
the differential effects of having people actively present per- 
suasive arguments versus passively hear arguments presented 
by others. Results indicated that participants who actively 
generated and presented messages were typically more per- 
suaded than were those who passively listened to messages. 
This effect has been replicated numerous times (e.g., Elms, 
1966; Greenwald & Albert, 1968; Janis & Mann, 1965). 

A number of mechanisms have been proposed to account 
for these role-playing effects. Janis (1968) proposed a biased 
scanning explanation whereby individuals, in the process of 
supporting an attitudinal position, recruit consistent beliefs 
and inhibit inconsistent beliefs (see also, Kunda, 1990). This 
interpretation is based in part on the finding that improvisa- 
tion is an important element in eliciting role-playing effects. 
King and Janis (1956) showed that a process of active argu- 
ment generation was necessary to elicit role-playing persua- 
sion effects. Simply reading a set of persuasive arguments to 
others did not elicit as much persuasion as did extemporane- 
ously elaborating on the message. Presumably, actively gen- 
erating arguments in favor of a given position leads to the 
active retrieval of supportive information that is uniquely 
persuasive to the individual and to the inhibition of nonsup- 
porting information (Greenwald & Albert, 1968; Janis & 
King, 1954). The information that people self-generate might 
seem particularly compelling to the generator because of the 
enhanced effort involved in generation over passive exposure 
(Festinger, 1957)—or the arguments might seem more com- 
pelling simply because they are associated with the self (i.e., 
an ownness bias; Perloff & Brock, 1980). People might also 
have more confidence in the thoughts that they generate, 
leading them to be more influential than are arguments re- 
ceived by others (Petty, et al., 2002). 


Mere Thought 


Some research has indicated that attitude polarization can 
sometimes occur when individuals simply engage in exten- 
sive thought about an attitude object (see Tesser, Martin, & 
Mendolia, 1995, for a review). Attitude polarization follow- 
ing thought requires a well-integrated and consistent attitude 
schema (e.g., Chaiken & Yates, 1985; Tesser & Leone, 1977); 
otherwise, thought leads to attitude moderation. This rela- 
tionship appears to be bidirectional—that is, just as having a 
consistent schema fosters attitude polarization with thought, 
simply thinking about an issue also tends to increase schema 
consistency via the generation of schema-consistent cogni- 
tions and the reinterpretation of inconsistent cognitions (e.g., 


Millar & Tesser, 1986; Sadler & Tesser, 1973; Tesser & 
Cowan, 1975). 

The attitudinal consequences of mere thought are depen- 
dent upon the salient subset of information that is the focus of 
the thought (Tesser, 1978). Attitude change can sometimes 
occur following thought because individuals focus on selec- 
tive subsets of information (e.g., Levine, Halberstadt, & 
Goldstone, 1996; Wilson, Dunn, Kraft, & Lisle, 1989). For 
example, when participants are instructed to analyze the rea- 
sons for their attitudes, they often focus on those that are eas- 
iest to verbalize (Wilson et al., 1989). Consequently, they 
may often overemphasize the cognitive component of their 
attitudes to the neglect of the affective component, leading 
to a momentary attitude shift. Selective focus on a subset of 
attitude-relevant information increases the impact of that lim- 
ited subset of information on attitude judgments and can con- 
sequently lead to suboptimal decision making (e.g., Wilson 
et al., 1993; Wilson & Schooler, 1991). 


Self-Persuasion as a Result of Dissonance Processes 


We have seen that self-persuasion can occur when people are 
prompted to think by receiving a persuasive message, by 
doing a role-playing exercise, or by simply being asked to 
think. Attitude change can also occur when a person’s own 
behavior motivates him or her to think. A common assump- 
tion of many persuasion theories is that individuals have a de- 
fault motivation of accuracy—that is, people want to hold 
correct attitudes. However, the elaboration likelihood model 
and other persuasion theories acknowledge that a variety of 
biasing motivations can sometimes distort objective informa- 
tion processing. Although a number of these motivations 
exist, the motive to be consistent is the most studied, and the 
theory of cognitive dissonance is the most influential of the 
consistency theories. In its original formulation (Festinger, 
1957), dissonance was described as a feeling of aversive 
arousal akin to a drive state experienced by an individual 
when he or she simultaneously held two conflicting cogni- 
tions. The resulting aversive arousal was hypothesized to in- 
stigate attempts to restore consonance among the relevant 
cognitions. Attempts to restore consistency typically in- 
volved very active thinking about the attitude object, and the 
end result of this thinking was often a change in the person’s 
attitude. 


Dissonance Effects 


A large body of research using different experimental para- 
digms has supported the essence of dissonance theory (see 
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Brehm & Cohen, 1962; Cooper & Fazio, 1984; Harmon-Jones 
& Mills, 1999; for reviews). Some experimental procedures 
used to induce dissonance include coaxing people to write 
counterattitudinal essays under the illusion of free choice 
(e.g., Losch & Cacioppo, 1990), undergoing harsh initiations 
to join an uninteresting group (e.g., Aronson & Mills, 1959), 
selecting between two different but equally desirable products 
(e.g., Brehm, 1956), and eating grasshoppers after a request 
from a dislikable person (Zimbardo, Weisenberg, Firestone, & 
Levy, 1965). In these instances, people become more favor- 
able toward the initially counterattitudinal position, the 
uninteresting group, the chosen product, and the initially dis- 
tasteful grasshoppers. 

Early work in dissonance theory suggested that individu- 
als must directly resolve the cognitive inconsistency by 
changing their attitudes—generating cognitions to make the 
dissonant elements more consistent (i.e., bolstering)—or by 
minimizing the importance of the dissonant cognitions Oe, 
trivializing; see Simon, Greenberg, & Brehm, 1995). How- 
ever, some research has suggested that dissonance can be 
reduced (at least temporarily) by engaging in virtually any ac- 
tivity that distracts one from the dissonance. For example, in- 
dividuals appear to successfully reduce their dissonance by 
affirming even unrelated aspects of their self-concepts 
(Steele, 1988; Tesser & Cornell, 1991), by consuming alcohol 
(Steele, Southwick, & Critchlow, 1981), or by watching a 
comedy film (Cooper, Fazio, & Rhodewalt, 1978). By con- 
trast, individuals avoid receiving even positive information 
about themselves if it is highly related to the dissonance- 
arousing event, and when such exposure is forced, the amount 
of experienced dissonance increases (Blanton, Cooper, 
Skurnik, & Aronson, 1997). 

A number of research studies have supported the hypothe- 
sis that physiological arousal follows from situations thought 
to induce cognitive dissonance (e.g., Elkin & Leippe, 1986; 
Losch & Cacioppo, 1990), and such arousal has been shown 
to be subjectively unpleasant (Elliot & Devine, 1994). When 
the arousal can be plausibly misattributed to some unrelated 
environmental agent (rather than to the true dissonance- 
arousing event), dissonance-based attitude change fails to 
occur (e.g., Fazio et al., 1977; Zanna & Cooper, 1974). How- 
ever, evidence for the mediational role of arousal in eliciting 
dissonance-based attitude change is equivocal. Some work, 
for example, suggests that the experience of dissonance has 
less to do with arousal per se and more to do with feeling un- 
pleasant (e.g., Higgins, Rhodewalt, & Zanna, 1979; Losch & 
Cacioppo, 1990). Additionally, in contrast to the predictions 
of dissonance theory, attitude change following a dissonance 
induction can sometimes fail to reduce dissonance-based 
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arousal (Elkin & Leippe, 1986; Harmon-Jones, Brehm, 
Greenberg, Simon, & Nelson, 1996). 


Limiting Conditions 


Early research supported the hypothesis that dissonance was 
experienced when a person had insufficient justification for 
violating a belief or attitude (Festinger, 1957; Festinger & 
Carlsmith, 1959). Since the original formulation of the disso- 
nance construct, however, many researchers have imposed 
limiting conditions on the basic dissonance predictions. For 
example, some researchers asserted that commitment to the 
behavior was necessary to elicit dissonance (e.g., Brehm & 
Cohen, 1962). Additionally, some research indicates that cog- 
nitive inconsistency per se is neither necessary nor sufficient 
to generate dissonance. In an influential new look at disso- 
nance research, Cooper and Fazio (1984) concluded that for 
dissonance to be aroused, an individual must be responsible 
for engaging in an action that has negative or undesired con- 
sequences. If an individual engages in a counterattitudinal ac- 
tion that has no apparent effect (e.g., Collins & Hoyt, 1972; 
Cooper & Worchel, 1970) or a positive effect (Scher & 
Cooper, 1989), dissonance effects do not occur. Similarly, 
even a proattitudinal behavior can arouse dissonance if it has 
unintended, aversive consequences (Scher & Cooper, 1989). 
Moreover, if the individual does not feel responsibility for the 
discrepant action because the consequences were unforesee- 
able (e.g., Cooper, 1971; Hoyt, Henley, & Collins, 1972), dis- 
sonance likewise fails to obtain. 


Alternative Views 


Two additional alternatives implicate the self as the essential 
component in eliciting dissonance. Steele’s self-affirmation 
theory suggests that dissonance results from any threat to 
viewing oneself as “adaptively and morally adequate” 
(Steele, 1988, p. 262). Alternately, Aronson (1969) has ar- 
gued that dissonance is based on inconsistency between one’s 
self-view and one’s actions (e.g., I am a good person and did 
a bad deed). These two alternatives differ in their predictions 
of whether individuals prefer self-verification or self- 
enhancement. Steele’s self-affirmation theory predicts that 
people prefer positive feedback even if it is inconsistent with 
their self-view, whereas Aronson’s self-inconsistency view 
postulates that people will prefer self-consistent feedback 
even if it is negative. The views also differ in whether people 
high or low in self-esteem should be more susceptible to dis- 
sonance effects. The self-inconsistency view holds that indi- 
viduals high in self-esteem should show greater dissonance 
effects because it is more inconsistent for a person with high 


self-esteem to engage in bad deeds. The self-affirmation view 
holds that high self-esteem individuals should show reduced 
dissonance effects because they have more self-affirmational 
resources to use to protect against dissonance. Unfortunately, 
the research evidence on this question is mixed, with some 
studies showing greater dissonance effects for individuals 
with low self-esteem (Steele, Spencer, & Lynch, 1993) and 
other studies showing greater dissonance effects for persons 
with high self-esteem (Gerard, Blevans, & Malcolm, 1964). 

A final alternative is the self-standards model of disso- 
nance (Stone & Cooper, 2001). This model attempts to put the 
new look, self-consistency, and self-affirmation theories 
under a single conceptual umbrella by suggesting that disso- 
nance results from the violation of salient normative or idio- 
graphic self-standards. According to this model, when 
dissonant-relevant self-attributes are salient, higher disso- 
nance should result in persons with high than low self-esteem. 
This is because high self-esteem individuals have higher per- 
sonal standards and the dissonant behavior is more likely to 
be inconsistent with these standards. When irrelevant self-at- 
tributes are salient, however, the opposite pattern is predicted 
to occur; this is because the positive irrelevant self-attributes 
should provide high self-esteem individuals with greater self- 
affirmational resources to draw upon and therefore reduce the 
need to engage in self-justification via attitude change. Last, 
when normative standards are more salient, dissonance 
should be equal between high and low self-esteem individu- 
als because the same normative standard is determining dis- 
sonance arousal for everyone (see Stone & Cooper, 2001, for 
more detail regarding these predictions). 

The true distinctions between the original dissonance the- 
ory, the new look formulation, the self-approaches, and the 
self-standards model are sometimes nebulous, however, and 
findings consistent with one approach can often be incorpo- 
rated by another. For example, results that could be inconsis- 
tent with the new look formulation include the finding that 
engaging in counterattitudinal behaviors with no apparent 
consequences to others (Harmon-Jones, 2000; Harmon-Jones 
et al., 1996) or engaging in proattitudinal behaviors with 
positive consequences to others (Dickerson, Thibodeau, 
Aronson, & Miller, 1992; Prislin & Pool, 1996; Stone, 
Aronson, Crain, Winslow, & Fried, 1994) can elicit disso- 
nance-based attitude change. However, when aversive con- 
sequences are considered to be “anything that blocks one’s 
self-interest or an event that one would rather have not occur” 
(Cooper & Fazio, 1984, p. 232; Cooper, 1992) or the viola- 
tion of some standard (Cooper, 1999), the new look approach 
can accommodate such results (Cooper, 1992; cf. Harmon- 
Jones, 2000; Harmon-Jones et al., 1996; Stone & Cooper, 
2001). 


Results that might be inconsistent with the original disso- 
nance formulation include the finding that proattitudinal be- 
haviors can elicit dissonance when aversive consequences 
result (e.g., Scher & Cooper, 1989) and that inconsistency 
might fail to arouse dissonance if the individual has low 
consistency needs (e.g., Cialdini, Trost, & Newsom, 1995; 
Snyder & Tanke, 1976). The original dissonance formulation 
might account for such results by considering the importance 
of the cognitions creating the inconsistency (Harmon-Jones 
et al., 1996). Important cognitions should have greater weight 
in determining dissonance magnitude than should unimpor- 
tant cognitions. Therefore, if aversive (and inconsistent) con- 
sequences are highly important, they could override the 
effect of the proattitudinal act itself (Harmon-Jones et al., 
1996). Similarly, individual differences in consistency needs 
might be accounted for by differential importance weights 
across individuals. 

As should be apparent, the original dissonance formula- 
tion and its alternatives appear to be quite flexible in ac- 
counting for the wide variety of effects one’s behavior can 
have on one’s attitude. The flexibility associated with these 
different ways of interpreting dissonance findings affords 
greater explanatory breadth, but it comes with a cost. Specif- 
ically, this flexibility makes it difficult to accurately predict 
when any given individual will experience dissonance—a 
criticism that has often been leveled at dissonance theory 
(Aronson, 1992, 1999). 


Nondissonance Alternatives 


In addition to the dissonance modifications described previ- 
ously, two nondissonance alternatives have been proposed to 
account for the findings of dissonance researchers. One such 
alternative is self-perception theory. As described earlier, 
self-perception theory (Bem, 1965) holds that individuals 
often infer their attitudes from their own behavior. Self- 
perception theory was a formidable opponent to the disso- 
nance view because it was able to account for many of the 
results attributed to dissonance mechanisms (Greenwald, 
1975). It later became apparent that self-perception was a dif- 
ferent phenomenon that functioned in different settings and 
was not simply an alternative explanation for cognitive dis- 
sonance (e.g., Beauvois, Bungert, & Mariette, 1995). For in- 
stance, in contrast to dissonance processes, self-perception 
processes appear to operate when one’s behavior falls in 
one’s latitude of acceptance and thus elicits little aversive 
arousal (Fazio et al., 1977). In addition, whereas dissonance 
reduction has been proposed to require considerable cogni- 
tive effort (Festinger, 1957), self-perception processes appear 
to involve simpler attributional decisions that operate under 
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relatively low-effort circumstances (see attribution section in 
this chapter). 

A second alternative mechanism is impression manage- 
ment. Proponents of this view believe that the attitude change 
observed in dissonance experiments results not from aversive 
arousal associated with cognitive inconsistency, but instead 
from the desire to appear consistent to others (e.g., Tedeschi 
et al., 1971). Although impression management is a motiva- 
tional variable that can affect attitude reports, it cannot 
account for all dissonance phenomena. For example, disso- 
nance-based attitude change can also occur in situations in 
which attitude reports are private and anonymous and should 
therefore arouse no impression management concerns (e.g., 
Baumeister & Tice, 1984; Hoyt et al., 1972). 


Combinatory Approaches 


Combinatory approaches emphasize the different ways in 
which individuals assign value to pieces of information and 
integrate them into a structure of beliefs and attitudes. These 
models differ in their emphasis on the types of information 
individuals consider, as well as the means by which the in- 
formation is integrated. Three different types of models have 
received the most research attention. 


Probabilogical Model 


The probabilogical model (McGuire, 1960, 1981; Wyer, 
1970, 1974) suggests that beliefs are represented in memory 
in a network of syllogistic structures that have both horizon- 
tal and vertical dimensions. Each syllogism contains two 
premises that logically imply a conclusion. 

Consider the following syllogism: 


Premise 1: Drinking Brand X beer makes one popular. 
Premise 2: Being popular is desirable. 
Conclusion: Drinking Brand X beer is desirable. 


The conclusion of this syllogism relies on Premises 1 and 2. 
The vertical structure of the network is formed by related 
syllogisms. For example, Premises 1 and 2 could each be 
the conclusion of other syllogisms, and the premises that 
lead to these conclusions could each be the conclusions of 
yet other syllogisms. The vertical structure of the network 
has important implications for attitude change because 
changing beliefs at one point in the vertical structure can 
lead to logical change in other elements within the vertical 
structure. 

The network of syllogisms also has a horizontal structure. 
The horizontal dimension incorporates other syllogisms that 
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share the same conclusion and can also have important impli- 
cations for attitude change. In particular, the probabilogical 
model specifies that belief in the conclusion of a syllogism 
should be resistant to the extent that many other syllogisms 
imply the same conclusion. Hence, if a conclusion that Brand 
X beer is desirable rests on a single syllogism, undermining 
one belief in the syllogism should easily undermine one’s 
belief that Brand X beer is desirable. However, if multiple 
premises support Brand X’s virtues, undermining a single 
syllogism should have less of an effect—that is, the more in- 
formation on which an attitude is based, the more difficult it 
should be to change the attitude. 

Importantly, the models of McGuire (1960, 1981) and 
Wyer (1970, 1974) do not assume that beliefs in premises or 
conclusions are all or none. Rather, people can hold beliefs 
with differing degrees of likelihood. A number of studies 
have shown that the probability one assigns to a conclusion 
follows closely what it should based on the laws of probabil- 
ity (Henninger & Wyer, 1976; Rosen & Wyer, 1972; Wyer, 
1973). More important is that changes in the belief of a con- 
clusion based on a change in belief in one of the premises can 
also be predicted to a good extent by the laws of probability 
(McGuire, 1981). However, logical consistency is not the 
only factor that determines the strength of people’s beliefs. 
The desirability of the beliefs is also important. This hedonic 
consistency (McGuire, 1960) leads to a bias such that people 
tend to see as likely things that are good, and to see as good 
things that are likely. 

One of the most interesting elements of the probabilogical 
model is its ability to describe how some attitudes or judg- 
ments affect others. The model offers one explanation for 
how attitude change on one issue such as abortion can affect 
related attitudes such as one’s attitude toward contraception 
(Mugny & Perez, 1991; see also Dillehay, Insko, & Smith, 
1966). Changing one attitude can lead to a change in another 
if the attitudes are somehow related in the syllogistic network 
(e.g., being derived from common premises; see also Crano & 
Chen, 1998). 


Expectancy-Value Formulations 


Expectancy-value theories propose that attitudes reflect an 
individual’s subjective assessment of the likelihood that an 
attitude object will be associated with positive or negative 
consequences or related to important values (Peak, 1955; 
Rosenberg, 1956; see Bagozzi, 1985, for a review). A partic- 
ularly influential model, the theory of reasoned action 
(Fishbein & Ajzen, 1975, 1981), posits that attitudes are a 
multiplicative function of the desirability of an individual’s 
salient beliefs about an attitude object and the likelihood that 


those beliefs are true. For example, one’s attitude towards a 
political candidate could be predicted by the expectancy that 
the candidate will enact certain policies if elected and the 
value or desirability the individual places on those policies. 

Although studied primarily within the framework of be- 
havioral prediction rather than that of attitude change, this 
formulation has clear implications for the successful devel- 
opments of persuasive communications. Specifically, the the- 
ory of reasoned action implies that attitude change should 
follow changes in perceptions of the likelihood or desir- 
ability of the consequences associated with a position (see 
Albarracin, in press; Petty & Wegener, 1991, for discussion). 
And, in fact, a number of studies have indicated that persua- 
sive messages and contextual variables such as a person’s 
mood can produce attitude change by changing the perceived 
likelihood or desirability of salient beliefs (e.g., Albarracin & 
Wyer, 2001; Fishbein, Ajzen, & McArdle, 1980; Lutz, 1975; 
MacKenzie, 1986; Wegener, Petty, & Klein, 1994). 

Although some researchers have proposed that virtually all 
attitude change occurs via the thoughtful consideration of like- 
lihood and desirability assessment (Fishbein & Middlestadt, 
1995; McGuire & McGuire, 1991), as we described previ- 
ously, attitude change can also occur via multiple low-effort 
processes. Additionally, even likelihood and desirability as- 
sessments could be made via low-effort processes. For exam- 
ple, under low-elaboration conditions, individuals are prone to 
believing whatever they hear (Gilbert, 1991; Gilbert, Tafarodi, 
& Malone, 1993) and perceiving stimuli positively (Cacioppo 
& Berntson, 1994; Peeters & Czapinski, 1990). Repeated 
exposure appears to magnify these propensities. For example, 
repeated exposure to a piece of information increases percep- 
tions of its validity (e.g., Arkes, Boehm, & Xu, 1991), and as 
noted earlier, repeated mere exposure to a stimulus increases 
its desirability (Zajonc, 1968), even when the exposure is sub- 
liminal (Bornstein & D’ Agostino, 1992). 

However, it seems likely that the retrieval and integration 
of likelihood and desirability assessments of multiple salient 
beliefs would require effort and would occur only when indi- 
viduals have the ability and motivation to do so. In support of 
this reasoning, expectancy-value processes tend to account 
for more variance in attitudes when motivation (e.g., the need 
for cognition; Wegener et al., 1994) and ability (e.g., topic- 
relevant knowledge; Lutz, 1977) to think are high. 


Information Integration 


In addition to specifying the primary components of atti- 
tudes, attitude theorists have also attempted to specify the 
means by which these components are combined to influence 
attitudes. As just noted, the expectancy-value formulation of 


Fishbein and Ajzen predicts that the information is combined 
additively to form attitudes—that is, attitudes are postulated 
to be the sum of the likelihood X desirability products for each 
salient attribute associated with the attitude object. However, 
other theorists such as Anderson (1971) have proposed that 
beliefs are combined by an averaging function. In this formu- 
lation, each salient belief is weighted by the individual’s as- 
sessment of the importance of that piece of information. 

Anderson’s averaging model has proven efficacious in ex- 
plaining the impact of different information on resulting atti- 
tudes or summary judgments. The flexibility of the averaging 
account in accommodating the data is simultaneously its 
greatest strength and weakness (see Eagly & Chaiken, 1984; 
Petty & Cacioppo, 1981). By adjusting the weighting para- 
meter of the initial attitude or beliefs in a post hoc fashion, 
the model can accommodate nearly any finding, but an 
a priori basis for different combinatory patterns is not well 
specified by the model. Distinguishing the averaging account 
from additive accounts can be exceedingly difficult, and cru- 
cial tests have yet to emerge. At present, there is some sug- 
gestion that people are more likely to use an adding 
integration rule when thinking is at the low end of the elabo- 
ration continuum (Betsch, Plessner, Schwieren, & Giitig, 
2001), but they use an averaging rule when elaboration is 
higher (Petty & Cacioppo, 1984). 


WHAT HAPPENS WHEN ATTITUDES CHANGE? 


We have now discussed the major low- and high-effort 
approaches to understanding attitude change. As we noted ear- 
lier, all of these approaches focus on changing a person’s ex- 
plicit attitude—but what role do implicit attitudes play in 
attitude change? Our previous discussion of implicit and ex- 
plicit attitudes suggested that a given individual might hold 
more than one attitude toward the same attitude object—one 
explicit and one implicit. It has been demonstrated, for in- 
stance, that although people tend to report favorable attitudes 
toward minority group members on some explicit measures, 
they may simultaneously show evidence of unfavorable atti- 
tudes on more implicit measures (e.g., Banaji & Greenwald, 
1995; Devine, 1989; Dovidio et al., 1997; Fazio et al., 1995; 
Wittenbrink, Judd, & Park, 1997). Acommon explanation for 
this finding (e.g., Devine, 1989) has been that negative associ- 
ations develop early in life and remain accessible in memory 
even after more positive attitudes are later formed. This expla- 
nation is consistent with the dual-memory system articulated 
by E. R. Smith and DeCoster (2000). According to this model, 
people have two memory systems—a slow-learning system 
that detects regularities in the environment over time and a 
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fast-learning system designed more for the memory of single 
events or one-time experiences. Based on this formulation, 
conflicting attitudes might coexist in different systems. 

The possibility of people having both implicit and explicit 
attitudes has a number of important implications. Perhaps the 
most relevant implication for attitude change is that it suggests 
that on some occasions when attitudes appear to change (e.g., 
when initial negative racial attitudes become more positive), 
the new attitude might not literally replace the old attitude, 
but may instead coexist in such a way that the old attitude 
can resurface under specifiable circumstances (Cacioppo 
et al., 1992; Jarvis, Petty, & Tormala, 1999; Petty, Baker, & 
Gleicher, 1991; Wilson et al., 2000). This notion is a radical 
departure from previous treatments of attitude change—that 
is, the prevailing assumption of prior models was that when 
attitude change occurred, the prior attitude was incorporated 
into the new attitude such that the old attitude ceased to exist 
and was replaced by the new one. In his information integra- 
tion theory discussed earlier, Anderson (1971) represented 
this mathematically as 


ae (woAo +5 wisi) / (wo + Lu) (15.1) 


This formula says that a person’s new attitude (A,,) following 
some new information (s) is a weighted (w) average of the 
new information and the old attitude (Ao). Stated differently, 
the old attitude is weighted by its importance along with the 
importance of the new information, each piece of which has 
some scale value (s). After the integration has taken place, the 
old attitude is replaced by the new one. 

In contrast to the information integration approach, 
the notion of implicit attitudes suggests that people can have 
different attitudes toward the same object: one that is explicit 
and one that is implicit. According to the dual attitude model 
(Wilson et al., 2000), two attitudes can form when one atti- 
tude, Ag, changes to another, A,. When this occurs, the 
original attitude Ap does not actually disappear. Instead, 
according to this model, it becomes implicit and persists 
in memory along with A,, which is considered the explicit 
attitude. The dual attitude model is depicted schematically in 
the top panel of Figure 15.1. This model represents a case 
in which a person with an initially negative attitude toward a 
racial group subsequently becomes positive. Wilson et al. 
posit that both attitudes can influence responding. Whereas 
the newer (explicit) attitude affects controlled responses 
(e.g., direct attitude measures; deliberative behaviors), the 
older (now implicit) attitude affects responses that individu- 
als are not motivated or able to control (e.g., indirect attitude 
measures; spontaneous behaviors; see Dovidio et al., 1997; 
and Greenwald & Banaji, 1995, for similar views). 
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Dual Attitudes Model 


TIME 1 


TIME 2 


PAST Model 
(-3) 


TIME 2 = 


Figure 15.1 What happens when attitudes change? (Top 
panel). In the dual attitudes model, when attitudes change 
from Time 1 to Time 2, the old attitude becomes implicit and 
the new attitude is the explicit attitude (Wilson et al., 2000). 
(Bottom panel). In the PAST model, when attitudes change, 
the old attitude acquires a “false” tag which allows for the 
possibility of ambivalent responding (Jarvis et al., 1999; 
Petty & Jarvis, 1998). 


An alternative to the dual attitude model, the PAST (prior 
attitudes are still there; Jarvis et al., 1999; Petty & Jarvis, 
1998) model was also proposed to account for what happens 
to the old attitude when attitudes change. The PAST model 
differs, however, in that it presents a more dynamic picture 
of the relationship between the old and new attitude, suggest- 
ing that both can simultaneously influence responding under 
certain circumstances. In short, the PAST model, like the dual 
attitudes model, holds that the prior attitude remains in mem- 
ory, and because it is consciously rejected can be considered 
implicit (i.e., people are unaware of currently holding this at- 
titude). However, the PAST model proposes that when a new 
attitude is acquired, the old attitude takes on a false or “low 
confidence” tag that must also be activated if the old attitude 
is to be suppressed (see Gilbert et al., 1993). The bottom 
panel of Figure 15.1 presents a schematic depiction accord- 
ing to the PAST model of a person who was initially unfa- 
vorable toward a minority group and then became favorable. 
According to the PAST model, to the degree that the false tag 
is accessible, the newer attitude will guide responses (see 
also Kawakami et al., 2000). The prior attitude will have an 
impact, however, if it was never fully rejected (i.e., no false 
tag or a weak one), if the false tag cannot be retrieved, or if 
the tag is retrieved but one is still unable to inhibit the prior 


attitude’s influence for some other reason. According to the 
PAST model, when current and prior attitudes conflict and 
both are accessible, they should produce ambivalent respond- 
ing. Thus, the PAST model, unlike the dual attitude model, 
suggests that current and prior attitudes do not always oper- 
ate in an either-or fashion. Rather, depending on the circum- 
stances, either one or the other or both could exert some 
impact. Over the coming years, the viability of dual atti- 
tude models for understanding attitude change is likely to 
receive considerable research attention. 


CONCLUSIONS 


Our goal in this chapter has been to present an organizing 
framework for understanding the psychological processes re- 
sponsible for attitude change. Since the earliest empirical 
studies of attitude change in the 1920s, much has been 
learned about the underlying determinants and consequences 
of different attitude change processes. We divided the theo- 
retical processes responsible for modifying attitudes into 
those that emphasize effortful thinking about the central mer- 
its of the attitude object and those that rely on less cognitively 
demanding processes. This framework allows understanding 
and prediction of what variables affect attitudes and in what 
general situations they do so. In addition, this framework 
helps to place the various minitheories of attitude change in 
their proper domain of operation. For example, high-effort 
processes like cognitive responses should account for attitude 
change in those contexts in which thinking is expected to be 
high, whereas a lower-effort process such as balance or use of 
simple heuristics should be more likely to account for empir- 
ical effects in those contexts in which thinking is expected to 
be low. Finally, recognition of an elaboration continuum per- 
mits understanding and prediction of the strength of attitudes 
changed by different processes. Attitudes that are changed as 
a result of considerable mental effort tend to be more persis- 
tent, resistant to counterpersuasion, and predictive of behav- 
ior than are attitudes that are changed by a process invoking 
little mental effort in assessing the central merits of the 
object. 

Although a multitude of processes are involved in chang- 
ing attitudes, we have a reasonably good handle on what 
these processes are and when they operate. Yet despite the 
considerable progress that has been made in understanding 
attitude change, much work remains to be done. The next 
decade will likely bring advances in a number of areas. First, 
greater appreciation is needed for the view that any one vari- 
able is capable of multiple roles in the persuasion process. At 
present, most studies still focus on the one process by which 


a variable has an impact on attitudes. More research is needed 
on the multiple ways in which variables can influence atti- 
tudes in different situations. Second, one of the most exciting 
new domains of inquiry is the interplay between explicit and 
implicit attitudes. For example, what is the best way to con- 
ceptualize and assess implicit attitudes? Under what condi- 
tions are implicit and explicit attitudes likely to guide action? 
Are some attitude change processes more likely to influence 
implicit attitudes, whereas others are more likely to change 
explicit attitudes? Work on the topic of implicit attitudes is in 
its infancy, but the next decade promises to provide more de- 
finitive answers to these and other questions. 
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The belief that we are the masters of our own destiny surely 
ranks among the most fundamental of human conceits. This 
overarching self-perception is viewed by many scholars as a 
prerequisite to personal adjustment, enabling us to face un- 
certainty with conviction and challenges with perseverance 
(cf. Alloy & Abramson, 1979; Deci & Ryan, 1985; Kofta, 
Weary, & Sedek, 1998; Seligman, 1975; Taylor & Brown, 
1988), and as equally central to the maintenance of social 
order because of its direct link to the attribution of personal 
responsibility (cf. Baumeister, Stillwell, & Heatherton, 1994; 
Shaver, 1985). Its adaptive significance notwithstanding, the 
sense that one’s actions are autonomous, self-generated, and 
largely impervious to external forces is routinely exagger- 
ated in daily life (e.g., Langer, 1978; Taylor & Brown, 1988), 
and ultimately can be dismissed as philosophically untenable 
to the extent that it reflects naive assumptions about personal 
freedom (cf. Bargh & Chartrand, 1999; Skinner, 1971). 
Social psychologists know better, and in their pursuit of the 
true causal underpinnings of behavior, they have routinely 
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placed the individual at the intersection of various and sundry 
social forces. In this view, people represent interdependent 
elements that together comprise larger social entities, be they 
familial, romantic, or societal in nature. Against this back- 
drop, people continually influence and in turn are influenced 
by one another in myriad ways. Social influence is the 
currency of human interaction, and although its operation 
may be subtle and sometimes transparent to the individuals 
involved, its effects are pervasive. 

In recognition of the primacy of influence in the social 
landscape, G. W. Allport (1968) defined the field of social 
psychology as “an attempt to understand . . . how the thought, 
feeling, and behavior of the individual are influenced by the 
actual, imagined, or implied presence of others.” No other 
topic in social psychology can lay claim to such centrality. 
After all, no one has defined social psychology as the study of 
impression formation or self-concept, nor have researchers 
investigating such topics done so without assigning a promi- 
nent role to social influence processes. The belief in self- 
determination may well be important for personal and 
societal function, but the reality of social influence is equally 
significant—and for many of the same reasons. Our aim in 
this chapter is to outline the fundamental features of social 
influence and to illustrate the manifestations of influence in 
different contexts. In so doing, we emphasize the various 
functions served by social influence, both for the individual 
and for society. 
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OVERVIEW OF CHAPTER 


Because social influence is deeply embedded in every aspect 
of interpersonal functioning, any attempt to discuss it apart 
from all the topics and research traditions defining social psy- 
chology is necessarily incomplete and potentially mislead- 
ing. How can one divorce a depiction of basic influence 
processes from such phenomena as attitude change, self- 
concept malleability, or the development of close relation- 
ships? As it happens, of course, any field of scientific inquiry 
is differentiated into relatively self-contained regions, and 
social psychology is no exception. Although it can be ar- 
gued that one person’s practical differentiation is another 
person’s unnecessary fragmentation (see, e.g., Gergen, 1985; 
Vallacher & Nowak, 1994), it is nonetheless the case that dis- 
tinct theoretical and research traditions have emerged over 
the years to create a workable taxonomy of social psycholog- 
ical phenomena. Despite the pervasive nature of social influ- 
ence, then, it is commonly treated as a separate topic in 
textbooks and secondary source summaries of relevant the- 
ory and research. To an extent, our treatment of social influ- 
ence works within the accepted boundary conditions. Thus, 
we discuss such agreed-upon subtopics as compliance, con- 
formity, and obedience to authority. At the same time, how- 
ever, we attempt to impose a semblance of theoretical order 
on the broad assortment of relevant processes. So although 
each manifestation of influence—whether in advertising, the 
military, or intimate relationships—taps correspondingly dis- 
tinct psychological mechanisms, there are certain invariant 
features that transcend the surface structure of social influ- 
ence phenomena. 

We begin by discussing the exercise of external control to 
influence people’s thoughts and behaviors. Rewards and pun- 
ishments have self-evident efficacy in controlling behavior 
across the animal kingdom, so their incorporation into influ- 
ence techniques in human affairs is hardly surprising. We 
then turn our attention to less blatant strategies of influence 
that typically fare better in inducing sustained changes in 
people’s thought and behavior. It is noteworthy in this regard 
that the lion’s share of the literature subsumed under the so- 
cial influence label emphasizes subtle manipulation rather 
than direct attempts at control. We provide an overview of the 
principal manipulation techniques and abstract from them 
common features that are responsible for their relative 
success. This theme provides the foundation for an even less 
blatant approach to influence, one centering on the coordina- 
tion of people’s internal states and overt behaviors. People 
have a natural tendency to bring their beliefs, preferences, 
and actions in line with those of the people around them, and 


this tendency becomes manifest in the absence of overt or 
subtle manipulation strategies. This penchant for interper- 
sonal synchronization is what enables a mere collection of 
individuals to become a functional unit defining a higher 
level of social reality. 

We then turn our attention to the manifestation of social 
influence at the level of society. A central theme here is that 
the emergence and maintenance of macrolevel properties in 
a social system can be understood in terms of the mi- 
crolevel influence processes described in the preceding 
sections. We describe the results of computer simulations 
demonstrating this linkage between different levels of so- 
cial reality. In a concluding section, we abstract what appear 
to be the common features of influence across different top- 
ics and relate them to fundamental psychological processes, 
chief among them the coordination of individual elements 
to create a coherent higher-order unit. Our suggestions in 
this regard are as much heuristic as integrative, and we offer 
suggestions for future lines of theoretical work to forward 
this agenda. 


EXTERNAL CONTROL 


The most elemental way to influence someone’s behavior is 
make rewards and punishments contingent on the enactment 
of the behavior. For the better part of the twentieth century, 
experimental psychology was essentially defined in terms of 
this perspective, and during this era a wide variety of rein- 
forcement principles were generated and validated. Attempts 
to extend these principles to social psychology were always 
complicated by the undeniable cognitive capacities of hu- 
mans and the role of such capacities in regulating behavior 
(cf. Bandura, 1986; Zajonc, 1980). Nonetheless, several lines 
of research based on behaviorist assumptions are represented 
in social psychology (e.g., Byrne, 1971; Staats, 1975). With 
respect to social influence, this perspective suggests simply 
that people are motivated to do things that are associated with 
the attainment of pleasant consequences or the avoidance of 
unpleasant consequences. Thus, people adopt new attitudes, 
develop preferences for one another, change the frequency of 
certain behaviors, or take on new activities because they in 
effect have been trained to do so. It’s fair to say this perspec- 
tive never achieved mainstream status in social psychology, 
but one might think that social influence would be an ex- 
ception. Reinforcement, after all, is defined in terms of the 
control of behavior, and to the extent that a self-interest 
premise underlies virtually all social psychological theories 
(cf. Miller, 1999), it is hard to imagine how the promise of 


reward or threat of punishment could fail to influence peo- 
ple’s thoughts, feelings, and actions. 


Bases of Social Power 


The ability to control someone’s behavior, whether by carrot 
or stick, is synonymous with having power over that person. 
Presumably, then, successful influence agents are those who 
are seen—by the target at least—as possessing social power. 
In contemporary society, power reflects more than physical 
strength, immense wealth, or the capacity and readiness 
to harm others—although having such attributes certainly 
wouldn’t hurt under some circumstances. Social power 
instead derives from a variety of different sources, each 
providing a correspondingly distinct form of behavior con- 
trol. The work of French and Raven (1959; Raven, 1992, 
1993) is commonly considered the definitive statement on the 
various bases of social power and their respective manifesta- 
tions in everyday life. They identify six such bases: reward, 
coercion, expertise, information, referent power, and legiti- 
mate authority. 

Reward power derives, as the term implies, from the abil- 
ity to provide desired outcomes to someone. The rewards 
may be tangible and material (e.g., money, a nice gift), but 
often they are more subtle and nonmaterial in nature (e.g., ap- 
proval, affection). The compliance-for-reward exchange may 
be direct and explicit, of course, as when a parent offers an 
economic incentive to a child for doing his or her homework. 
But the transaction is often tacit or implicit in the relationship 
rather than directly stated. The salesperson who pushes used 
cars with special zeal, for example, may do so because he or 
she knows the company gives raises to those who meet a cer- 
tain sales quota. Coercive power derives from the ability to 
provide aversive or otherwise undesired outcomes to some- 
one. As with rewards, coercion can revolve around tangible 
and concrete outcomes, such as the use or threat of physical 
force, or instead involve outcomes that are nonmaterial and 
acquire their valence by virtue of less tangible features. The 
parent concerned with a child’s study habits might express 
disapproval for the child’s shortcomings in this regard, for 
example, and the salesperson might redouble his or her ef- 
forts at moving stock for fear of losing his or her job. 

Expert power is accorded those who are perceived to have 
superior knowledge or skills relevant to the target’s goals. 
Deference to such individuals is common when the target 
lacks direct personal knowledge regarding a topic or course 
of action. In the physician-patient relationship, for example, 
the patient typically complies with the physician’s instruc- 
tions to take a certain medicine, even when the patient has no 


External Control 385 


idea how the purported remedy will cure him or her. Knowl- 
edge, in other words, is power. Information power is related 
to expert power, except that it relates to the specific informa- 
tion conveyed by the source, not to the source’s expertise 
per se. A person could stumble on a piece of useful gossip, for 
example, and despite his or her general ignorance in virtually 
every aspect of his or her life, this person might wield con- 
siderable power for a time over those who would benefit from 
this information. Knowledge is power, it seems, even in the 
hands of someone who doesn’t know what he or she is talk- 
ing about. 

Referent power derives from people’s tendency to identify 
with someone they respect or otherwise admire. “Be like 
Mike” and “I am Tiger Woods,” for example, are successful 
advertising slogans that play on consumers’ desire to be sim- 
ilar to a cultural icon. The hoped-for similarity in such cases, 
of course, is stunningly superficial—all the overpriced shoes 
in the world won’t enable a teenager to defy gravity while 
putting a basketball through a net or drive a small white ball 
300 yards to the green in one stroke. Referent power is rarely 
asserted in the form of a direct request, operating instead 
through the pull of a desirable person, and can be manifest 
without the physical presence or surveillance of the influence 
agent. A young boy might shadow his older brother’s every 
move, for example, even if the brother hardly notices, and an 
aspiring writer might emulate Hemingway’s sparse writing 
style even though it is fair to say this earnest adulation is 
totally lost on Hemingway. 

Legitimate power derives from societal norms that accord 
behavior control to individuals occupying certain roles. The 
flight attendant who instructs 300 passengers to put their 
tables in an upright position does not have a great deal of 
reward or coercive power, nor is he or she seen as necessarily 
possessing deep expertise pertaining to the request, and it is 
even more unlikely that he or she is the subject of identifica- 
tion fantasies for most of the passengers. Yet this person 
wields enormous influence over the passengers because of 
the legitimate authority he or she is accorded during the 
flight. Legitimate power is often quite limited in scope. A 
professor, for example, has the legitimate authority to sched- 
ule exams but not to tell students how to conduct their per- 
sonal lives— unless, of course, he or she also has referent 
power for them. Legitimate power is clearly essential to soci- 
etal coordination—imagine how traffic at a four-way inter- 
section would fare if the signal lights failed and the police on 
the scene had to rely on gifts or their personal charisma to 
gain the cooperation of each driver. But blind obedience to 
those in positions of legitimate authority also has enormous 
potential for unleashing the worst in people, sometimes to the 
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detriment of themselves or others. In recognition of this po- 
tential, social psychologists have devoted considerable atten- 
tion to the nature of legitimate power, with special emphasis 
on obedience to authority. Not wanting to question this schol- 
arly norm, we highlight this topic in the following section. 


Obedience to Authority 


Guards herding millions of innocent people into gas cham- 
bers, soldiers mowing down dozens of farmers and villagers 
with machine guns, and hundreds of cult members waiting in 
line for lethal Kool-Aid that is certain to kill themselves and 
their children: These images may be unthinkable, but they are 
part of the legacy of the twentieth century. Nestled in the 
security of our homes, we are nonetheless affected by such 
undeniable examples of mass abdications of personal respon- 
sibility and decision making; they can keep us up nights, not 
to mention undermine our sense of control. Although recent 
times have no monopoly on genocide, the abominations of 
World War II intensified the drive to plumb the depths of so- 
cial influence, especially influence over the many by the few 
in the name of legitimate authority. 

The best-known and most provocative line of research on 
this topic is that of Stanley Milgram (1965, 1974), who con- 
ducted a set of controversial laboratory experiments in the 
early 1960s. Milgram wanted to document the extent to which 
ordinary people will take orders from a legitimate authority 
figure when compliance with the orders entails another per- 
son’s suffering. The idea was to replicate in a relatively be- 
nign setting the dynamics at work during wartime, when 
soldiers are given orders to kill enemy soldiers and citizens. 
In his experimental situation, ostensibly concerned with the 
psychology of learning, participant “teachers” were asked to 
deliver electric shocks to “learners” (who were actually ac- 
complices of Milgram) if the learners produced an incorrect 
response to an item on a simple learning task. In the initial 
study, Milgram (1965) found that 65% of the subjects cast in 
the teacher role obeyed the experimenter’s demand to pro- 
ceed, ultimately administering 450 volts of electricity to a 
learner (a mild-mannered, middle-aged man with a self- 
described heart condition) in an adjoining room, despite hear- 
ing the learner’s protests, screams, and pleas to stop 
emanating from the other room. Milgram subsequently per- 
formed several variations on this procedure, each designed to 
identify the factors responsible for the striking level of obe- 
dience initially observed. In one of the most intriguing 
variations, subjects were cast in the learner role as well as the 
teacher role, and the experimenter eventually told the teacher 
to cease administering shocks. Remarkably, some learners 
in this situation insisted that the teacher continue “teaching” 


them for the good of the experiment. Because the learner did 
not have the same degree of legitimacy as the experimenter 
did, however, none of the teachers acceded to the learner’s 
demand to continue shocking them. 

Milgram’s findings proved unsettling to scholars and 
laypeople alike. With the horrors of World War II still fairly 
fresh in people’s memories, Milgram’s research suggested 
that Hitler’s final solution was not only fathomable, but per- 
haps also likely to occur again under the right circumstances. 
After all, these findings were produced by people from a 
nation of self-professed mavericks whose ancestors had risen 
up against the motherland’s authority less than two centuries 
earlier. Subsequent research employing Milgram’s basic par- 
adigm has demonstrated comparable levels of obedience in 
many other countries, including Australia, Germany, Spain, 
and Jordan (Kilham & Mann, 1974; Meeus & Raaijmakers, 
1986). The tendency to defer to legitimate authority, even 
when the demands of authority run counter to one’s personal 
beliefs and inhibitions, appears to be robust, representing 
perhaps an integral part of human nature. 

The power of authority can derive from purely symbolic 
manifestations, such as titles or clothing, even when the 
ostensible authority has no credible claim to his or her role 
as a legitimate authority figure. A man wearing a security 
guard’s uniform, for example, can secure compliance with a 
request to pick up litter, even when the requests are made in 
a context outside the guard’s purview (Bickman, 1974). 
Even fictional symbols of authority can produce compliance. 
Television advertising trades on this tendency with astonish- 
ing commercial success. For example, the actor Robert 
Young, who played the part of Dr. Marcus Welby in a popu- 
lar TV doctor series in the 1960s, wore a white lab coat in a 
commercial for Sanka (a brand of decaffeinated coffee). He 
was not an expert on coffee and certainly not a real doctor, 
yet the symbols of his authority (the white lab coat, the 
association with Dr. Welby) were sufficient to increase dra- 
matically the sales of Sanka. Even when an actor states at 
the outset of a commercial pitch that J am not a doctor, but I 
play one on TV, his recommendations regarding cold reme- 
dies are followed by a significant portion of the viewing 
audience. This deference to titles and uniforms can have 
devastating effects. A study performed in a medical context, 
for example, found that 95% of nurses who received a 
phone call from a “doctor” agreed to administer a dangerous 
level of a drug to a patient (Hofling, Brotzman, Dalrymple, 
Graves, & Pierce, 1966). 

Although pressures to obey authority are compelling, 
obedience is not inevitable. Research has shown, for exam- 
ple, that obedience to authority is tempered when the vic- 
tim’s suffering is highly salient and when the authority figure 


is made to feel personally responsible for his or her actions 
(Taylor, Peplau, & Sears, 1997). Resistance to authority is 
enhanced, moreover, when the resister receives social sup- 
port and in situations in which he or she is encouraged to 
question the motives, expertise, or judgments of the author- 
ity figure (Taylor et al., 1997). It should be reiterated, how- 
ever, that legitimate authority serves important social 
functions and should not be viewed with a jaundiced eye 
only as a necessary evil in the human condition. Policeman, 
judges, elected representatives, and school crossing guards 
could not perform their duties if their power were not based 
on an aura of legitimacy. And as much as teachers like to be 
liked and to be seen as experts, their power over students in 
the classroom hinges to a large extent on students’ perceiving 
them as legitimate authority figures. Even parents, who 
wield virtually every other kind of power (reward, coercion, 
expertise, information) over their children, must occasion- 
ally remind their offspring who is ultimately in charge in 
order to exact compliance from them. Obedience to author- 
ity, in sum, is pervasive in informal and formal social rela- 
tions, and is neither intrinsically good nor intrinsically bad. 
Like many features of the human condition, its potential for 
good or evil is dependent on the restraint and judgment of 
those who exercise it. 


Limitations of External Control 


If the exercise of power always had its intended effect, both 
scholarly and lay interest in social influence would be mini- 
mal. Why bother obsessing over something as obvious as the 
tendency of people to defer to people in a position to offer 
rewards or threaten punishment? Is detailed experimentation 
really necessary to figure out why we listen to experts or 
model the behavior and attitudes of people we admire? And 
what could be more obvious than the observation that we typ- 
ically comply with the demands and requests of those who 
are perceived as entitled to influence us in this way? Fortu- 
nately for social psychologists—and perhaps for intellectu- 
ally curious laypeople as well—the story of social influence 
does not end with such self-evident conclusions, but rather 
unfolds with a far more interesting plotline. There is reason to 
think, in fact, that the general approach to influence outlined 
previously is among the least effective ways of implementing 
true change in people’s thoughts and feelings relevant to the 
behavior in question. Indeed, a fair portion of theoretical and 
research attention over the last 40 years has focused on the 
tendency for heavy-handed efforts at influence to boomerang, 
promoting effects opposite to those intended. This is espe- 
cially the case for attempted influence that trades on reward 
and coercive power, although the assumptions underlying 
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this line of theory and research would seem to hold true for 
legitimate power as well. 


Psychological Reactance 


To a certain extent, the failure of power-based approaches to 
induce change in people’s action preferences can be traced to 
the fundamental human conceit noted at the outset. People 
want to feel like they are the directors of their own fate (cf. 
Deci & Ryan, 1985), and accordingly are sensitive to attempts 
by others to diminish this self-perceived role. No one really 
likes to be told what to do, and influence attempts that are 
seen in this light run the risk of producing resistance rather 
than compliance. Reactance theory (J. W. Brehm, 1966; S. S. 
Brehm & Brehm, 1981) trades on the assumption that people 
like to feel free, specifying how people react when this feel- 
ing is undermined. The basic idea is that when personal free- 
doms are threatened, people act to reassert their autonomy 
and control. Commanding a child not to do something runs 
the risk of eliciting an / won t! rebuttal, for example, or reluc- 
tant compliance that disappears as soon as the surveillance is 
lifted (e.g., Aronson & Carlsmith, 1963). In effect, all the 
bases of power at the parent’s disposal—reward, coercion, 
referent, expert, legitimate—pale in comparison to the child’s 
distaste for having his or her tacit agreement removed from 
the parent-child exchange. 

Considerable evidence has been accumulated over the 
years in support of the basic tenets of reactance theory (cf. 
Burger, 1992). Research by Burger and Cooper (1979), for 
example, found that even something as basic and sponta- 
neous as humor appreciation is subject to reactance effects. 
Male and female college students were asked to rate ten car- 
toons in terms of funniness. Some participants rated the 
cartoons when alone, but others provided the ratings after re- 
ceiving instructions from confederates to give the cartoons 
high ratings. Results revealed that pressure by the confeder- 
ates tended to backfire, producing funniness ratings lower 
than those produced by participants not subject to the pres- 
sure. This effect was pronounced among individuals who had 
scored high on a preexperimental personality assessment of 
need for personal control. 

Some studies have produced rather counterintuitive find- 
ings that call into question the basis for certain public policy 
initiatives. In a study investigating attempts to reduce alco- 
hol consumption, for example, participants who received a 
strongly worded antidrinking message subsequently drank 
more than did those who received a moderately worded mes- 
sage (Bensley & Wu, 1991). The strongly worded message 
presumably was perceived by participants as a threat to their 
personal freedom, to which they reacted by drinking more 
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rather than less in an effort to assert their sense of control. 
Findings such as these cast into doubt the wisdom of the Just 
say no mantra of many contemporary drug education pro- 
grams aimed at young people. The slogan itself may promote 
the very behavior it is intended to discourage, because it rep- 
resents a rather direct short-circuiting of targets’ personal 
decision-making machinery. There is evidence, in fact, that 
the Just say no approach has backfired in some instances, 
producing increased rather than decreased consumption of 
illegal substances—although it is not entirely clear that this 
effect is due primarily to reactance (Donaldson, Graham, 
Piccinin, & Hansen, 1995). 

The experience of psychological reactance is not limited to 
influence techniques that trade on power per se. Indeed, the 
concern with protecting one’s self-perceived freedom can cur- 
tail the effectiveness of any influence attempt that is seen as 
such. The use of flattery to seduce a target into a new course of 
action, for example, can backfire if the target is aware—or 
simply suspicious—that the flattery is being strategically em- 
ployed for manipulative purposes (e.g., Jones & Wortman, 
1973). Indeed, any attempt to gain influence over another per- 
son by becoming attractive to him or her runs a serious risk of 
failure if the attempted ingratiation is transparent to the per- 
son. Jones (1964) has referred to this stumbling block to inter- 
personal influence as the “ingratiator’s dilemma.” Normally, 
we like to hear compliments, to have others agree with our 
opinions, and to interact with people who are desirable by 
some criterion. As intrinsically rewarding as these experi- 
ences are, they also make us correspondingly vulnerable to re- 
quests and other forms of influence from the people in 
question. When their compliments become obsequious or if 
their desirability is buttressed by a little too much name- 
dropping, we become suspicious that they are playing on this 
vulnerability with a particular agenda in mind. The result is 
resistance rather than assent to their subsequent requests, even 
requests that might otherwise seem quite reasonable. 

Reactance, in short, is a pervasive human tendency that 
sets clear limits on the effectiveness of all manner of social 
influence. Power-based forms of influence are particularly 
vulnerable to reactance effects, not only because they are 
linked to a restriction of freedom for targets, but also because 
they tend to be explicit and thus transparent to targets. Letting 
someone know that you are trying to influence him or her is a 
decidedly poor strategy—unless, of course, your real goal is 
to get him or her to do the opposite. 


Reverse Incentive Effects 


Twentieth-century social psychology is a story of two seem- 
ingly incompatible perspectives on human nature. For the 


first half of the century, social psychology accepted as 
received wisdom the notion that the behavior of organisms, 
humans included, is ultimately under the control of external 
reinforcement. The mindless S-R models invoked by radical 
behaviorists may not have been most theorists’ cup of tea, but 
no one seriously challenged the assumption that contingen- 
cies of positive and negative reinforcement play a pervasive 
role in shaping people’s psychological development as well 
as their specific behavior in different contexts. People’s con- 
cern over personal freedom was certainly recognized by 
social psychologists, but more often than not this penchant 
was considered an independent force that competed with re- 
inforcement for the hearts and minds of people in their daily 
lives. Thus, people struggled to control their impulses, resist 
temptation, delay gratification, and maintain their dignity in 
the face of incentives to do otherwise. 

After mid-century, something akin to a phase transition 
began to take place in social psychology. Fueled in large part 
by an emerging emphasis on the importance of cognitive me- 
diation, theory and research began to question the imperial 
role of rewards and punishments in shaping personal and in- 
terpersonal behavior. People’s latent preoccupation with self- 
determination, for example, came to be seen not simply as a 
force that competed with reinforcement, but rather as a con- 
cern that was activated by explicit reinforcement contingen- 
cies (cf. de Charms, 1968; Deci & Ryan, 1985). Thus, the 
awareness of a contingency was said to sensitize people to 
the potential loss of self-determination if they were to adjust 
their behavior in accordance with the contingency. In effect, 
awareness of a contingent relation between behavior and re- 
ward weakened the power of the contingency, leaving the de- 
sire for self-determination the dominant casual force. This 
reasoning, of course, is consistent with the assumptions of 
reactance theory, described above. The dethroning of rein- 
forcement theory, however, went far beyond a recognition of 
people’s need for autonomy, freedom, and the like. Two 
major perspectives in particular captured the academic spot- 
light for extended periods of time, and today they still stand 
as basic insights into human motivation—including motiva- 
tion relevant to social influence. 

The first of these, cognitive dissonance theory (Festinger, 
1957), sparked psychologists’ imagination in large part 
because of its seemingly counterintuitive take on the role of 
rewards in shaping thought and behavior. The essence of the 
theory is a purported drive for consistency in people’s 
thoughts and feelings regarding a course of action. When 
inconsistency arises, it is experienced as aversive arousal, 
which motivates efforts to eliminate or at least reduce the 
inconsistency so as to reestablish affective equilibrium. This 
sounds straightforward enough, but under the right conditions 


a concern for restoring consistency can produce what can be 
described as reverse incentive effects (cf. Aronson, 1992; 
Wicklund & Brehm, 1976). In a prototypical experimental 
arrangement, subjects are induced to perform an action that 
they are unlikely to enjoy (e.g., a repetitive or boring task) or 
one that conflicts with an attitude they are likely to hold (e.g., 
writing an essay in support of raising tuition at their univer- 
sity). At this point, varying amounts of monetary incentive 
are offered for the action’s performance; some subjects are 
offered a quite reasonable sum (e.g., $20), others are offered 
a mere pittance (e.g., $1). Virtually all subjects agree to par- 
ticipate regardless of the incentive value, so technically they 
all perform a counterattitudinal task (i.e., a task that conflicts 
with their attitude concerning the task). 

According to Festinger, the dissonance experienced as a 
result of such counterattitudinal behavior can be reduced by 
changing one of the cognitive elements to make it consistent 
with the other element. In this situation, the relevant cognitive 
elements for subjects presumably are their feelings about the 
action and their awareness they have performed the action. 
Because the latter thought cannot be changed (i.e., the damage 
is done), the only cognitive element open to revision is their 
attitude toward the action (which conveniently had not been 
assessed yet). So, the theory holds, subjects faced with this 
cognitive dilemma will adjust their attitude toward the action 
to make it consistent with the fact that they have engaged in the 
action. Subjects who performed a boring task now consider it 
interesting or important. Subjects who wrote an essay espous- 
ing an unpopular position now indicate they hold that position 
themselves. In effect, subjects rationalize their behavior by 
indicating that it really reflected their true feelings all along. 

At this point, one might assume that all subjects would 
follow this scenario. But revising one’s attitude is not the 
only potential means of reducing the dissonance brought on 
by counterattitudinal behavior. Festinger suggested that a 
person can maintain his original attitude if he or she can jus- 
tify the counterattitudinal behavior with other salient and rea- 
sonable cognitive elements. This is where the large versus 
small reward manipulation enters the picture. A subject of- 
fered a large incentive (e.g., $20) for performing the act 
can use that fact to justify what he or she has done. Who 
wouldn’t do something boring or even write an essay one 
doesn’t believe if the price were right? The reward, in other 
words, obviates the psychological need to change one’s feel- 
ings about what one has done. A subject offered a token in- 
centive (e.g., $1), on the other hand, cannot plausibly argue 
that the reward justified engaging in the boring activity or 
writing the disingenuous essay. The only recourse in this sit- 
uation is to revise one’s own attitude and indicate liking for 
the activity or belief in the essay’s position. 
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Note the upshot here: The smaller the contingent reward, 
the more positive one’s resultant attitude toward the behavior; 
or conversely, the larger the contingent reward, the more neg- 
ative one’s attitude toward the rewarded behavior. This repre- 
sents a rather stunning reversal of the conventional wisdom 
regarding the use of rewards to influence people’s behavior. 
To be sure, large rewards are useful—often necessary—to get 
a person to perform an otherwise undesirable activity or to 
express an unpopular attitude. But the effect is likely to be 
transitory, lasting only as long as the reward contingency is in 
place. To influence the person’s underlying thoughts and feel- 
ings regarding the action, and thereby bring about a lasting 
change in his or her behavioral orientation, it is best to em- 
ploy the minimal amount of reward. In effect, lasting social 
influence requires reconstruction within the person rather 
than inducements from the outside. 

Mental processes are notoriously hard to pin down objec- 
tively, of course, and this fact of experimental psychology 
has always been a problem for dissonance theory. Festinger 
and his colleagues did not attempt to measure what they 
assumed to be the salient cognitions at work in the reward 
paradigm, nor have subsequent researchers fared much better 
in providing definitive evidence regarding the stream of 
thought presumably underlying the experience and reduction 
of psychological tension. With this gaping empirical hole in 
the center of the theory, it is not surprising that other theorists 
soon rushed in to fill the gap with their own inferences about 
the true mental processes at work. In effect, the results ob- 
served in cognitive dissonance research served as something 
of a Rorschach for subsequent theorists, each of whom saw 
the same picture but imparted somewhat idiosyncratic inter- 
pretations of its meaning. Not all interpretations have fared 
well, however, and among those that have, there is sufficient 
common ground to characterize (in general terms at least) a 
viable alternative to the dissonance formulation. 

Central to the alternative depiction of reverse incentive ef- 
fects is the assumption that people’s minds are first and fore- 
most interpretive devices, designed to impose coherence on 
the sometimes diverse and often ambiguous elements of per- 
sonal experience. In analogy to Gestalt principles of percep- 
tion, cognitive processes “go beyond the information given” 
(Bruner & Tagiuri, 1954) to impart higher-order meaning that 
links the information in a stable and viable structure. With re- 
spect to the dissonance paradigm, subjects’ cognitive playing 
field is presumably populated with an abundance of salient or 
otherwise relevant information. These cognitive elements in- 
clude the nature of the task (the activity or essay) and the 
money received, of course, but they no doubt encompass an 
assortment of other thoughts and feelings as well. Thus, sub- 
jects may be sensitized to their sense of personal freedom and 
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control in that context, for example, or perhaps to their sense 
of personal competence in performing the task. For that mat- 
ter, subjects might also be considering their feelings about the 
experimenter, pondering the value of the experiment, or 
rethinking the value of psychological research in general. In 
view of the plethora of likely cognitive elements and the po- 
tential for these elements to come in and out of focus in the 
stream of thought, the achievement of coherence is anything 
but a trivial task. What processes are at work to impart 
coherence to this complex and dynamic array of information? 
And what psychological dimensions capture the resultant 
coherence? 

There is hardly a shortage of relevant theories. Several 
early models, for example, emphasized processes of causal 
attribution (cf. Bem, 1972; Jones & Davis, 1965; Kelley, 
1967) that were said to promote personal interpretations 
favoring either internal causation (e.g., personal beliefs and 
desires) or external causation (most notably, the monetary 
incentive). In this view, a large incentive provides a reason- 
able and sufficient cause for engaging in the activity, short- 
circuiting the need to make inferences about the causal role 
of one’s beliefs or desires. A small incentive, on the other 
hand, is not perceived as a credible cause for taking the time 
and expending the effort to engage in the activity, so one in- 
stead invokes relevant beliefs and desires as causal forces 
for the behavior. In effect, the counterintuitive influence 
of rewards is a testament to their perceived efficacy in caus- 
ing people to do things they might not otherwise do. Causal 
attribution, of course, is not the only plausible endpoint of 
coherence concerns. Other well-documented dimensions 
relevant to higher-order integrative understanding include 
evaluative consistency (cf. Abelson et al., 1968), explanatory 
coherence (cf. Thagard & Kunda, 1998), narrative structure 
(cf. Hastie, Penrod, & Pennington, 1983), and level of action 
identification (cf. Vallacher & Wegner, 1987). It is hardly sur- 
prising, then, that a number of other models have been fash- 
ioned and tested in an attempt to explain why rewards 
sometimes fail to influence people’s beliefs and desires in 
the intended direction (e.g., Csikszentmihalyi, 1990; Deci & 
Ryan, 1985; Kruglanski, 1975; Harackiewicz, Abrahams, & 
Wageman, 1987; Trope, 1986; Vallacher, 1993). 

Taken together, the various models emphasizing inference 
and interpretation have a noteworthy advantage over the 
standard dissonance reduction model in that they predict re- 
verse incentive effects for any action, not just those that are 
likely to be viewed in a context-free manner as aversive by 
some criterion (e.g., repetitive, boring, pointless, time- 
consuming, etc.). Indeed, some of the most interesting re- 
search has established conditions under which otherwise 
enjoyable or interesting activities can seemingly lose their 


intrinsic interest by virtue of their association with material 
rewards (cf. Lepper & Greene, 1978). Rewards do not always 
have this effect, however, a point that has been incorporated 
with varying degrees of success into many of these models. 
Still, the theoretical preoccupation with the effects of rewards 
has generated an unequivocal lesson: The success or failure 
of attempted influence depends on how the attempt engages 
the mental machinery of the target. Rewards can be perceived 
as bribery and aversive consequences can mobilize resis- 
tance, for example, and both can activate concerns about 
one’s freedom of action and self-determination. Social influ- 
ence does not operate on blank minds, but rather encounters 
an active set of interpretative processes that operate accord- 
ing to their own dynamics to make sense of incoming infor- 
mation (Vallacher, Nowak, Markus, & Strauss, 1998). 


MANIPULATION 


Change in people’s behavior can be imposed from the outside 
by the exercise of power, but this approach to influence may 
prove effective only as long as the relevant contingencies 
(reward, punishment, expertise, information) are in place. To 
influence people in a more fundamental sense, it is necessary 
to include them as accomplices in the process. A self-sustain- 
ing change in behavior requires a resetting of the person’s in- 
ternal state—her or her beliefs, preferences, goals, and so 
on—in a way that preserves the person’s sense of freedom 
and control. Assuming the influence agent has an agenda that 
does not coincide with the target’s initial preferences and 
concerns, the agent may then find it necessary to employ sub- 
tle strategies designed to manipulate the relevant internal 
states of the target. Couched in these terms, social influence 
boils down to various means by which an agent can obtain 
voluntary compliance from targets in response to his or her 
requests, offers, or other forms of overture. Research has 
identified several compliance-inducing strategies, some of 
which rely on basic interpersonal dynamics, others of which 
reflect the operation of basic social norms. We discuss spe- 
cific manifestations of these general approaches in the fol- 
lowing sections. 


Manipulation Through Affinity 


Could you pass the broccoli? Will you marry me? Whether 
the agenda at issue is mundane or life-altering, requests pro- 
vide the primary medium by which people seek compliance 
from one another. Requests are a fairly routine feature of 
everyday social interaction and have been examined for their 
effectiveness under experimental arrangements designed to 


identify basic principles. However, requests are also central 
to businesses, charitable organizations, political parties, and 
other societal entities that depend on contributions of money, 
effort, or time from the citizenry. Accordingly, much of the 
knowledge concerning compliance has been gleaned from 
observation—sometimes participant observation—of profes- 
sional influence agents operating in charitable, commercial, 
or political contexts (cf. Cialdini, 2001). Experimentation 
and real-world observation provide cross-validation for 
one another, and together have generated a useful taxonomy 
of effective strategies for obtaining compliance. Many of 
these strategies are based on what can be called the affinity 
principle—the tendency to be more compliant in the hands of 
an influence agent we like as opposed to dislike. 


The Affinity Principle 


Whoever suggested caution in the face of friends bearing 
gifts may not have been advocating cynicism, but rather self- 
preservation. Extensive research supports the commonsense 
notion that personal affinity motivates compliance. From 
sales professionals, the consummate chameleons of the com- 
mercial world, to con artists preying on the elderly and 
college students calling home for cash, several effective in- 
fluence strategies rest on the influence agent’s being liked, 
known by, or similar to the target. When such affinity exists 
between agent and target, ruse is not necessarily a prerequi- 
site for compliance. Quite the opposite, in fact, can be true. 

Consider, for example, the Tupperware Corporation, 
which has exploited the power of friendship in an unprece- 
dented fashion. It has been reported that a Tupperware party 
occurs somewhere every 2.7 seconds (Cialdini, 1995)— 
although they typically last much longer than that, which 
suggests the sobering possibility that there is never a moment 
without one. The format is as follows: A host invites friends 
and relatives over to his or her home to participate in a gath- 
ering at which Tupperware products are demonstrated by a 
company representative. Armed with the knowledge that 
their friend and host will receive a percentage of sales, the at- 
tendees tend to buy willingly, because they are purchasing 
from someone they know and like rather than from a stranger. 
As confirmation for the pivotal role of “liking” in this con- 
text, Frenzen and Davis (1990) found that 67% of the vari- 
ance in purchase likelihood was accounted for by socials ties 
between the hostess and the guest and only 33% by product 
preference. 

Personal affinity has been shown to be a potent compli- 
ance inducer even in the absence of the liked individual. 
Anecdotal evidence of this phenomenon abounds in our daily 
lives. It is the rare parent who has not sent his or her child 
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around to friends and neighbors to collect for a school 
walkathon or raffle. The child, hardly the embodiment of a 
“compliance professional” (Cialdini, 2001), represents the 
parent who is (one would hope) liked by the target. In the 
same vein, Cialdini (1993) discovered that door-to-door 
salespersons commonly ask customers for names of friends 
upon whom they might call. Although we may wonder what 
kind of friends a person might surrender in this way, rejecting 
the salesperson under these circumstances apparently is seen 
as a rejection of the referring friend—the person for whom 
affinity is felt. The potency of the affinity principle per se 
may be diminished by the physical absence of the liked per- 
son, but the allusion appears nonetheless to render the target 
more susceptible to other compliance tactics. 

The affinity principle is not limited to influence seekers 
and their surrogates, but applies as well to those who are 
known or at least recognized by the target. During elections, 
for example, voters have been shown to cast their ballots for 
candidates with familiar-sounding names (Grush, 1980; 
Grush, McKeough, & Ahlering, 1978). In similar fashion, 
survey response rates sometimes double if the sender’s name 
is phonetically similar to the recipient’s (Garner, 1999). 
Physical attractiveness represents another extension of the 
affinity principle. A total stranger blessed with good looks 
has a distinct advantage over his or her less attractive 
counterparts in securing behavioral compliance (Benson, 
Karabenick, & Lerner, 1976) and attitude change (Chaiken, 
1979). Good grooming, for example, accounts for greater 
variance in hiring decisions than does the applicant’s job 
qualifications, although interviewers deny the impact of at- 
tractiveness (Mack & Rainey, 1990). In political campaigns, 
meanwhile, there is evidence that a candidate’s attractiveness 
can substantially influence voters’ perceptions of him or 
her and affect their voting behavior as well (Budesheim & 
DePaola, 1994; Efran & Patterson, 1976). Even criminal 
justice is not immune to the power of physical attractive- 
ness. Better-looking defendants generally receive more fa- 
vorable treatment in the criminal justice system (Castellow, 
Wuensch, & Moore, 1990) and often receive lighter sen- 
tences when found guilty (Stewart, 1980). 


Similarity and Affinity 


Similarity between influence agent and target represents a 
special case of the affinity principle. It is rarely a coincidence 
when a car salesperson claims to hail from a customer’s 
home state or when an apparel salesperson claims to have 
purchased the very same outfit the vacillating customer is 
sporting. People like those who are similar to them (cf. 
Byrne, 1971; Byrne, Clore, & Smeaton, 1980; Newcomb, 
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1961), and in accordance with the affinity principle, they are 
inclined to respond affirmatively to requests from similar 
others as well. The similarity effect encompasses a wide 
range of dimensions, including opinions, background, 
lifestyle, and personality traits (Cialdini & Trost, 1998). Even 
similarity in nonverbal cues, such as posture, mood, and 
verbal style, has been observed to increase compliance 
(LaFrance, 1985; Locke & Horowitz, 1990; Woodside & 
Davenport, 1974). The effect of similarity is quite pervasive, 
having been demonstrated across a wide range of variation in 
age, cultural background, socioeconomic status, opinion top- 
ics, and relationship types (cf. Baron & Byrne, 1994). 

The power of similarity to elicit compliance has been 
observed even when the dimension of similarity is decidedly 
superficial in nature. Sometimes outward manifestations 
of similarity such as clothing are all that are required. 
Emswiller, Deaux, and Willits (1971), for example, arranged 
for confederates to dress as either “straight” or “hippie” and 
had them ask fellow college students for a dime to make a 
phone call. When the confederate and target subject were sim- 
ilar in their respective attire, compliance was observed over 
two thirds of the time. When the confederate-target pair dif- 
fered in clothing type, however, less than half of the students 
volunteered the dime. In a related vein, Suedfeld, Bochner, 
and Matas (1971) observed that if antiwar protestors were 
asked by a similarly dressed confederate to sign a petition, 
they tended to do so without even reading the petition. Auto- 
matic compliance to the requests of others perceived to be 
similar has a decidedly nonthinking quality to it. The very au- 
tomaticity of the similarity principle, however, may have im- 
portant adaptive significance. By using this heuristic to make 
quick decisions regarding compliance requests, people can 
allocate their valuable but limited mental resources to other 
types of judgment and decision-making situations defined in 
terms of ambiguous, conflicting, or complex information. 


Esteem and Affinity 


Perhaps even more basic than our propensity to do things for 
those we like is our need to be liked by those we know (cf. 
G. W. Allport, 1939; Baumeister, 1982; Tesser, 1988). To be 
sure, for some people the desire to be liked can be overridden 
by other motives, such as the need for acceptance (Rudich & 
Vallacher, 1999) or desires to be seen accurately (Trope, 
1986) or in accordance with one’s personal self-view 
(Swann, 1990). For most people most of the time, however, it 
is hard to resist the allure of flattery. Receiving positive feed- 
back from someone is highly rewarding and tends to promote 
a reciprocal exchange with the source. In other words, we 
like others who seem to like us. When activated in this way, 


the affinity principle makes the recipient of flattery a poten- 
tial target for influence by the flatterer. 

Flattery has a long history as an effective compliance tech- 
nique, both inside and outside the laboratory (cf. Carnegie, 
1936/1981; Cialdini, 2001). Drachman, DeCarufel, and Insko 
(1978), for example, arranged for men to receive positive or 
negative comments from a person in need of a favor. The per- 
son offering praise alone was liked most, even if the targets 
knew that the flatterer stood to gain from their liking them. 
Moreover, inaccurate compliments were just as effective as 
accurate compliments in promoting the target’s affinity for 
the flatterer. So influence agents need not bother gathering 
facts to support their complimentary onslaught; simply ex- 
pressing positive comments may be sufficient to woo the tar- 
get and thereby gain his or her compliance. At the same time, 
however, the ingratiator’s dilemma (Jones, 1964) discussed 
earlier sets limits on the effectiveness of the esteem principle. 
In particular, praise and other forms of ingratiation (e.g., 
opinion conformity with the target) can backfire if the ingra- 
tiator’s ulterior motives are readily transparent and the praise 
is seen as solely manipulative. And, of course, the influence 
agent can simply overdo the flattery and come across as disin- 
genuous and obsequious. 


Manipulation Through Scarcity 


From childhood on, we want what we lack—be it toys, 
money, fancy cars, or greener grass. The cache of the unat- 
tainable, for example, is a sure bet to spark competition and 
fuel sales in commercial settings. Cries of today and today 
only and in limited quantities have been known to drive shop- 
pers like lemmings toward the blue-light special, and con- 
venient Christmastime shortages of Tickle Me Elmos or 
Furbees stoke the fires of demand for such toys. We may 
see ourselves as impervious to such base tactics, but the 
power of the human tendency to view scarcity as an indicator 
of worth or desirability is undeniable, well-documented— 
and routinely exploited as a method of securing compliance 
(cf. Cialdini, 2001). 

It’s interesting in this regard to consider the tendency for 
efforts at censorship to backfire, creating a stronger demand 
than ever for the forbidden fruit. The prohibition of alcohol 
in the 1920s, for example, only whetted people’s appetite for 
liquor and spawned the rise of secret establishments (the 
speakeasy) that provided access to the scarce commodity. 
Antipornography crusades typically have the same effect, in- 
creasing interest in the banned books and magazines, even 
among people who might not otherwise consider this particu- 
lar genre. Telling people they cannot read or see something 
can increase—or even create—a desire to take a proverbial 


peek at the hard-to-find commodity. By the same token, after 
the censorship or prohibition is lifted, interest in the object in 
question tends to wane. 

Surprisingly, there is a paucity of research on the psychol- 
ogy of scarcity. The enhanced desirability of scarce items 
may reflect a perceived loss of freedom to attain the items, in 
line with reactance theory. The censorship example certainly 
suggests that people value an object in proportion to the in- 
junction against having it. People don’t like having their free- 
dom threatened, and making an item difficult to obtain or 
forbidding an activity clearly restricts people’s options with 
respect to the item and the activity. Reactance is a reasonable 
model, but one can envision other theoretical contenders. 
Simple supply-and-demand economics, for example, has a 
direct connection to the scarcity phenomenon. The lower the 
supply-demand ratio with respect to almost any item, the 
more those who control the resource can jack up the price and 
still count on willing customers. Perhaps there are viable evo- 
lutionary reasons for the heightened interest in scarce re- 
sources. The conditions under which we evolved were harsh 
and uncertain, after all, and there may have been selection 
pressures favoring our hominid ancestors who were success- 
ful at securing and hording valuable but limited food supplies 
and other resources. 

Yet another possibility centers on people’s simultaneous 
desires to belong and to individuate themselves from the 
groups to which they belong (e.g., Brewer, 1991). Scarcity 
has a way of focusing collective attention on a particular ob- 
ject, and there may be a sense of social connectedness in 
sharing the fascination with others. Waiting in line with 
throngs of shoppers hoping to secure one of the limited 
copies of the latest Harry Potter volume, for example, is ar- 
guably an annoying and irrational experience, but it does 
make the person feel as though he or she is on the same wave- 
length as people who would otherwise be considered total 
strangers. At the same time, if the person is one of the lucky 
few who manages to secure a copy before the shelves are 
cleared, he or she has effectively individuated him- or herself 
from the masses. In essence, influence appeals based on 
scarcity may be effective because they provide a way for peo- 
ple to belong to and yet stand out from the crowd in a world 
where he or she may routinely feel both alienated and 
homogenized. 


Manipulation Through Norms 


Human behavior, compliance included, is driven to a large 
extent by social norms—context-dependent standards of be- 
havior that exert psychological pressure toward conformity. 
At the group level, norms provide continuity, stability, and 
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coordination of behavior among individuals. At the individ- 
ual level, norms provide a moral compass for deciding how to 
behave in situations that might offer a number of action alter- 
natives. The norm of social responsibility (e.g., Berkowitz & 
Daniels, 1964), for example, compels us to help those less 
fortunate than ourselves, and the norm of equity prevents us 
from claiming excessive compensation for minimal contribu- 
tion to a group task (cf. Berkowitz & Walster, 1976). Norms 
pervade social life, and thus provide raw material for social 
influence agents. By tapping into agreed-upon and internal- 
ized rules for behavior, those who are so inclined can extract 
costly commitments to behavior from prospective targets 
without having to flatter them. 


The Norm of Reciprocity 


The obligation to repay what others provide us appears to be 
a universal and defining feature of social life. All human so- 
cieties subscribe to the norm of reciprocity (Gouldner, 1960), 
which is understandable in light of the norm’s adaptive value 
(Axelrod, 1984). The sense of future obligation engendered 
by this norm promotes and maintains both personal and for- 
mal relationships. And when widely embraced by people as a 
shared standard, the reciprocity norm lends predictability, in- 
terpersonal trust, and stability to the larger social system. 
Transactions involving tangible assets are only a subset of the 
social interactions regulated by reciprocity. Favors and invi- 
tations are returned, Christmas cards are sent to those who 
send them, and compliments are rarely accepted without find- 
ing something nice to say in return (Cialdini, 2001). 

The social obligation that there be a give for every take is 
well-documented (DePaulo, Brittingham, & Kaiser, 1983; 
Eisenberger, Cotterell, & Marvel, 1987; Regan, 1971). Even 
when gifts and favors are unsolicited (or unwanted), the re- 
cipient feels compelled to provide something in return. The 
ability of uninvited gifts to produce feelings of obligation in 
the recipient is successfully exploited by many organizations, 
both charitable and commercial. People may not need per- 
sonalized address labels, key rings, or hackneyed Christmas 
cards, but after they have been received, it is difficult not to 
respond to the organization’s request for a “modest contribu- 
tion” (e.g., Berry & Kanouse, 1987; Smolowe, 1990). A par- 
ticularly vivid example of this tendency is provided by the 
Hare Krishna Society (Cialdini, 2001). The members of this 
religious sect found that they could dramatically increase the 
success of their solicitations in airports simply by giving trav- 
elers a free flower before asking for donations. People find it 
hard to turn down a request for money after receiving an un- 
solicited gift, even something as irrelevant to one’s current 
needs as a flower. That receiving a flower is not exactly the 
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high point of the recipients’ day is confirmed by Cialdini’s 
observation that the flower more often than not winds up in a 
nearby waste container shortly after the flower-for-money 
transaction has been completed. 

Reciprocity can have the subsidiary effect of increasing 
the recipient’s liking for the gift- or favor-giver, but the norm 
can be exploited successfully without implicit application of 
the affinity principle (e.g., Regan, 1971). Affect does enter 
the picture, however, when people fail to uphold the norm. 
Nonreciprocation runs the risk of damaging an exchange 
relationship (Cotterell, Eisenberger, & Speicher, 1992; 
Meleshko & Alden, 1993) and may promote reputational 
damage for the offender (e.g., moocher, ingrate) that can 
haunt him or her in future transactions. Somewhat more sur- 
prising is evidence that negative feelings can be engendered 
when the reciprocity norm is violated in the reverse direction. 
One might think that someone who provides a gift but does 
not allow the recipient to repay would be viewed as generous, 
unselfish, or altruistic (although perhaps somewhat mis- 
guided or naive). But under some circumstances, such a per- 
son is disliked for his or her violation of exchange etiquette 
(Gergen, Ellsworth, Maslach, & Seipel, 1975). This tendency 
appears to be universal, having been demonstrated in U.S., 
Swedish, and Japanese samples. 

Cooperation is an interesting manifestation of the reci- 
procity norm. Just as the act of providing a gift or a favor 
prompts repayment, cooperative behavior tends to elicit co- 
operation in return (Braver, 1975; Cialdini, Green, & Rusch, 
1992; Rosenbaum, 1980) and can promote compliance with 
subsequent requests as well (Bettencourt, Brewer, Croak, & 
Miller, 1992). This notion is not lost on the car salesperson 
who declares that he or she and the customer are on “the same 
side” during price negotiations, and then appears to take up 
the customer’s fight against their common enemy, the sales 
manager. Even if this newly formed alliance comes up short 
and the demonized sales manager purportedly holds fast on 
the car’s price, the customer may feel sufficiently obligated to 
repay the salesperson’s cooperative overture with a purchase. 

A related form of reciprocity is the tactical use of conces- 
sions to extract compliance from those who might otherwise 
be resistant to influence. The strategy is to make a request 
that is certain to meet with a resounding no, if not a rhetorical 
are you kidding? The request might call for a large invest- 
ment of time and energy, or perhaps for a substantial amount 
of money. After this request is turned down, the influence 
agent follows up with a more reasonable request. In effect, 
the influence agent is making a concession and, in line with 
the reciprocity norm, the target now feels obligated to make a 
concession of his or her own. A study by Cialdini et al. (1975) 
illustrates the effectiveness of what has come to be known as 


the door-in-the-face technique. Posing as representatives of a 
youth counseling program, Cialdini et al. approached college 
students to see if they would agree to chaperon a group of 
juvenile delinquents for several hours at the local zoo. Not 
surprisingly, most of them (83%) refused. The results were 
quite different, though, if Cialdini et al. had first asked the 
students to do something even more unreasonable—spending 
2 hours per week as counselors to juvenile delinquents for a 
minimum of 2 years. After students refused this request—all 
of them did—the smaller zoo-trip request was agreed to by 
50% of the students, a tripling of the compliance rate. The 
empirical evidence for the door-in-the face technique is im- 
pressive (cf. Cialdini & Trost, 1998) and largely supports the 
reciprocity of concessions interpretation. 

The power of reciprocal concessions is also apparent in 
the that’s not all technique, which is a familiar trick of the 
trade among salespeople (Cialdini, 2001). The tactic involves 
making an offer or providing a come-on to a customer, then 
following up with an even better offer before the target has 
had time to respond to the initial offer. This technique is used 
fairly routinely to push big-ticket commercial items. A sales- 
person, for example, quotes a price for a large-screen TV, and 
while the interested but skeptical couple is thinking it over, 
he or she adds, “but that’s not all—if you buy today, I’m au- 
thorized to throw in a free VCR.” Research confirms that the 
effectiveness of the that’s not all technique is indeed attribut- 
able in part to the creation of a felt need in the target to reci- 
procate the agent’s apparent concession (e.g., Burger, 1986), 
although the contrast between the initial and follow-up con- 
cession plays a role as well. In the real world, the knowledge 
that people tend to reciprocate concessions provides a cor- 
nerstone of negotiation and dispute resolution. The bargain- 
ing necessary to reach a compromise solution in such 
instances invariably hinges on one party’s making a conces- 
sion with the assumption that the other party will follow suit 
with a concession of his or her own. This phenomenon can be 
seen at work in a wide variety of contexts, including busi- 
ness, politics, international diplomacy, and marriage. 


Reciprocity in Personal Relationships 


The norm of reciprocity is not limited to transactions between 
people who otherwise would have little to do with one an- 
other (e.g., salespeople and consumers), but rather provides a 
foundation for virtually every kind of social relationship. The 
reciprocity norm even plays a role in personal relationships, 
serving to calibrate the fairness in people’s ongoing interac- 
tions with friends and lovers. The trust and warmth neces- 
sary to maintain a personal relationship would be impossible 
to maintain if either partner felt that his or her overtures of 


affection, self-disclosures, offers of assistance, and birthday 
gifts went unreciprocated (cf. Lerner & Mikula, 1994). There 
are two complications here, however. First, the partners to a 
relationship are not always equally invested in or dependent 
on the relationship (e.g., Rusbult & Martz, 1995). In terms of 
social exchange theory (Kelley & Thibaut, 1978; Thibaut & 
Kelley, 1959), the comparison level for alternatives (CL) 
for each partner may be substantially different, and this dif- 
ferential dependency can promote exploitative behavior by 
the less dependent person. In effect, the person who feels 
more confident that he or she could establish desirable alter- 
native relationships (i.e., the person with the higher CL) can 
set the terms of exchange in the relationship. This power 
asymmetry need not be discussed explicitly in order for it to 
promote inequality in overt expressions of affection, the allo- 
cation of duties and responsibilities, and decision making. 

The second complication arises in relationships that 
achieve a certain threshold of closeness. Intimate partners are 
somewhat loathe to think about their union in economic, tit- 
for-tat terms, preferring instead to emphasize the communal 
aspect of their relationship (cf. M. S. Clark & Mills, 1979). 
They feel they operate on the basis of need rather than equity 
or reciprocity, and this perspective enables them to make sac- 
rifices for one another without expecting compensation or re- 
payment. The apparent suspension of reciprocity may be 
more apparent than real, however. The issue is not reciproc- 
ity per se, but rather the time scale on which reciprocity and 
other exchange metrics are calculated. What looks like self- 
less and unrequited sacrifice by one person in the short run 
can be viewed as inputs that are eventually compensated by 
the other person in one form or another (cf. Foa & Foa, 
1974). Depending on the sacrifice (e.g., fixing dinner vs. tak- 
ing on a second job), the time scale for repayment can vary 
considerably (e.g., hours or days vs. weeks or even years), 
but at some point the scales need to be balanced. The sense 
that one has been treated unfairly or exploited—or simply 
that one’s assistance and affection have not been duly 
reciprocated—can ultimately spoil a relationship and bring 
about its dissolution. 


Commitment 


Although it is not usually listed as a social norm, commitment 
can influence behavior as much as do reciprocity, equity, re- 
sponsibility, and other basic social rules and expectations 
(Kiesler, 1971). After people have committed themselves to 
an opinion or course of action, it is difficult for them to 
change their minds, recant, or otherwise fail to stay the 
course. Commitment does not derive its power solely from 
the anger and disappointment that breaking of a commitment 
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would engender in others—although this certainly counts for 
something—but also from a basic desire to act consistently 
with one’s point of view. A commitment that is expressed 
publicly, whether in front of a crowd or to a single individual, 
is especially effective in locking in a person’s opinion or 
promise, making it resistant to change despite the availability 
of good reasons for reconsideration (cf. Deutsch & Gerard, 
1955; Schlenker, 1980). 

Agents of influence play on this seemingly noble ten- 
dency, often for decidedly nonnoble purposes of their own. 
Several specific techniques have been observed in real-world 
settings and confirmed in research (Cialdini & Trost, 1998). 
Perhaps the best-known tactic is referred to as the foot-in-the- 
door, which is essentially the mirror image of the door-in-the- 
face tactic. Rather than starting out with a large request and 
then appearing to make a concession by making a smaller re- 
quest, the foot-in-the-door specialist begins with a minor 
request that is unlikely to meet with resistance. After securing 
committing with this request, the influence agent ups the 
ante by making a far more costly request that is consistent 
with the initial request. Because of commitment concerns, it 
can be very difficult at this point for the target to refuse com- 
pliance. A series of clever field experiments (Freedman & 
Fraser, 1966) provide compelling evidence for the effective- 
ness of this tactic. In one study, suburban housewives were 
contacted and asked to do something that most of them (78%) 
refused to do: allow a team of six men from a consumer 
group to come into their respective homes for 2 hours to 
“enumerate and classify all the household products you 
have.” Another group of housewives was contacted and pre- 
sented with a much less inconvenience-producing request— 
simply answering a few questions about their household 
soaps (e.g., “What brand of soap do you use in your kitchen 
sink?”). Nearly everyone complied with this minor request. 
These women were contacted again three days later, but this 
time with the larger home-visit request. In this case, over 
half the women (52%) complied with the request and allowed 
the men to rummage through their closets and cupboards for 
2 hours. 

The commitment process underlying this tactic goes be- 
yond the target’s concern with maintaining consistency with 
the action per se. It also engages the target’s self-concept with 
respect to the values made salient by the action. Thus, the 
women who complied with the initial request in the Freedman 
and Fraser (1966) studies were presumably sensitized to their 
self-image as helpful, public-spirited individuals. To maintain 
consistency with this suddenly salient (and perhaps newly 
enhanced) self-image, they felt compelled to comply with 
the later, more invasive request. Assuming this to be the case, 
the foot-in-the-door tactic holds potential for influencing 
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people’s thought and behavior long after the tactic has run its 
course. Freedman and Fraser (1966) themselves noted a par- 
allel between their approach and the approach employed by 
the Chinese military on U.S. prisoners of war captured during 
the Korean War in the early 1950s. A prisoner, approached 
individually, might be asked to indicate his agreement with 
mild statements like The United States is not perfect. After the 
prisoner agreed with such minor anti-American statements, 
he might be asked by the interrogator to elaborate a little on 
why the United States is not perfect. This, in turn, might be 
followed by a request to make a list of the “problems with 
America” he had identified, which he was expected to sign. 
The Chinese might then incorporate the prisoner’s statement 
in an anti-American broadcast. As a consequence of this 
ratcheting up of an initially mild anti-American statement, a 
number of prisoners came to label themselves as collabora- 
tors and to act in ways that were consistent with this self- 
image (cf. Schein, 1956). 

Commitment underlies a related tactic known as throwing 
a lowball, which is routinely employed by salespeople to gain 
the upper hand over customers in price negotiations (Cialdini, 
2001). Automobile salespeople, for example, will seduce cus- 
tomers into deciding on a particular car by offering it at a very 
attractive price. To enhance the customer’s commitment to 
the car, the salesperson might allow the customer to arrange 
for bank financing or even take the car home overnight. But 
just before the final papers are signed, something happens 
that requires changing the price or other terms of the deal. 
Perhaps the finance department has caught a calculation error 
or the sales manager has disallowed the deal because the 
company would lose money at that price. At this point, one 
might think that the customer would back out of the deal— 
after all, he or she has made a commitment to a particular ex- 
change, not simply to a car. Many customers do not back out, 
however, but rather accept the new terms and proceed with 
the purchase. Apparently, in making the initial commitment, 
the customer takes mental possession of the object and is re- 
luctant to let it go (Burger & Petty, 1981; Cioffi & Garner, 
1996). 

Changing the terms of the deal without undermining the 
target’s commitment is not limited to shady business prac- 
tices. Indeed, lowball tactics underlie transactions having 
nothing to do with economics, and can be used to gain peo- 
ple’s cooperation to do things that center on prosocial con- 
cerns rather than personal self-interest (e.g., Pallak, Cook, & 
Sullivan, 1980). In an interesting application of the lowball 
approach, Cialdini, Cacioppo, Bassett, and Miller (1978) 
played on college students’ potential commitment to psycho- 
logical research. Students in Introductory Psychology were 
contacted to see if they would agree to participate in a study 


on “thinking processes” that began at 7:00 a.m. Because this 
would entail waking up before the crack of dawn, few stu- 
dents (24%) expressed willingness to participate in the study. 
For another group of students, however, the investigators 
threw a lowball by not mentioning the 7:00 a.m. element until 
after the students had indicated their willingness to take part 
in the study. A majority of the students (56%) did in fact agree 
to participate, and none of them backed out of this commit- 
ment when informed of the starting time. After an individual 
has committed to a course of action, new details associated 
with the action—even aversive details that entail unantici- 
pated sacrifice—can be added without undermining the psy- 
chological foundations of the commitment. 

Like the lowball tactic, the bait-and-switch tactic works 
by first seducing people with an attractive offer. But whereas 
the lowball approach changes the rules by which the ex- 
change can be completed, the bait-and-switch tactic nixes the 
exchange altogether, with the expectation that the target will 
accept an alternative that is more advantageous to the influ- 
ence agent. Car salespeople once again unwittingly have fur- 
thered the cause of psychological science by their shrewd 
application of this technique (Cialdini, 2001). They get the 
customer to the showroom by advertising a car at a special 
low price. Taking the time to visit the showroom constitutes a 
tentative commitment to purchase a car. Upon arrival, the 
customer learns that the advertised special is sold, or that be- 
cause of its low price, the car doesn’t come with all the fea- 
tures the customer wants. Because of his or her commitment 
to purchase a car, however, the customer typically expresses 
willingness to examine and purchase a more expensive 
model—even though he or she wouldn’t have made the trip 
to look at these models in the first place. 


SOCIAL COORDINATION 


To this point, social influence has been described as if it were 
a one-way street. One person (the influence agent) has an 
agenda that he or she wishes to impose upon another person 
(the influence target). Although influence strategies certainly 
are employed for purposes of control and manipulation, so- 
cial influence broadly defined serves far loftier functions 
in everyday life. Indeed, as noted at the outset, it is hard to 
discuss any aspect of social relations without acceding a 
prominent role to influence processes. Social influence is 
what enables individuals to coordinate their opinions, moods, 
evaluations, and behaviors at all levels of social reality, from 
dyads to social groups to societies. The process of social 
coordination is a thus a two-way street, with all parties to 
the exchange influencing and receiving influence from one 


another. The ways and means of coordination are discussed 
in this section, as are the functions—both adaptive and 
maladaptive—of this fundamental human tendency. 


Conformity 


People go to a lot trouble to influence one another. Yet for 
all the effort expended in service of manipulation, sometimes 
all it takes to influence a person is to convey one’s own atti- 
tude or action preference. People take solace from the ex- 
pressions of like-minded people and develop new ways of 
interpreting reality from those with different perspectives. In 
both cases, simply expressing an opinion—no tricks, strate- 
gies, or power plays—may be sufficient to bring someone 
into line with one’s point of view. This form of influence cap- 
tures the essence of conformity, a phenomenon that is com- 
monly counted as evidence for people’s herdlike mentality. 
There is a nonreflective quality to many instances of confor- 
mity, but this property enables people to coordinate their 
thoughts in an efficient manner and attain the social consen- 
sus necessary to engage in collective action. We consider first 
what constitutes conformity, and then we develop both the 
positive and negative consequences of this manifestation of 
social influence. 


Group Pressure and Conformity 


Conformity represents a “change in behavior or belief to- 
ward a group as a result of real or imagined group pressure” 
(Kiesler & Kiesler, 1976). Defined in this way, conformity 
would seem to be a defining feature of group dynamics. 
Festinger (1950), for example, suggested that pressures to- 
ward uniformity invariably exist in groups and are brought 
to bear on the individual so that over time, he or she will tend 
to conform to the opinions and behavior patterns of the other 
group members. If one of two diners at a table for two says 
that he or she finds the food distasteful and the other person 
expresses a more favorable opinion, the first person is un- 
likely to change his or her views to match those of his or 
her companion. However, the addition of several more dinner 
companions, each holding the contrary position, may well 
cause the person to rethink his or her position and estab- 
lish common ground with the others. If he or she has yet to 
express an opinion, the likelihood of conforming to the 
others’ opinions is all the greater. To investigate the variables 
at work in this sort of context—group size, unanimity of 
group opinion, and the timing of the person’s expressed 
judgment—Solomon Asch (1951, 1956) performed a series 
of experiments that became viewed unanimously by social 
psychologists as classics. 


Social Coordination 397 


Asch’s original intention actually was to demonstrate that 
people do not conform slavishly and uncritically in a group 
setting (Levine, 1996). Asch put his hope for humanity to a 
test in a simple and elegant way. Participants thought they 
were participating in a study on perception. They sat facing a 
pair of white cardboards on which vertical lines were drawn. 
One card had a single line, which provided the standard for 
subjects’ perceptual judgments. The second card had three 
lines of varying length, one of which was clearly the same 
length as the standard. Participants were simply asked to 
indicate which of the three lines matched the standard. The 
correct answer was always obvious, and in fact when partici- 
pants were tested individually, they rarely made a mistake. To 
give conformity a chance, Asch (1951) placed a naive par- 
ticipant in a group setting with six other people, who were 
actually experimental accomplices pretending to be naive 
participants. By arrangement, the participant always made 
his judgment after hearing the bogus participants make their 
judgments. For the first two trials, the accomplices (and, of 
course, the participant) gave the obviously correct answers. 
After creating this group consensus, the accomplices gave a 
unanimous but incorrect answer on the third trial—and again 
on trials 4, 6, 7, 9, 10, and 12. To Asch’s surprise, the typical 
participant conformed to the incorrect group response one 
third of the time. Over 80% of the participants conformed to 
the incorrect majority on at least one trial, and 7% conformed 
on all seven of the critical trials. Although it was not his in- 
tent, Asch had demonstrated that even when there is a clear 
reality, people are still inclined to go along with the crowd. 


Informational and Normative Influence 


Presumably, Asch’s participants conformed because they 
wanted the other group members to like them or because they 
were fearful of ridicule if they failed to go along. During post- 
experimental interviews, participants typically mentioned 
these concerns as their motivation for concurring with obvi- 
ously inaccurate judgments. And when Asch allowed partici- 
pants to make their responses privately in writing as opposed 
to publicly by voice, the extent to which participants con- 
formed showed a marked decrease. Because people are obvi- 
ously less concerned about the approval of others when the 
others cannot monitor their behavior, these findings suggest 
that participants’ conformity did in fact reflect a desire to win 
approval or avoid disapproval. 

Social approval does not exhaust the possible motives for 
conformity, however. Indeed, several years prior to Asch’s 
research, Muzafer Sherif (1936) had concocted an equally 
compelling experimental situation relevant to conformity, but 
one that played on the often ambiguous nature of physical 
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reality rather than concerns with acceptance, rejection, and 
the like. Sherif felt that groups provide important information 
for individuals—and more important, interpretative frame- 
works for making coherent judgments about information. 
People have a need for cognitive clarity (Schachter, 1959), 
but sometimes they lack an objective yardstick for determin- 
ing the true nature of their experiences. In such instances, 
people turn to others, not to gain approval but rather to obtain 
social clues to reality. People are highly prone to rumors, for 
example, even from unreliable sources, when they hear about 
goings-on for which no official explanation has been pro- 
vided. A sudden noise or a hard-to-read message can simi- 
larly make people prone to the assessments of others in an 
attempt to clarify what has happened. 

To test this motivation for conformity, Sherif (1936) 
needed a situation in which the physical environment lacked 
ready-made yardsticks for understanding, so that the opera- 
tion of social standards could be observed. His solution was 
to take advantage of the autokinetic effect—the apparent 
motion of a stationary spot of light in a dark room. The idea 
was to place a group of participants in this type of situation 
and ask them to make estimates of the light’s movement. Par- 
ticipants, of course, were not informed that the light’s move- 
ment was illusory. When tested individually, participants 
varied considerably in their estimates, from virtually no 
movement to more than 10 inches. He then brought together 
three participants who had previously made estimates in pri- 
vate, and asked them to announce their individual judgments 
aloud and in succession. Despite their initial differences, par- 
ticipants converged fairly quickly (often within three trials) 
on a single estimate that functioned as a group standard for 
the light’s movement. Sherif went on to show that after a 
group defined reality for participants, they continued to ad- 
here to the group judgment even after they left the group (see 
also Alexander, Zucker, & Brody, 1970). 

Deutsch and Gerard (1955) recognized that people can 
conform for different reasons and formally distinguished be- 
tween normative influence, which captures the essence of the 
Asch situation, and informational influence, which reflects 
participants’ motivation in the Sherif situation. Normative in- 
fluence refers to conformity in an attempt to gain approval, 
whereas informational influence refers to conformity in an at- 
tempt to gain clear knowledge about reality. Sometimes it is 
difficult to determine which basis of conformity is operative 
in a given situation. Imagine, for example, that you observe 
someone following the lead of others at a classical music 
concert. When they sit, he or she sits. When they give a stand- 
ing ovation, the person follows suit. The group influence in 
this case could be normative, informational, or perhaps both, 
depending on the person’s primary source of uncertainty. If 


the person is unsure of his or her standing among the fellow 
concert-goers, the person’s conformity could be driven by de- 
sires for approval or fears of ridicule. If the person is unfa- 
miliar with classical music, however, the behavior of others 
might provide all-important clues about the quality of the 
performance. 

Normative influence is especially salient when the group 
controls material or psychological rewards important to the 
person (e.g., Crutchfield, 1955), when the behavior is public 
rather than private (e.g., Insko, Drenan, Solomon, Smith, & 
Wade, 1983), or when the person is especially eager for 
approval (Crowne & Marlowe, 1964). Someone attending 
the concert with prospective colleagues, for instance, may 
be especially inclined to match their behavior, particularly 
if he or she is uncertain about their interest in his or her 
job candidacy and the concert hall has good lighting. The 
salience of informational influence in turn depends on 
the person’s confidence in his or her own judgment, and on 
the person’s judgment of how well-informed the group is. 
Thus, a classical music neophyte who sees tuxedo-clad audi- 
ence members leap to their feet upon completion of the Rach 
3 (Rachmaninoff’s third piano concerto) is more likely to fol- 
low suit than if he or she instead sees the same behavior by 
school children. A graduate of Julliard, meanwhile, is un- 
likely to mimic such behavior in either case. Informational 
influence tends also to take precedence, not surprisingly, 
when the judgment task is particularly difficult or ambiguous 
(e.g., Coleman, Blake, & Mouton, 1958). Even in the Asch 
situation, conformity is increased when the lines are closer in 
length and thus harder to judge (Asch, 1952), and when judg- 
ments are made from memory rather than from direct percep- 
tion of the lines (Deutsch & Gerard, 1955), presumably 
because our memories are considered more fallible than are 
our immediate perceptions. 


Groupthink 


Conformity clearly serves important functions, but like every 
other adaptation, there are downsides as well. A particularly 
troublesome aspect of conformity is groupthink (Janis, 1982). 
Janis borrowed this term from George Orwell’s 7984 to refer 
to a mode of thinking dominated by a concern for reaching 
and maintaining consensus, as opposed to making the best 
decision under the circumstances. Groupthink essentially 
entails “a deterioration of mental efficiency, reality testing, 
and moral judgment that results from group pressure” (Janis, 
1982, p. 9). Rather than examining all possible courses of 
action, people in the grips of groupthink expend their mental 
energy on achieving and maintaining group solidarity and 
opinion unanimity. 


The potential for groupthink exists in any group context, 
informal as well as formal, but the most intriguing examples 
concern decisions with far-reaching consequences by people 
normally considered the best and the brightest. Janis (1982) 
analyzed several such situations, including the Bay of Pigs 
invasion during the Kennedy administration, the bombing of 
Pearl Harbor, and the Vietnam War. Janis identified several 
common factors in these instances. In each case, crucial deci- 
sions were made in small groups whose members had con- 
siderable respect and liking for one another. Positive regard 
is certainly preferable to disinterest or disrespect, of course, 
but it can also serve to inhibit criticism and close examina- 
tion of one another’s suggestions. The group members also 
tended to exhibit collective rationalization, systematically 
discrediting or ignoring all information contrary to the 
prevailing group sentiment. They also tended to develop 
strong feelings that their mission (e.g., invading Cuba, imple- 
menting a massive troop build-up in South Vietnam) was 
moral and that the opposite side was not only immoral but 
also stupid. To further cocoon the group, self-appointed 
“mind-guards” precluded members from accessing informa- 
tion that was inconsistent with the party line. The upshot is 
something akin to tunnel vision, in which a single perspective 
is seen as the only viable perspective—not because of a 
rational assessment of the facts but because of the group’s 
irrational espirit de corps. 


Group Polarization 


The groupthink phenomenon has rather straightforward im- 
plications for another phenomenon—group polarization— 
that was nonetheless considered surprising when first noted 
by researchers (e.g., Stoner, 1961; Wallach, Kogan, & Bem, 
1962). The conventional wisdom was that individuals in 
groups avoid going out on the proverbial limb, and thus tend 
to produce more common or popular opinions and recom- 
mendations (cf. F. H. Allport, 1924). It followed from this 
that a group decision is usually more conservative than the 
average of the decisions generated by group members indi- 
vidually. This assumption regarding group decision making 
is reflected in critics’ laments about the bland and often timid 
recommendations generated by committees in bureaucratic 
environments. When faced with making a decision, groups 
were assumed to inhibit boldness, subjugating the creative 
mind to the lowest common denominator of the group. What 
the research began to reveal, however, was quite the opposite 
tendency—greater endorsement of risky decisions as a result 
of group discussion. 

This so-called risky shift is not surprising in light of theory 
and research on groupthink. If anything, the sense of 
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superiority and certainty fostered by an emphasis on cohe- 
siveness as opposed to rationality would seem to be a breed- 
ing ground for bold decisions that go beyond what an 
individual alone would contemplate. The shift toward risky 
decisions, however, was observed in contexts that didn’t in- 
volve the intellectual and emotional incest displayed by 
highly cohesive groups of self-important people. Even 
groups of strangers brought together for a one-shot encounter 
in a laboratory setting were found to advocate courses of 
action with less guarantee of success than the recommen- 
dations volunteered by the group members prior to their 
discussion. Because this observation flew in the face of con- 
ventional wisdom, it cried out for both replication and expla- 
nation. During the 1960s, neither proved to be in short supply. 
This burgeoning literature demonstrated greater risk-taking 
with respect to a wide variety of domains, including bargain- 
ing and negotiations (Lamm & Sauer, 1974), gambling be- 
havior (Blascovich, Ginsberg, & Howe, 1975; Lamm & 
Ochssmann, 1972), and jury decisions (Myers, 1982). 
The risky shift was observed, moreover, when the conse- 
quences of a group’s decision involved real as well as hypo- 
thetical consequences (Wallach et al., 1962). The research 
also demonstrated that the risky shift was not limited to rec- 
ommendations regarding possible courses of action. Indeed, 
group discussion—again, even among strangers—seemed to 
intensify all sorts of attitudes, beliefs, values, judgments, and 
perceptions (Myers, 1982). Such shifts were observed for 
both sexes, in different populations and cultures (e.g., United 
States, Canada, England, France, Germany, New Zealand), 
and with many kinds of group participants (Pruitt, 1971). 

Several explanations for the risky shift achieved currency 
(Forsyth, 1990). The diffusion of responsibility perspective 
suggested that people are less averse to risk in groups 
because they feel less responsibility for—and hence less anx- 
ious about—the potential negative outcomes of risky deci- 
sions. The leadership account held that risk takers tend to 
emerge as leaders because of their greater confidence, 
assertiveness, and involvement in the task, and that their 
leadership status makes them more influential in group dis- 
cussions. Familiarization theory maintained that group dis- 
cussion increases members’ familiarity with the issue, which 
reduces their uncertainty and increases their willingness to 
advocate more risky alternatives. The value perspective pro- 
posed that taking risks is positively valued (in our culture, at 
least) and that group members like to be perceived as willing 
to take a chance; when group members discover that others in 
the group favor riskier alternatives, they change their original 
position to agree with the riskiest member. 

During this same period, however, some research hinted at 
the opposite effect of group discussion—a cautious shift. To 
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complicate matters even further, research began to find evi- 
dence of movement in both directions after a group discus- 
sion (Doise, 1969; Moscovici & Zavalloni, 1969), suggesting 
that both risky and cautious shifts were different manifesta- 
tions of a more basic phenomenon. Based on a review of this 
research, Myers and Lamm (1976) identified what they felt 
was the underlying process. According to their group- 
polarization hypothesis, the “average postgroup response 
will tend to be more extreme in the same direction as the 
average of the pregroup responses” (p. 603). Imagine two 
groups, each consisting of four individuals whose opinions 
vary in their respective preferences for risk. The average 
choice of members is closer to the risky end of the caution- 
risk dimension in one group, but closer to the cautious end of 
this dimension in the other group. The group-polarization 
effect predicts that the first group should become riskier as a 
result of group discussion (1.e., a risky shift), but that the sec- 
ond group should become more cautious during its delibera- 
tions (i.e., a cautious shift). The evidence cited by Myers and 
Lamm (1976) is consistent with this prediction and is widely 
accepted today as a valid empirical generalization regarding 
group dynamics. 

This straightforward generalization proved to be resistant 
to a simple theoretical account. Most theorists eventually en- 
dorsed the value account (e.g., Myers & Lamm, 1976; Pruitt, 
1971; Vinokur, 1971), although it didn’t take long for differ- 
ent variations on this general theme to emerge. Of these, two 
have stood the test of time (thus far). Social comparison the- 
ory holds that people attempt to accomplish two goals during 
group discussion: evaluating the accuracy of their position by 
comparing it with the positions of other group members, and 
creating a favorable impression of themselves within the 
group. The confluence of these two motives results in a ten- 
dency to describe one’s own position in somewhat more ex- 
treme terms (e.g., Goethals & Zanna, 1979; Myers & Lamm, 
1976). Persuasive-arguments theory, meanwhile, stresses the 
importance of the information obtained during group discus- 
sion. Whether there is a shift toward risk or toward caution 
depends on the relative persuasiveness of the arguments 
favoring each position (e.g., Burnstein & Vinokur, 1977; 
Vinokur & Burnstein, 1974). The distinction between these 
two accounts corresponds to the distinction introduced earlier 
between normative and informational influence. Social com- 
parison theory, with its emphasis on self-presentation at- 
tempts to match the perceived group norm, can be understood 
in terms of normative influence. The persuasive-arguments 
perspective, meanwhile, is practically synonymous with 
the rationale of informational influence. As noted in our ear- 
lier discussion, these two forms of influence often co-occur, 


so it should come as no surprise that social comparison and 
persuasive arguments often work together to promote polar- 
ization in groups (cf. Forsyth, 1990). 


Minority Influence 


In the film Twelve Angry Men, the character played by Henry 
Fonda turned his one-man minority into a unanimous major- 
ity during jury deliberations so that an innocent man could go 
free. In the face of virulent opposition, Galileo struggled for 
acceptance of his proof of Copernican theory that the planets 
revolve around the sun. This acceptance did not come during 
his lifetime, but his influence lived on and eventually turned 
the intellectual tide for subsequent generations. Martin 
Luther King Jr. and Mahatma Gandhi both defied the prevail- 
ing norms of their respective cultures and brought about sig- 
nificant social and political change. And in everyday life, 
people with opinions or lifestyles out of step with those of the 
majority often manage to preserve their personal perspective, 
sometimes even overcoming the majority’s disapproval and 
winning acceptance. If conformity were the only dynamic at 
work in social groups, these examples could be dismissed as 
aberrations with no implications for our understanding of so- 
cial influence processes. One can envision groupthink and 
group polarization carried to the extreme, with the complete 
suppression of minority opinion and a resultant interpersonal 
homogeneity. 

Far from representing aberrations, these examples suggest 
that there is more to social life than accommodation by the 
minority to majority influence. Even in small social groups, it 
is possible for a lone dissenter to be heard and to convert oth- 
ers to his or her point of view. At a societal level, minority in- 
terests and opinions manage to survive in the face of majority 
disapproval and hostility, and can sometimes manage to be- 
come dominant forces in the culture. In recognition of these 
facts of social life, minority influence has emerged as an im- 
portant topic in social psychology (cf. Moscovici, 1976). 
Much of this research attempts to identify factors that enable 
minority opinions to persist in groups. Experiments in the 
Asch tradition, for example, have found that both group size 
and unanimity of the majority have important effects on con- 
formity. The relation between group size and conformity 
appears to be logarithmic, such that conformity increases 
with increasing group size up to a point, after which the addi- 
tion of more group members has diminishing impact (Latané, 
1981). Asch’s own research showed that conformity is re- 
duced if the group opposing the subject is not unanimous. 
Even one dissenter among the confederates emboldens the 
naive subject to resist group pressure and express his or her 


own judgment. This is true even if the dissenting confederate 
disagrees with the subject as well as the rest of the group 
(Allen & Levine, 1971). The key factor is not agreement with 
the subject, but rather the recognition that nonconformity is 
possible and acceptable. 

Other lines of research have explored the conditions under 
which minority opinions not only survive, but also become 
influential to varying degrees in the group. A primary con- 
clusion is that minority members must marshal high-quality 
arguments and come across as credible. In other words, 
minorities must rely on informational influence to counter the 
normative influence associated with the majority position. 
Against this backdrop, research has revealed a variety of more 
specific factors that foster minority influence. Thus, minori- 
ties are persuasive when they hold steadily to their views 
(Maass & Clark, 1984; Moscovici, Lage, & Naffrechoux, 
1969), originally held the majority opinion (e.g., R. D. Clark, 
1990; Levine & Ranelli, 1978), are willing to compromise a 
bit (Mugny, 1982), have at least some support from others 
(e.g., Asch, 1955; Tanford & Penrod, 1984; Wolf & Latané, 
1985), appear to have little personal stake in the issue (Maass, 
Clark, & Haberkorn, 1982), and present their views as com- 
patible with the majority but just a bit ahead of the curve, so 
to speak (e.g., Kiesler & Pallak, 1975; Maass et al., 1982; 
Volpato, Maass, Mucchi-Faina, & Vitti, 1990). Minority in- 
fluence also has a better chance if the majority wants to make 
an accurate decision, because this situation gives the advan- 
tage to informational over normative influence (Laughlin & 
Ellis, 1986). The conditions associated with effective minor- 
ity influence enable groups (and societies) to embrace new 
ideas, fashions, and action preferences. 


Accountability 


The notion of conformity conveys an image of nameless au- 
tomatons who surrender their personal identity to the group. 
Ironically, however, the coordination function served by mu- 
tual influence in a group setting requires rather than negates a 
sense of personal identity and responsibility among group 
members. To achieve social coordination, people must feel 
that they are part of a larger social entity, of course, but they 
also must feel that this part is uniquely their own. Two 
research traditions are relevant to the role of accountability in 
achieving social coordination. The first concerns the condi- 
tions under which people abrogate personal responsibility for 
doing their part to achieve a common goal or for taking the 
initiative in a group setting in which their involvement would 
be helpful. The second concerns the conditions under which 
people in a sense become overly sensitized to the group goal 
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to the point that they lose sight of their personal identity and 
unique role in the group. 


Social Loafing 


Sometimes the whole is less than the sum of its parts. This 
feature of group dynamics was first observed in an experi- 
mental setting by Max Ringelman in the 1920s. Using a 
gauge to measure effort exerted by tug-of-war participants, 
Ringelman found that the collective effort was always greater 
than that of any single participant, but less than the sum of all 
participants (Kravitz & Martin, 1986). If two people working 
alone could each pull 100 units, for example, their combined 
output was only 186—not the 200 one would expect if each 
pulled as hard as he or she could. Similarly, a three-person 
group did not produce 300 units, but only 255, and an eight- 
person group managed only 392 units—less than half the 800 
possible. 

Ringelman suggested that two mechanisms were responsi- 
ble for this phenomenon. The first, coordination loss, reflects 
difficulties individuals have in combining their efforts in a 
maximally effective fashion. On a rope-pulling task, for ex- 
ample, people may not synchronize their respective pulls and 
pauses, and this can prevent each person from reaching his or 
her full potential. The second mechanism, commonly re- 
ferred to today as social loafing (Latané, 1981), refers to 
diminished effort by group members. People may simply not 
work as hard when they feel other people can pick up the 
load. Latané, Williams, and Harkins (1979) attempted to 
replicate the Ringelman effect and to determine which of his 
proposed mechanisms accounted for it. Participants in one 
study, for example, were simply asked to shout or clap as 
loud or as hard as they could, while wearing blindfolds and 
headsets that played a stream of loud noise. When tested 
alone, participants averaged a rousing 9.22 dynes/cm’— 
about as loud as a pneumatic drill or a teenager’s stereo sys- 
tem. But in dyads, subjects performed at only 66% capacity, 
and in six-person groups, their performance dropped to 36% 
capacity. The results, in other words, revealed an inverse 
relationship between the number of coperformers and the 
output each one generated. 

To separate the relative impact of coordination loss and 
social loafing, Latané et al. (1979) tested noise production 
in pseudogroups. Participants thought that either one other 
participant or five other participants were cheering with 
them, although they were actually cheering alone (the blind- 
folds and headsets came in handy here). Because there were 
not any other group members, any drop in individual produc- 
tion could not be due to coordination loss, but instead would 
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reflect social loafing. Results revealed that social loafing was 
the operative mechanism. If participants thought they were 
cheering with one other person, they shouted at 82% of their 
individual capacity. Their productivity dropped to 74% if 
they thought five others were working with them. 

Social loafing is not limited to group tasks involving 
shouting, or even to tasks involving physical effort of some 
kind. The decrement in personal contribution with increasing 
group size has been documented in groups working on a va- 
riety of tasks, including maze performance, typing, swim- 
ming, vigilance exercises, creativity problems, job-selection 
decisions, and even brainstorming (e.g., Weldon & Mustari, 
1988; cf. Forsyth, 1990). Social loafing applies equally well 
to men and women, to people of all ages, and to groups in 
many different cultures (e.g., Brickner, Harkins, & Ostrom, 
1986; Harkins & Petty, 1982). There may be polarization of 
attitudes and other mental states in social groups, but this in- 
tensification effect apparently does not apply to group mem- 
ber’s efforts in accomplishing a group task. 

Social loafing varies in accordance with a set of specific 
factors. Group members loaf less when they are working on 
interesting or challenging tasks (e.g., Brickner et al., 1986). 
Loafing is also minimized when each member’s contribution 
to a group project can be clearly identified, presumably 
because identification creates the potential for evaluation 
by other group members (e.g., Harkins & Jackson, 1985; 
Jackson & Latané, 1981; Williams, Harkins, & Latané, 
1981). Social loafing is also partly attributable to the diffu- 
sion of responsibility that takes place in groups and crowds 
(cf. Latané & Darley, 1970). Bystanders to emergency situa- 
tions feel less compelled to intervene if there are other poten- 
tial helpers (Darley & Latané, 1968), for example, and 
restaurant patrons leave pitiful tips when there are many peo- 
ple in the dinner party (Latané & Darley, 1970). Diminished 
personal responsibility reflects members’ feeling that some- 
one else will make up the difference, and also reflects their 
assessment that they can get away with not helping because 
the blame is shared by everyone in the group. 

The research on social loafing has focused primarily on ad- 
ditive group tasks in which each member’s performance is re- 
dundant with that of every other member. This hardly exhausts 
the possible relationships among group members. In situa- 
tions emphasizing individual rather than group performance, 
for example, there is a tendency for individual energy expen- 
diture and effort to increase rather than decrease when others 
are physically present (cf. Triplett, 1898; Zajonc, 1965). 
Whether this social facilitation effect (cf. Cotterell, 1972) 
translates into better performance, however, depends on fea- 
tures of the task and the contingencies surrounding its occur- 
rence. The presence of others typically enhances performance 


on overlearned tasks, for example, but tends to hinder perfor- 
mance on novel or difficult tasks (Zajonc, 1965). There is 
some controversy regarding the social influence processes at 
work in such contexts, although there is a fair degree of con- 
sensus that the presence of others increases a performer’s 
physiological arousal, which in turn activates his or her domi- 
nant responses on the task. This is consistent with the empiri- 
cal generalization noted by Zajonc (1965), because correct 
responses are dominant for well-learned tasks and incorrect 
responses are dominant for unfamiliar tasks. 

Even in groups mandating cooperation among group mem- 
bers, the nature of the task may entail forms of coordination 
that go beyond the simple additive criterion employed in so- 
cial loafing research (cf. Steiner, 1972). Neither simultaneous 
shouting nor tug-of-war, after all, captures the essence of 
groups that build machines or solve human relations prob- 
lems. Many group goals are defined in terms of distinct sub- 
acts that must be accomplished by different group members. 
For such activities, the quality of the group’s performance de- 
pends on how well members’ respective contributions are 
synchronized in time. Assembling a car on a production line 
requires such role differentiation, as does maintaining a 
household, moving heavy pieces of furniture, or implement- 
ing plans to manually recount votes in a close election. Coor- 
dination is every bit as critical as individual effort per se in 
such instances, and a particular blend of normative and infor- 
mational influence may be necessary for the action to unfold 
smoothly and effectively. Identifying these blends of influ- 
ence is an agenda for future research. 


Deindividuation 


Festinger, Pepitone, and Newcomb (1952) coined the term 
deindividuation to describe a mental state defined by total 
submergence in a group. A deindividuated person feels he or 
she does not stand out as a unique individual, and this feeling 
leads to a reduction of inner restraints that can result in 
impulsive acts or other behaviors that might otherwise be in- 
hibited. Although these behaviors may be benign or even 
desirable (e.g., spontaneous expression of feelings, laughing 
and dancing at a boisterous party), researchers have typically 
focused on the potential for antisocial and aggressive actions 
under conditions that promote deindividuation (cf. Diener, 
1980; Zimbardo, 1970). Soccer hooligans committing ran- 
dom acts of violence, mobs rioting and looting stores, and 
gangs terrorizing their enemies are disturbing manifestations 
of this potential. 

Several preconditions for deindividuation have been iden- 
tified (Zimbardo, 1970). Being part of a large, unstructured 
group, for example, increases one’s anonymity and thus can 


reduce feelings of personal responsibility for one’s actions. 
The same can be said for clothing that conceals one’s iden- 
tity, the cover of darkness, sensory overload, the use of drugs 
or alcohol, and collective action of a simple, repetitive (or 
rhythmic) nature (e.g., marching, clapping, dancing). Diener 
(1980) suggested that the anonymity associated with deindi- 
viduating conditions is tantamount to a loss of self-aware- 
ness (Duval & Wicklund, 1972) and hence to diminished 
salience of personal standards for acceptable conduct (e.g., 
Carver & Scheier, 1999; Higgins, 1987; Vallacher & 
Solodky, 1979). Lacking the usual self-regulatory mecha- 
nisms for enacting and inhibiting behavior, the deindividu- 
ated person becomes highly susceptible to influence from the 
group and the context in which the group is acting. The 
nature of this influence, however, does not map onto either 
normative or informational influence in a straightforward 
manner. Thus, the person is not consciously modifying his or 
her behavior to court approval from others, nor is he or she 
gaining a great deal of insight into physical reality from fel- 
low group members. 

One likely dynamic at work is akin to what Le Bon 
(1895/1960) referred to as behavioral contagion, the rapid 
spread of behavior in a group context. Contagion occurs 
through simple imitation of others’ behavior or through the 
adoption of others’ emotional state, and thus is not particularly 
taxing on people’s mental processes. A related possibility fol- 
lows from emergent norm theory (Turner & Killian, 1972), 
which holds that people in unstructured group settings without 
clear a priori group goals are highly susceptible to cues to 
higher-order meaning and guides to action that develop in the 
situation. Consider, for example, the experience of walking 
down New Orleans’ Bourbon Street at 2 a.m. during Mardi 
Gras. This situation is ripe for deindividuation—maybe even 
prototypical. You are part of a large, unstructured group con- 
sisting of unfamiliar people, it’s dark and no one is paying 
attention to you anyway, music is coming from all angles to 
overwhelm your powers of sensory integration, and there may 
have been a couple of hurricane specials consumed by this 
time. But despite the complex array of sights and sounds, there 
is no plan dictating your movements and shifts in attention. At 
this point, if others in the throng spontaneously broke into a 
rhythmic chant or began throwing plastic beads at a passing 
float, you might be tempted to follow suit. The collective ac- 
tion you observe provides temporary integration for the en- 
semble of your specific experiences and thus functions as an 
emergent norm. The norm doesn’t imply acceptance or 
rejection by others—you could keep on walking and no one 
would care—but it does provide a guide that allows you to 
engage in concerted action rather than mere movement 
(cf. Goldman, 1970; Vallacher & Wegner, 1985). 
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Viewed in this way, it is easy to appreciate how a state 
of deindividuation can promote widely divergent action 
trajectories—moral versus immoral, prosocial versus antiso- 
cial, effusive versus sullen, and so on. In effect, the deindi- 
viduated person is behaving in accordance with rudimentary 
moment-to-moment action guides that are devoid of higher- 
level meaning. This mental state is a precondition for emer- 
gent understanding (Vallacher & Wegner, 1987), making the 
person highly susceptible to whatever goals and plans are 
rendered salient as the situation evolves. Should the situation 
resolve itself as an occasion for social camaraderie, the per- 
son might be inclined to laugh and dance with everyone he or 
she encounters. But should the opportunity for personal gain 
at the expense of others suddenly arise, the same person 
could just as easily behave in a decidedly unfriendly, even 
aggressive manner toward those who provide the oppor- 
tunity. Social influence in this context provides personal 
(if somewhat transient) coherence and direction for individu- 
als’ otherwise disassembled and unregulated actions. 


THE INDIVIDUAL AND SOCIETY 


One of the most challenging problems in social psychology 
centers on the relation between micro- and macrolevels of de- 
scription. Social psychological theories are typically couched 
in terms of a single level of description, with little explicit co- 
ordination with theories defined at different levels. Thus, the 
processes at the level of the individual tend to be independent 
of group-level processes. Yet it is unreasonable to expect any 
level of structure and function to operate in isolation. An in- 
dividual’s behavior is influenced by the social context in 
which he or she functions, and each individual in turn creates 
the social context for other individuals through his or her in- 
teractions with them. The nature of this mutual dependency is 
difficult to capture, but recent advances in the study of com- 
plex systems (cf. Schuster, 1984) are proving useful in link- 
ing different levels of social reality (e.g., Nowak & Vallacher, 
1998a, 1998b; Nowak, Vallacher, & Burnstein, 1998; Nowak, 
Vallacher, & Zochowski, 2002). In this section, we describe 
one relevant approach—cellular automata—that has estab- 
lished a track record in this regard in recent years. Other 
approaches (neural networks, coupled dynamical systems) 
are showing promise as well, and the reader is referred to the 
sources cited above for a description of them. 


The Cellular Automata Approach 


Cellular automata models (Gutowitz, 1991; Ulam, 1952; von 
Neumann, 1966; Wolfram, 1986) capture important features 
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of complex systems and are widely used in physics and vari- 
ous domains of biology, including neuroscience (Amit, 1989) 
and population dynamics (May, 1981). A set of elements is 
specified to represent the basic units (e.g., neurons, people) 
in the process under consideration. Each element can adopt 
a finite number of discrete states (e.g., activated vs. inhibited, 
pro- vs. antiabortion). The elements are arranged in a 
spatial configuration, the most common of which is a two- 
dimensional grid. The state of an element at t + 1 depends on 
the states of the neighboring elements at time t. The exact 
form of this dependence is specified by so-called updating 
rules. The dynamics of cellular automata depend on the na- 
ture of the updating rule and on the format of the grid dictat- 
ing the neighborhood structure. 

Two classes of cellular automata models are used to char- 
acterize social processes. In both, elements represent individ- 
uals in a social system. In one, personal characteristics 
change as a result of updating rules. This approach explores 
changes in attitudes and opinions that occur as a result of 
social interaction. In the other class, individuals maintain sta- 
ble characteristics but may change their physical location. 
This approach has revealed the emergence of spatial patterns 
on the basis of stable values and preferences. Shelling (1969, 
1971), for instance, developed an updating rule specifying 
that an individual who has more dissimilar than similar 
neighbors will move to a different random location. Simula- 
tions based on this simple rule demonstrated the emergence 
of spatial patterns corresponding to social segregation. Both 
classes of models reveal the emergence of regularities and 
patterns on a global level that were not directly programmed 
into the individual elements. These regularities and patterns 
typically take the form of spatial configurations, such as co- 
herent minority opinion clusters that emerge from an initial 
random distribution of opinions. Regularities may also ap- 
pear as temporal patterns, including such basic trajectories as 
the development of a stable equilibrium (fixed-point attrac- 
tor), alternation between different states (periodic attractor), 
and apparent randomness (deterministic chaos). 


Cellular Automata and Social Processes 


Cellular automata models are useful for exploring different 
social interaction rules and the generation of societal level 
phenomena as a result of such rules (cf. Hegselman, 1998; 
Messick & Liebrand, 1995; Nowak, Szamrej, & Latané, 
1990). In these applications, the neighborhood structure is 
intended to capture the structure of interdependence among 
individuals (Thibaut & Kelley, 1959). Indirect interdepen- 
dence exists when an individual’s actions have conse- 
quences, intended or unintended, for other people. This form 
of interdependence is often examined in the context of social 


dilemmas, in which an action intended to maximize personal 
gain has negative consequences for others (cf. Schulz, 
Alberts, & Mueller, 1994). In the tragedy of the commons 
(Hardin, 1968), for instance, a farmer is motivated to over- 
graze an area of land shared with other farmers. In the short 
run, the farmer gains advantage over his neighbors, but in the 
long run, everyone—the farmer included—suffers. Direct in- 
terdependence reflects what we normally think of as social 
influence: One person directly influences the state or behav- 
ior of another person. Power, manipulation, and coordination 
thus represent direct interdependence. Both indirect and di- 
rect forms of interdependence have been examined in cellular 
automata models. 


Interdependence and Social Dilemmas 


How can altruistic behavior can emerge against the backdrop 
of self-interest? Insight into this puzzle derives from cellular 
automata models that simulate the short- and long-term 
effects of behavior in the Prisoner’s Dilemma Game (PDG). 
In pioneering this approach, Axelrod (1984) demonstrated 
that cooperation often emerges among individuals trying to 
maximize their respective self-interest. Essentially, Axelrod 
found that cooperators survived by forming clusters with one 
another, so that they could engage in mutual help without 
risking exploitation. 

In an extension of this approach, Messick and Liebrand 
(1995) modeled the consequences of different strategies in 
the PDG. Each interactant occupied a fixed position in a two- 
dimensional lattice and played a PDG with one of his or 
her nearest neighbors. On each trial, the interactant chose 
whether to cooperate or defect according to one of several 
updating rules, each reflecting a specific social strategy. In a 
given simulation, everyone used the same strategy. In the tit- 
for-tat strategy, individuals imitated the choice made on the 
preceding trial by their neighbor. In the win-cooperate— 
lose-defect strategy, the interactant with the greater outcome 
cooperated, whereas the interactant with the smaller outcome 
defected. In the win-stay—lose-shift strategy, meanwhile, in- 
teractants who perceived themselves to be winning behaved 
in the same fashion on the next trial, whereas interactants 
who perceived themselves as losing changed their behavior 
on the next trial. The results of simulations employing these 
updating rules reveal different effects depending on the size 
of the group. In relatively small groups, an equilibrium tends 
to be reached fairly quickly, with all interactants converging 
on a particular choice. In larger groups, however, each strat- 
egy leads to continuous dynamics characterized by the coex- 
istence of different behavioral choices. Eventually, however, 
each strategy leads to specific proportions of cooperating in- 
dividuals. These proportions tend to be maintained at the 


group level, with the interactants themselves continuing to 
change their choices throughout the simulation. 

In a different approach, Hegselman (1998) explored the 
emergence of social support networks in a society. Individu- 
als lived on a two-dimensional grid containing some unoc- 
cupied sites and played a two-person “support game” with 
all of their immediate neighbors. Each individual was char- 
acterized by some probability of needing help. A needy indi- 
vidual clearly benefited, of course, if he or she received help 
from a neighbor, but providing help to a neighbor was 
clearly costly. With this trade-off in mind, each individual’s 
preferred neighborhood was one in which he or she could 
obtain the degree of help needed while minimizing the help 
he or she provided. Individuals were sometimes provided a 
migration option that enabled them to move to a more desir- 
able location within a certain radius. The results reveal how 
support networks can evolve in a world of rational egoists 
who are differentially needy, but similarly motivated to 
choose partners in an opportunistic manner. Although social 
support inevitably develops, the social networks that emerge 
tend to be highly segregated. Individuals with a moderate 
probability of becoming needy tend to form relationships 
with one another, and also with individuals from somewhat 
higher and lower risk classes. Interestingly, individuals at the 
extremes of neediness—those with very high or very low 
probabilities of needing help—tend to have the most diffi- 
culty in establishing support relations. If they do manage to 
form such relationships, their partners tend to be from the 
same risk class. 


Social Influence and the Emergence of Social Structure 


The cellular automata model of social process that has been 
analyzed most thoroughly concerns social influence (e.g., 
Lewenstein, Nowak, & Latané, 1993; Nowak, Lewenstein, & 
Frejlak, 1996). The initial formulation of this model (Nowak 
et al., 1990) focused on the emergence of public opinion in a 
society characterized by a diversity of attitudes. The model 
assumes that in the course of social interaction, individuals 
are motivated to sample the degree of social support for their 
position on a given topic. The model also assumes, in line 
with social impact theory (Latané, 1981), that each individual 
gives the greatest weight to the opinions of others who are 
spatially closest to him or her and who have the greatest 
strength (e.g., who are most influential or persuasive). An in- 
dividual’s own opinion is also taken into consideration and is 
weighted most heavily by virtue of spatial immediacy (i.e., 
distance is 0). After each round of interaction, the individual 
compares the degree of support for each attitude position and 
adopts the one with the strongest support in preparation for 
the next round of interaction. 
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In the simulations, one individual is chosen (usually at 
random), and influence is computed for each opinion in the 
group. (The strength of influence of each opinion is ex- 
pressed by the following formula. 


where /; denotes total influence, s; corresponds to the 
strength of each individual, and d;; corresponds to the dis- 
tance between individuals i and j.) If the resultant strength for 
an opinion position is greater than the strength of the individ- 
ual’s current position, his or her opinion changes to match the 
prevailing position. This process is performed for each indi- 
vidual. This procedure is repeated until there are no further 
changes, which typically requires several rounds of simula- 
tion, because a person who had previously changed his or her 
position to match that of his or her neighbors may revert to 
the original position if the neighbors change their opinions. 
Figures 16.1 and 16.2 present representative results of the 
computer simulations. Each box corresponds to an individ- 
ual. The color of the box (light vs. dark gray) denotes the 
individual’s position, and the height of the box corresponds 
to the individual’s strength. In Figure 16.1, there is a majority 
of 60% (light gray) and a minority of 40% (dark gray). The 
majority and minority members are randomly distributed, 


Figure 16.2 Final equilibrium of opinions in the simulated group. 
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and each group has the same relative proportions of strong 
and weak members (high vs. low boxes). Figure 16.2 shows 
the equilibrium reached after six rounds of simulated discus- 
sion. Now the majority is 90% and the minority is 10%. Note 
that the minority opinion has survived by forming clusters of 
like-minded people and that these clusters are largely formed 
around strong individuals. 

These two group-level outcomes—polarization and 
clustering—are commonly observed in computer simulations 
(cf. Nowak et al., 1996; Latané, Nowak, & Liu, 1994) and are 
reminiscent of well-documented social processes. As noted 
earlier in this chapter, the average attitude in a group be- 
comes polarized in the direction of the prevailing attitude as 
a result of group discussion (e.g., Moscovici & Zavalloni, 
1969; Myers & Lamm, 1976). In the simulations, polariza- 
tion reflects the greater influence of the majority opinion. In 
the initial random configuration (Figure 16.1), the average 
proportion of neighbors holding a given opinion corresponds 
to the proportion of this opinion in the total group. The aver- 
age group member, then, is surrounded by more majority 
than minority members, a difference that results in more mi- 
nority members’ being converted to the majority position 
than vice versa. Some majority members are converted to 
the minority position, however, because they happen to be 
located close to an especially influential minority member, or 
because by pure accident, more minority members happen to 
be at this location. 

Clustering is also pervasive in social life. Attitudes, for ex- 
ample, have been shown to cluster in residential neighbor- 
hoods (Festinger, Schachter, & Back, 1950). Pronounced 
clustering also characterizes political beliefs, religions, cloth- 
ing fashions, and farming techniques. Clustering reflects the 
relatively strong influence exerted by an individual’s neigh- 
bors. When opinions are distributed randomly, the sampling 
of opinions through social interaction provides a reasonably 
accurate portrait of the distribution of opinions in the larger 
society. When opinions are clustered, however, the same sam- 
pling process will yield a highly biased result. Because the 
opinions of those in the nearby vicinity are weighted the most 
heavily, the prevalence of one’s own opinion is likely to be 
overestimated. Hence, opinions that are in the minority in 
global terms can form a local majority. Individuals who hold 
a minority opinion are therefore likely to maintain this opin- 
ion in the belief that it represents a majority position. 


Control Factors for Social Influence 


The results concerning polarization and clustering have been 
confirmed analytically (Lewenstein et al., 1993) and have 
received empirical support as well (Latané, Liu, Nowak, 


Bonavento, & Zheng, 1995; Latané & Nowak, 1997). This 
research has also identified several control factors that are re- 
sponsible for the emergence of these macroscopic properties 
(Latané & Nowak, 1997; Lewenstein et al., 1993; Nowak 
et al., 1996). Individual differences in strength, first of all, are 
indispensable to the survival of minority clusters. This con- 
clusion is consistent with evidence demonstrating the impor- 
tance of leaders for maintaining the viability of minority 
opinions. The literature on brainwashing, for example, docu- 
ments that natural leaders were commonly removed from the 
group before attempts were made to brainwash prisoners of 
war (cf. Schein, 1956). By counteracting the sheer number of 
majority opinions, the strength of leaders stops minority clus- 
ters from decaying. It is worth noting that as a result of social 
influence, individual differences in strength tend to become 
correlated with opinions. This is because the weakest minor- 
ity members are most likely to adopt the majority position, so 
that over time the average strength of the remaining minority 
members will grow at the expense of the majority. This sce- 
nario is consistent with the observation that individuals advo- 
cating minority positions are often more influential than 
those advocating majority positions. 

A second critical control factor is nonlinearity in attitude 
change. Abelson (1979) demonstrated that when individuals 
move incrementally toward the opinions of interaction part- 
ners as a result of social influence, the invariable outcome of 
simulations is uniformity and the complete loss of minority 
clusters. In the model depicted here, however, attitudes are 
assumed to be categorical in nature (Latané & Nowak, 1994). 
This means that individuals hold a fixed position and actively 
resist influence attempts until a critical threshold of influence 
is reached, at which point they switch dramatically from one 
category to another rather than incrementally on a dimension 
of judgment. There is empirical evidence in support of the 
nonlinearity assumption for attitude topics that are personally 
important (cf. Latané & Nowak, 1994). Such attitudes dis- 
play a bimodal distribution, with almost no individuals occu- 
pying the intermediate points on the attitude dimension. This 
suggests, incidentally, that one way to achieve consensus in a 
group is to decrease the subjective importance of the topic in 
question. 

A third critical feature concerns the geometry of the social 
space (Nowak, Latané, & Lewenstein, 1994). People do not 
communicate equally with everyone in a group, nor are their 
interactions random. Specific communication patterns can 
be approximated with different geometries of social space. 
In most of the simulations, social space is portrayed as a 
two-dimensional matrix of n rows and n columns. This geom- 
etry reflects the assumption that interactions typically occur 
in two-dimensional spaces, such as neighborhoods, town 


squares, and rooms. One can envision other geometries, how- 
ever, to capture different communication structures (Nowak 
et al., 1996). A one-dimensional geometry in which people 
interact mainly with neighbors to their left and right corre- 
sponds to a row of houses along a river or a village stretching 
along a road. In this case, strong clustering occurs because of 
well-pronounced local interactions between nearest neigh- 
bors. Polarization, however, is inhibited because members of 
the majority cannot encircle members of the minority and 
overwhelm them. Far more elaborate geometries of social 
space can also be envisioned. In the real world, many differ- 
ent geometries no doubt co-occur and thus determine the 
dynamics of social influence. The availability of telephones, 
e-mail, and common areas for shopping and recreation 
clearly add many dimensions to the effective geometry in 
which interactions occur. The combined effects of such 
geometries play a significant role in determining the form and 
outcome of social influence. 

A fourth critical factor represents the weight an individ- 
ual attaches to his or her own opinion as compared to the 
opinions of others. This variable, referred to as self-influence, 
corresponds to psychological states like self-confidence, 
strength of conviction, and belief certainty. An individual’s 
self-influence is correlated with his or her strength, although 
the absolute value of self-influence varies as a function of 
topic or social setting. When an issue is new or confusing, for 
example, self-influence is correspondingly lower, reflecting 
the fact that no strong opinion has formed and everyone is 
relatively open to external influence. When an issue is famil- 
iar and personally important, however, self-influence attains 
its maximum value for everyone, reflecting the greater im- 
portance of one’s own opinion compared to others’ opinions. 
Because issue familiarity is assumed to be the same for all 
individuals in a given simulation, variation in self-influence 
is a direct reflection of variation among individuals in their 
respective strength. 

The dynamics of social influence are determined by the 
value of self-influence relative to the total influence of other 
individuals. When self-influence is low, individuals may 
switch their opinions several times during the course of sim- 
ulations. This has the effect of destabilizing clusters. For 
topics that are unfamiliar, then, one observes heightened 
dynamics that promote unification based on the majority 
opinion. However, if self-influence is greater than the com- 
bined influence of others, dynamics tend to be dampened 
altogether, unless sources of noise (random external factors) 
are present. Because noise works jointly with social influ- 
ence, noise-induced changes are typically in the same direc- 
tion as majority influence. Introducing a random factor that 
by itself would not favor any position can thus neutralize the 
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effect of self-influence and enhance the effect of majority 
opinion. Very high values of noise, however, can dilute the 
effects of social interaction as well, producing random 
changes in opinion. 


Social Change and Societal Transitions 


This general approach to the modeling of social processes has 
proven useful in generating insight into the dynamics of social 
change, including major societal transformations (Nowak & 
Lewenstein, 1996; Nowak, Lewenstein, & Szamrej, 1993; 
Nowak & Vallacher, 2001). This approach successfully mod- 
els social change when a source of bias is introduced that 
makes the minority opinion more attractive than the majority 
opinion. The results of simulations reveal that rapid social 
change occurs in a manner that is remarkably similar to phase 
transitions in physical phenomena. Expressed metaphori- 
cally, changes enter as bubbles of new within the sea of old, 
and social transitions occur as these bubbles expand and be- 
come connected. Thus, for example, a new political ideology 
or lifestyle fashion that resonates with existing values or in- 
terests is introduced into a social system and is immediately 
embraced by pockets of people in different areas. These pock- 
ets become increasingly connected over time, until at some 
point the new idea achieves widespread dominance over the 
old idea. 

Computer simulations also indicate, however, that the 
bubbles of the old manage to stay entrenched in the sea of the 
new. The strongest and best-supported individuals holding 
the old position, moreover, are the most likely to survive 
pressures associated with the new position. This, in turn, 
means that the old position is likely to display a rebound 
effect when the bias toward the new position disappears or 
is somehow reversed. This scenario provides an explanation 
for the return of leftist governments in Eastern Europe after 
their overwhelming defeat in the elections in the late 1980s. 

This model of societal transition stands in marked contrast 
to the conventional view of social change, which holds that 
individuals gradually switch from an old set of attitudes or 
preferences to a new set of ideas. From that perspective, new 
ideas spread more or less uniformly through a society at a 
constant and relatively slow rate. The simulation model 
allows for this mode of social change as long as the social 
system is near a relatively stable equilibrium and noise is not 
a significant factor in dictating the system’s dynamics 
(Nowak et al., 1993). The incremental scenario, in other 
words, may effectively characterize how change occurs in a 
stable society (e.g., a gradual shift from liberalism to conser- 
vatism or vice versa), but it does not capture the nature of 
change defining periods of rapid social transition. 
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Two sources of data provide empirical support for this 
perspective on social transition: the development of the pri- 
vate sector of the Polish economy and the emergence of vot- 
ing preferences in the Polish parliamentary elections during 
the transition from socialism to private enterprise in the late 
1980s and early 1990s (Nowak, Urbaniak, & Zienkowski, 
1994). For a description of these data, as well as a compre- 
hensive depiction of the cellular automata model and its im- 
plications for societal transition, the reader is referred to 
Nowak and Vallacher (2001). 


Implications for Cultural Differences 


The cellular automata model is useful in understanding and 
predicting differences among cultures in the dynamics of 
social influence and societal organization. A primary theme 
in cross-cultural comparisons centers on collectivism versus 
individualism (cf. Markus & Kitayama, 1991). In so-called 
collectivist cultures—China and Japan, for example— 
interdependence among individuals is stressed at the expense 
of personal independence, so that individuals are readily in- 
fluenced by the beliefs, attitudes, and expectations of other 
people. In so-called individualistic cultures—the United 
States, for example—greater emphasis is placed on inde- 
pendence, with individuals maintaining a relatively strong 
degree of autonomy in their self-concept, attitudes, and 
lifestyle. This dimension of cultural variation maps directly 
onto the variable of self-influence in the cellular automata 
model. In a society that values independence in decision- 
making and judgment, the magnitude of self-influence is cor- 
respondingly strong and operates at the expense of the 
opinions and expectations of others. Computer simulations 
have revealed that as self-influence increases in magnitude, 
the number of individuals changing their opinion on a given 
issue decreases, there is less polarization and clustering, and 
the average cluster is smaller in size (Latané & Nowak, 1997; 
Lewenstein et al., 1993). 

Societies also differ in their relative stability. In less mod- 
ernized societies, which are predominantly rural and agrarian 
rather than industrial in nature, the social context for individ- 
uals is relatively stable over time. In contrast, relatively mod- 
ernized and industrial societies tend to be characterized by 
greater social mobility (e.g., travel, permanent relocation) 
and greater frequency of communication over large distances 
(by means of phone, e-mail, and fax). These features disturb 
the stability of social influence exerted by the social context 
on the individual. At different times, in other words, the indi- 
vidual is exposed to a broad range of opinions that go beyond 
those expressed in the immediate social context. This aspect 
of modernized society can be represented in the model as 


noise, which reflects the sum of influences (e.g., exposure to 
mass media, contact with people from other cultures) not ac- 
counted for by local influence. The greater the magnitude of 
noise in a society, the weaker the relative role played by the 
individual’s local context. The opinions of someone in a dif- 
ferent part of the country, for example, may have a greater 
impact on an individual’s opinions than do the opinions of his 
or her immediate neighbors. This is clearly not the case in a 
stable society, in which everyone is exposed to the same local 
contacts throughout much of his or her life. 

Computer simulations of the model have demonstrated a 
nonlinear relationship between noise and the distribution of 
opinions in a society (Latané & Nowak, 1997; Lewenstein 
et al., 1993). Small values of noise tend to destabilize weak 
clusters (e.g., Nowak, Vallacher, Tesser, & Borkowski, 
2000). Because weak clusters tend also to be small, low-level 
noise has the effect of increasing the average size of clusters 
in the society, which is reflected in higher overall clustering 
and polarization. Higher values of noise, however, can desta- 
bilize all minority clusters and thus promote unification of 
opinions in the society. At very high levels of noise, however, 
individuals are likely to adopt opinions that are independent 
of their immediate social context. This not only disrupts clus- 
ters, but it also prevents unification of opinions in the society. 
In effect, everyone switches his or her opinions in a more-or- 
less random fashion. 

In a stable society characterized by low levels of noise, 
then, a stable pattern of relatively small clusters is to be ex- 
pected, whereas in a somewhat less stable society character- 
ized by moderate levels of noise, larger clusters and greater 
opinion polarization is to be expected. With further increases 
in societal instability, one might expect a breakdown in mi- 
nority opinion clusters and a tendency toward societal uni- 
fication in opinion. Finally, in a highly modernized and 
unstable society, one would expect the pattern of opinions to 
be largely independent of the pattern of social ties (e.g., 
neighborhood influence), demonstrating instead the influence 
of other factors, such as selective exposure to the media and 
contact with other cultures. 

Cultures also differ in their respective values and prefer- 
ences regarding everything from clothing to religion. This 
feature is represented in the model as bias. If a new idea res- 
onates well with a culture’s prevailing values and prefer- 
ences, it will take somewhat less social influence for the idea 
to take hold in the society. But if the idea runs counter to cul- 
tural values, it is likely to be resisted even if it is supported by 
considerable influence. Communist ideology was never fully 
embraced in Poland, for example, despite the considerable 
influence exerted by the government, because communist 
values ran counter to strong Polish traditions of independence 


and Catholicism. As noted above, research exploring the 
social change implications of the model has verified that cul- 
tural bias is indeed a significant factor in determining the ex- 
tent to which a new idea or ideology can take hold in a society 
(cf. Nowak & Vallacher, 2001). 

It is interesting to consider cultural differences in terms of 
the specific combinations of self-influence, noise, and bias. 
Two industrialized societies may both have high levels of 
self-influence (i.e., an individualistic orientation), for exam- 
ple, but they may differ considerably in their respective lev- 
els of noise (e.g., selective exposure to mass media) or their 
bias toward various positions (e.g., religious beliefs). Be- 
cause each of these variables plays a unique role in social in- 
fluence, the interaction among them is likely to be decisive in 
shaping the predominant form of social influence characteriz- 
ing a given society. Cultural variation in social influence 
processes, in other words, conceivably can be traced to the 
specific blend of variables in the cellular automata model. 
The investigation of this possibility provides an important 
agenda for future research concerning the relationship be- 
tween micro- and macrolevels of social reality. 


TOWARD COHERENCE IN SOCIAL INFLUENCE 


Social influence is clearly a big topic, a fact that reflects its 
centrality to the field of social psychology. The enormous 
range of ideas and principles associated with this topic, how- 
ever, is a mixed blessing. On the one hand, the diversity of 
social influence phenomena and processes attests to the un- 
deniable complexity of human social experience. But on 
the negative side of the ledger, this very diversity can prove 
vexing for those—laypeople and theorists alike—who seek 
integration and synthesis in their understanding. Several hun- 
dred studies and dozens of distinct mechanisms may well be 
necessary to capture the nuances of such a wide-ranging 
topic, but this state of affairs does little to inspire a feeling of 
coherent understanding. Like the field of social psychology 
as a whole (cf. Vallacher & Nowak, 1994), the subfield of 
social influence is highly fragmented, with poorly defined 
connections among the separate elements that define it. 
Ironically, if there is a basis for theoretical coherence in 
social influence, it may reflect what psychologists have 
learned about the dynamics of coherence in recent years. 
Despite the enormous complexity of human minds and so- 
cial groups—or perhaps because of such complexity— 
psychological systems at different levels of personal and 
social reality display self-organization and the emergence of 
higher-level properties. The mutual influences among the 
elements in each system promote such emergence, and the 
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resultant properties in turn provide functional integration 
and coordination for the component elements. This reciprocal 
feedback between lower-level elements and higher-level 
properties may constitute an invariant principle common to 
all social psychological processes—or to all complex sys- 
tems, for that matter (cf. Nowak & Vallacher, 1998a). Thus, 
the specific cognitive elements defining the stream of thought 
become self-organized with respect to higher-order judg- 
ments and values (Vallacher, Nowak, & Kaufman, 1994), 
specific movements and perceptions become coordinated to 
produce meaningful action (cf. Vallacher et al., 1998), indi- 
viduals become integrated into higher-order functional units 
such as dyads and social groups (e.g., Nowak et al., 2002), 
and social groups become coordinated with respect to larger 
goals and values that define the social system in which they 
are embedded (cf. Nowak & Vallacher, 2001). 

With this in mind, it is tempting to consider whether a 
press for higher-order coherence provides a common denom- 
inator for the otherwise dizzying array of specific social in- 
fluence processes. Perhaps seemingly distinct means of 
influencing people prove effective or ineffective depending 
on how well each taps into established rules regarding coher- 
ence in thought and action. If so, many of the phenomena dis- 
cussed in this chapter could be reframed so as to underscore 
their common features, and new predictions could be gener- 
ated about the factors that determine whether a given influ- 
ence strategy will prove successful in a particular context for 
a particular target. The central idea is that influence involves 
resynchronization of the elements in the target’s relevant cog- 
nitive structure. Achieving resynchronization is difficult, 
however, when the cognitive structure in question is well- 
integrated and stable. To promote a change in behavior in 
this case, it is necessary to disassemble or otherwise destabi- 
lize the associated cognitive structure. After the structure is 
destabilized, the person is primed for resynchronization in 
line with cues to higher-order meaning provided by the influ- 
ence agent. 

A basic strategy for resynchronizing people’s thoughts and 
desires follows from the emergence process of action identifi- 
cation theory (cf. Vallacher & Wegner, 1987; Vallacher et al., 
1998). Research on this process has revealed that when peo- 
ple do not have an integrated representation of what they are 
doing, they become highly sensitive to coherent perspectives 
on their behavior provided by others. The extrapolation of 
this process to social influence is straightforward. In this sce- 
nario, the influence agent first induces the target to consider 
the relevant topic or action in concrete, low-level terms. Get- 
ting the target to engage in topic-relevant behavior has this 
effect, provided the behavior is sufficiently novel or complex 
that it requires attention to detail. Simply describing an action 
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in terms of its details can also induce low-level identification, 
as can presenting the target with a surplus of concrete infor- 
mation regarding the attitude object. From this disassembled 
state, the target experiences a heightened press for integra- 
tion. Left to his or her own devices, the target might emerge 
with a higher-level frame for the action or topic that reflects 
past positions or perhaps one that reflects a new integration 
altogether (Vallacher & Nowak, 1997; Vallacher et al., 1998). 
If, however, the influence agent offers a message that pro- 
vides the missing integration before the target has demon- 
strated emergence on his or her own, the target is likely to 
embrace this message as an avenue of emergent understand- 
ing, even if it conflicts with his or her prior conception. 

This general approach to influence is effective in changing 
people’s understanding of their own behavior, but with few 
exceptions (e.g., Davis & Knowles, 1999; Vallacher & Selz, 
1991) this approach has not been extended to other domains 
of influence. Nonetheless, a wide variety of established influ- 
ence strategies can be reframed as the disassembly of a 
coherent state into its lower-level elements, setting the stage 
for a reconfiguration of the elements in line with the influence 
agent’s agenda. Thus, any strategy that involves inducing the 
target to engage in acts that are at least somewhat novel or 
time-consuming can create the necessary precondition for 
guided emergence, as can providing the target with ambigu- 
ous or conflicting information that is open to different higher- 
level interpretations. Placing the target in a situation that lacks 
a priori structure and coherence can similarly make him or her 
vulnerable to emergent norms for how to act. Certain dimen- 
sions of individual difference are also associated with vulner- 
ability to social influence, and these too can be considered in 
light of the emergence scenario. Self-uncertainty (e.g., Swann 
& Ely, 1984; Vallacher, 1980), low levels of personal agency 
(Vallacher & Wegner, 1989), field dependence (Witkin, Dyk, 
Faterson, Goodenough, & Karp, 1962), low cognitive differ- 
entiation (Bieri, Atkins, Briar, Leaman, Miller, & Tripodi, 
1966), and external locus of control (Rotter, 1966) are clearly 
distinct constructs, but each can be seen as a manifestation of 
weak cognitive structure concerning a relevant domain of 
judgment and self regulation (i.e., the self, action, other peo- 
ple, society). Lacking internal coherence, a person character- 
ized in this fashion utilizes information provided by others as 
a frame around which he or she can achieve a sense of per- 
sonal integration. 

The failure of influence strategies, meanwhile, may reflect 
a corresponding failure to disrupt or otherwise disassemble 
the target’s prevailing understanding of the action or topic at 
issue. Thus, resistance to influence (e.g., psychological reac- 
tance) may be enhanced when the target’s prevailing per- 
spective is not sufficiently deconstructed for him or her to 


embrace the influence agent’s alternative perspective. In 
essence, the emergence scenario suggests that all manner of 
influence, from compliance with requests to brainwashing, 
are built on a shared platform emphasizing people’s inherent 
press for coherent understanding. 

We should note, however, that complete integration is 
rarely attained in complex systems. The cellular automata 
model of social influence, for example, commonly produces 
a highly clustered rather than unified social structure, even 
though the underlying dynamics are in service of self- 
organization and coherence (e.g., Nowak et al., 1990, 1998; 
Nowak & Vallacher, 1998b). Differentiation as opposed to 
unification is commonly observed as well in people’s self- 
structure (Nowak et al., 2000), despite a sustained press for 
integration in self-understanding. It is unreasonable, then, to 
expect the voluminous literature on social influence to admit 
to a single higher-order principle. Nor should we expect the 
field to reach a static equilibrium, with an immutable set of 
conclusions concerning the ways in which people influence 
one another. Complex systems are inherently dynamic, con- 
tinually evolving and becoming reconfigured in response to 
new influences from the outside. Because interest in social 
influence shows no sign of letting up, we can expect this 
defining area of social psychology to display repeated 
episodes of disassembly and reconfiguration in the years 
to come. 
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WHY PSYCHOLOGY NEEDS 
ENVIRONMENTAL PSYCHOLOGY 


Introduction 


This review, like the model of psychology we advocate, looks 
to the past, present, and future of environmental psychology. 
The chapter begins with a discussion of the importance of 
the socioenvironmental context for human behavior. Having 
demonstrated that the environment, far from being a silent 
witness to human actions, is an integral part of the plot, the 
chapter continues with an examination of the nature and 
scope of environmental psychology. Both its interdisciplinary 
origins and its applied emphasis have conspired to prevent a 
straightforward and uncontentious definition of environmen- 
tal psychology. We review some of these and suggest how 
recent definitions are beginning to adopt a more inclusive, 
holistic, and transactional perspective on people-environment 
relations. The next section discusses the various spatial scales 
at which environmental psychologists operate—from the 
micro level such as personal space and individual rooms, 
public/ private spaces, and public spaces to the macro level of 
the global environment. This incorporates research on the 
home, the workplace, the visual impact of buildings, the 
negative effects of cities, the restorative role of nature, and 
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environmental attitudes and sustainable behaviors. The third 
section takes three key theoretical perspectives that have 
informed environmental psychology—determinism, interac- 
tionalism, and transactionalism—and uses these as an orga- 
nizing framework to examine various theories used by 
environmental psychologists: arousal theory, environmental 
load, and adaptation level theory within a behaviorist and 
determinist paradigm; control, stress adaptation, behavioral 
elasticity, cognitive mapping, and environmental evaluation 
within an interactionist paradigm; and behavior settings, af- 
fordance theory and theories of place, place identity, and 
place attachment within transactionalism. 

The fourth section looks to the future of environmental 
psychology by challenging the assumptions and limiting 
perspectives of present research. The issues at the forefront 
of the political and environmental agenda at the beginning of 
the twenty-first century—human rights, well-being and qual- 
ity of life, globalization, and sustainability—need to be ad- 
dressed and tackled by environmental psychologists in a way 
that incorporates both cross-cultural and temporal dimen- 
sions. The impact of environmental psychology may be en- 
hanced if researchers work within the larger cultural and 
temporal context that conditions people’s perceptions and 
behaviors within any given environment. This concluding 
section discusses some of the work being undertaken by 
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environmental psychologists seeking to meet this challenge 
and address what some have considered to be an application 
gap within environmental psychology (i.e., the gap between 
the generation of general principles and on-the-ground ad- 
vice of direct use to practitioners). 


The Environment as Context 


One of the shortcomings of so much psychological research 
is that it treats the environment simply as a value-free back- 
drop to human activity and a stage upon which we act out our 
lives. In essence, the environment is regarded as noise. It is 
seen as expedient in psychological investigations and experi- 
ments to remove or reduce as much extraneous noise as 
possible that will affect the purity of our results. This is un- 
derstandable and desirable in many situations, but when it 
comes to understanding human perceptions, attitudes, and 
behaviors in real-world settings, the environment is a critical 
factor that needs to be taken into account. 

A paper presented at a recent environmental psychology 
conference reported on an investigation of children’s class- 
room design preferences. The study was undertaken by means 
of showing the children photographs of different classroom 
layouts. There were three principal methodological flaws that 
illustrate well the issue of the role and importance of environ- 
mental context in psychology. First, the photographs included 
neither adults nor children. In other words, the photographs 
did not illustrate or indicate how the environment was actu- 
ally being used by either children or adults. When the re- 
searcher was asked why children and adults were excluded 
from the photographs, the response was that they would have 
been a distraction. This is another variant of the failing iden- 
tified in the previous paragraph. In this case, people are 
treated as noise and become environmental objects. It is as- 
sumed that if we can get people to rate environments prefer- 
entially without those environments being contaminated with 
people, we will arrive at a purer measure of the impact of the 
environment on human preferences. 

The second flaw with this study was that all the pho- 
tographs were taken at adult height, thereby providing an 
adult perspective on the environment even though children’s 
perceptions and preferences were being sought. Finally, all 
the photographs were taken from an adult point of view (e.g., 
the framing, focus, what was included and excluded) as if the 
environment is visually and symbolically neutral. In other 
words, the researcher thought that taking photographs of the 
classrooms could provide an objective and impartial view of 
the environment. If the photographs had been taken by the 
children from their own perspective, the photographs might 
have come to mean very different things to the children 


and brought about a very different evaluation. The environ- 
ment provides us with opportunities and constraints—sets of 
affordances—that we can choose to draw upon (Gibson, 
1979). Of course, not all children will perceive the same 
affordances in a single environment, nor will similar environ- 
ments generate the same perceptions and evaluations in a sin- 
gle child (Wohlwill & Heft, 1987). 

It is a characteristic feature of environmental psychology 
that in any environmental transaction attention should focus 
on the user of the environment as much as on the environ- 
ment itself. For example, as it is not possible to understand 
the architecture and spatial layout of a church, mosque, or 
synagogue without reference to the liturgical precepts that in- 
fluenced their design, so it is no less possible to understand 
any landscape without reference to the different social, eco- 
nomic, and political systems and ideologies that inform them. 

One might well imagine, for example, a school landscape 
that looks extremely tidy, well kempt, with clear demarcation 
of spaces, producing a controlled and undifferentiated envi- 
ronment with easy surveillance, and with learning and other 
activities taking place in predetermined spaces. Such a de- 
signed environment reflects a traditional view of the passive, 
empty learner waiting for educational input. If one now 
imagines a school landscape that appears on the surface to be 
more haphazard and not so well ordered, unkempt with long 
grass, soft or even no edges between activities, less easy sur- 
veillance, and no obvious places for learning specific curricu- 
lum subjects, then this would seem to be antithetic to learning 
and education. However, if one switches to another model of 
the child—the child as a stimulus-seeking learner—then the 
sterile, formal, and rigid landscape just described would 
seem like an inappropriate place for learning. On the other 
hand, providing an unstructured, environmentally diverse set 
of landscapes would seem to be an ideal place for learning, 
encouraging children to seek out the stimulation that they 
need for learning and development. Reading the environment 
in terms of the assumptions it makes about the user is in- 
structive. Understanding and designing the environment for 
human activity can be achieved only when both the environ- 
ment and the user are considered together as one transaction. 

The environmental setting is not a neutral and value-free 
space; it is culture bound. It is constantly conveying mean- 
ings and messages and is an essential part of human func- 
tioning and an integral part of human action. As Getzels 
(1975) writes, “Our vision of human nature finds expression 
in the buildings we construct, and these constructions in turn 
do their silent yet irresistible work of telling us who we are 
and what we must do” (p. 12). The environment embodies the 
social and cultural values of those who inhabit it. Some psy- 
chologists argue that we need only focus on people because 


even though the environment contains the manifest evidence 
of the values and meanings held by people, these values and 
meanings can be investigated at source (i.e., in the people 
themselves). As we know that attitudes are not always good 
predictors of behavior, so we might also assume that what 
people say about the environment and their actions within it 
may actually be contradicted by extant evidence from the en- 
vironment itself. Furthermore, the environment is not just a 
figment of our imaginations or a social construct; it is real. If 
we take a determinist or even an interactionist position, we 
would acknowledge that the environment can have a direct 
effect on human actions. Within transactionalism the envi- 
ronment has a physical manifestation in order to confer 
meaning in the first place. The environment embodies the 
psychologies of those who live in it. It is used to confer 
meaning, to promote identity, and to locate the person so- 
cially, culturally, and economically. 

The role of environmental context in influencing social be- 
havior can be exemplified by reference to interpersonal rela- 
tions as well as institution-person relations. Helping behavior 
is a good example of the influence of environmental context 
on the interpersonal behavior. The conclusions of numerous 
research studies undertaken since the 1970s (Korte, 1980; 
Korte & Kerr 1975; Krupat, 1985; Merrens, 1973) consis- 
tently demonstrate that the conditions of urban life reduce the 
attention given to others and diminish our willingness to help 
others. Aggressive reactions to a phone box that is out of order 
are more common in large cities than in small towns (Moser, 
1984). Those findings have been explained by the levels of 
population densities such as we encounter in large urban areas 
that engender individualism and an indifference toward oth- 
ers, a malaise noted in 1903 by Simmel (1903/1957), who 
suggested that city life is characterized by social withdrawal, 
egoistic behaviors, detachment, and disinterest toward others. 
The reduction of attention to others can be observed also 
when the individual is exposed to more isolated supplemen- 
tary stressful condition (Moser, 1992). Thus, excessive popu- 
lation density or the noise of a pneumatic drill significantly 
reduces the frequency of different helping behaviors (Moser, 
1988). If, generally speaking, politeness (as measured by 
holding the door for someone at the entry of a large depart- 
ment store) is less frequent in Paris than in a small provincial 
town, this would suggest that population density and its im- 
mediate impact on the throughput of shoppers will affect help- 
ing and politeness behavior (Moser & Corroyer, 2001). 

A good example of the effect of environmental context on 
human attitudes and behaviors in an institution-person setting 
can be found in Rosengren and DeVault’s (1970) study of the 
ecology of time and space in an obstetric hospital. They found 
that both the attitudes and behaviors of all the protagonists 
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involved in the process of delivering a baby—the mother, 
nurses, doctors—were a function not only of where they were 
situated but also of when they were situated there. Authority 
(i.e., who managed the mother’s labor and delivery) was not 
so much a function of a formal position in the hierarchy but of 
where each person was at a particular time and who con- 
trolled that space. This time-space interaction had an impact 
not only on staff-patient relations but also on perceptions of 
the appropriateness of medical procedures as they related to 
the management of pain. 

The environmental context in which perceptions occur, 
attitudes are formed, and behavior takes place also has a 
temporal dimension. We cannot understand space and place 
without taking into account time. We encounter events not 
only in the present but also in the past and in the future. We 
experience places now, in the present, as well as places that 
have had a past that impinges on and colors our interpretation 
of the present. Furthermore, these same places have a future 
that, for example, through anticipatory representations may 
guide our actions (Doise, 1976). 


The Nature and Scope of Environmental Psychology 


Environmental psychology studies individuals and groups 
in their physical and social contexts by giving a prominent 
place to environmental perceptions, attitudes, evaluations and 
representations, and accompanying behavior. Environmental 
psychology focuses both on the effects of environmental con- 
ditions on behavior and on how the individual perceives and 
acts on the environment. The point of departure of analysis is 
often the physical characteristics of the environment (e.g., 
noise, pollution, planning and layout of physical space) acting 
directly on the individual or mediated by social variables in 
the environment (e.g., crowding, population heterogeneity). 
But physical and social factors are inextricably linked in their 
effects on individuals’ perceptions and behaviors (Altman & 
Rogoff, 1987). To achieve this effectively, research in envi- 
ronmental psychology aims to identify processes that regulate 
and mediate this relationship. Environmental psychologists 
work in collaboration with other psychologists such as social, 
cognitive, and occupational psychologists, as well as other 
disciplines and professions such as architects, educationalists, 
environmental scientists, engineers, and landscape architects 
and planners. 

Environmental psychology’s unit of analysis is the 
individual-environment relation. One can study this relation 
only by examining cognitions and behaviors that occur in 
real-world situations. For this reason, environmental psy- 
chology operates according to an inductive logic: Theories 
are generated from what can be observed and from data 
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unearthed in research in the real world. Kurt Lewin’s advo- 
cacy of theory-driven practical research ought to have a reso- 
nance with environmental psychologists. 

The conceptual model by which our perceptions, represen- 
tations, and behaviors are interdependent with the physical 
and social environment has frequently been mentioned in 
psychology. In their work on perception, Brunswik (1959) 
and Gibson (1950) referred to the role of the environment; 
Tolman (1948) used the concept of the mental map to de- 
scribe the cognitive mechanisms that accompany maze learn- 
ing; and in the domain of the psychology of form Lewin 
(1951) elaborated the theory of the environmental field, con- 
ceived as a series of forces that operate on the individual. 
Lynch’s study of The Image of the City (1960), although by an 
urban planner, was another major landmark in the early years 
of environment-behavior research. The first milestones of en- 
vironmental psychology date from the late 1960s (Barker, 
1968; Craik, 1970; Lee, 1968; Proshansky, Ittelson, & Rivlin, 
1970). The intellectual and international origins of envi- 
ronmental psychology are considerably broader than many, 
typically North American, textbooks suggest (Bonnes & 
Secchiaroli, 1995). 

Although environmental psychology can justly claim to 
be a subdiscipline in its own right, it clearly has an affinity 
with other branches of psychology, especially social psychol- 
ogy, but also cognitive, organizational, and developmental 
psychology. Examples of where environmental psychology 
has been informed by and contributed to social psychology 
are intergroup relations, group functioning, performance, 
identity, conflict, and bystander behavior. However, social 
psychology often minimizes the role of the environment as a 
physical and social setting and treats it as simply the stage on 
which individuals and groups act rather than as an integral 
part of the plot. Environmental psychology adds an important 
dimension to social psychology by making sense of differ- 
ences in behavior and perception according to contextual 
variables—differences that can be explained only by refer- 
ence to environmental contingencies. 

Although there are strong links to other areas of psychol- 
ogy, environmental psychology is unique among the psycho- 
logical sciences in terms of the relationship it has forged with 
the social (e.g., sociology, human ecology, demography), en- 
vironmental (e.g., environmental sciences, geography), and 
design (e.g., architecture, planning, landscape architecture, 
interior design) disciplines. 

Because of the difficulties of defining environmental psy- 
chology, many writers have sought instead to characterize or 
describe it, as we ourselves did in part earlier. The most re- 
cent of these can be found in the fifth edition of Bell, 
Greene, Fisher, and Baum’s (2001) textbook Environmental 


Psychology. They suggested that (a) environmental psychol- 
ogy studies environment-behavior relationships as a unit, 
rather than separating them into distinct and self-contained 
elements; (b) environment-behavior relationships are really 
interrelationships; (c) there is unlikely to be a sharp distinc- 
tion between applied and basic research; (d) it is part of an 
international and interdisciplinary field of study; and (e) it 
employs an eclectic range of methodologies. But description 
is not a substitute for definition. Leaving aside Proshansky 
et al.’s (1970, p. 5) oft-quoted “environmental psychology is 
what environmental psychologists do,” the same authors 
suggested that “in the long run, the only really satisfactory 
way ...1s in terms of theory. And the simple fact is that as 
yet there is no adequate theory, or even the beginnings of a 
theory, of environmental psychology on which such a defin- 
ition might be based” (p. 5). By 1978, Bell, Fisher, and 
Loomis, in the first edition of Environmental Psychology, 
cautiously suggested that it is “the study of the interrela- 
tionship between behavior and the built and natural envi- 
ronment,” although they preferred to opt for the initial 
Proshansky et al. conclusion. Other, not dissimilar, defini- 
tions followed: “an area of psychology whose focus of in- 
vestigation is the interrelationship between the physical 
environment and human behavior and experience” (Holahan, 
1982, p. 3); “is concerned with the interactions and relation- 
ships between people and their environment” (Proshansky, 
1990); “the discipline that is concerned with the interactions 
and relationships between people and their environments” 
(McAndrew, 1993, p. 2). 

The problem with some of these definitions is that 
although they describe what environmental psychologists do, 
unfortunately they also hint at what other disciplines do as 
well. For example, many (human) geographers could proba- 
bly live quite comfortably with these definitions. By 1995, 
Veitch and Arkkelin were no less specific and perhaps even 
enigmatic with the introduction of the word “enhancing”: 
“a behavioural science that investigates, with an eye towards 
enhancing, the interrelationships between the physical envi- 
ronment and human behaviour.” 

These are clearly not the only definitions of environmen- 
tal psychology, but they are reasonably representative. The 
definitions have various noteworthy features. First, because 
the area is necessarily interdisciplinary, the core theoretical 
perspectives that should inform our approaches have some- 
times been minimized. Thus Bonnes and Secchiaroli (1995) 
drew attention to the need to define the field as a function of 
the psychological processes studied. Most definitions of en- 
vironmental psychology focus on the relationship between 
the environment and behavior, yet paradoxically most of the 
research in environmental psychology has not been about 


behavior but perceptions of and attitudes toward the environ- 
ment and attitudes toward behavior in the environment. Sec- 
ond, many of the definitions refer to relationships between 
people and the physical or built environment. Proshansky 
acknowledged that this was problematic because it fails to 
recognize the importance of the social environment. The dis- 
tinction between built and natural environments is becoming 
increasing untenable given the mutual dependency and reci- 
procity that exist between them, especially within the context 
of the sustainability debate. Finally, many of the definitions 
talk about the individual interacting with the environment. 
Unfortunately, this ignores or minimizes the social dimen- 
sion of environmental experience and behavior. This is a 
strange omission given the strong influence of social psy- 
chology on the area, although it is perhaps a reflection of the 
individualistic nature of much social psychology. 

Gifford (1997) more usefully offered the following: 
“Environmental psychology is the study of transactions 
between individuals and their physical settings. In these 
transactions, individuals change the environment and their 
behaviour and experiences are changed by the environment. 
Environmental psychology includes research and practice 
aimed at making buildings more humane and improving our 
relationship with the natural environment” (p. 1). This far 
more inclusive definition captures key concepts such as ex- 
perience, change, people-environment interactions and trans- 
actions, and natural versus built environments. As long ago 
as 1987, Stokols (1987) suggested that “the translation of a 
transactional world view into operational strategies for the- 
ory development and research . . . poses an ambitious but 
promising agenda for future work in environmental psychol- 
ogy” (p. 41). The essence of a transactional approach, 
Stokols continued, is “its emphasis on the dynamic interplay 
between people and their everyday environmental settings, or 
‘contexts’ ” (p. 42). 


DOMAINS OF ENVIRONMENTAL PSYCHOLOGY 


Environmental psychology deals with the relationship be- 
tween individuals and their life spaces. That includes not 
only the environment to provide us with all what we need to 
survive but also the spaces in which to appreciate, under- 
stand, and act to fulfill higher needs and aspirations. 

The individual’s cognitions and behaviors gain meaning 
in relation to the environment in which these cognitions 
or behaviors are developed. Consequently, environmental 
psychologists are confronted with the same issues that 
concern all psychologists. The basic domains of environmen- 
tal psychology include (a) environmental perceptions and 
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cognitions, (b) environmental values, attitudes, and assess- 
ment, and (c) behavioral issues. It studies these processes in 
relation to the environmental settings and situations in which 
they occur. For instance, environmental perceptions are not 
typically studied with the aim of identifying general laws 
concerning different aspects of the perceived object. Envi- 
ronmental perception deals with built or natural landscape 
perception with an emphasis on sites treated as entities 
(Ittelson, 1973); the perceiver is considered part of the scene 
and projects onto it his or her aspirations and goals, which 
will have an aesthetic dimension as well as a utilitarian func- 
tion. The question the perceiver asks in appraising a land- 
scape is not just “Do I like the appearance of this landscape?” 
but also “What can this landscape do for me (i.e., what func- 
tion does it serve)?” (Lee, 2001). Likewise, interpersonal 
behavior within an environmental psychology context is 
studied in order that we might better understand how envi- 
ronmental settings influence these relationships (e.g., urban 
constraints on the frequency of relational behavior with 
friends or relatives; Moser, 1992). 

Because of its very focus, environmental psychology has 
been and remains above all a psychology of space to the 
extent that it analyzes individuals’ and communities’ per- 
ceptions, attitudes, and behaviors in explicit relation to the 
physical and social contexts within which people and 
communities exist. Notions of space and place occupy a 
central position. The discipline operates, then, at several 
levels of spatial reference, enabling the investigation of 
people-environment interactions (at the individual, group, or 
societal level) at each level. Reference to the spatial dimen- 
sion makes it possible to take into account different levels 
of analysis: 


1. Private spaces (individual level): personal and private 
space, dwelling, housing, workplace, office 


2. Public/private environments (neighborhood-community 
level): semipublic spaces, blocks of flats, the neighbor- 
hood, parks, green spaces 


3. Public environments (individual-community level, inhabi- 
tants): involving both built spaces (villages, towns, cities) 
as well as the natural environment (the countryside, land- 
scape, etc.) 

4. The global environment (societal level): the environment 
in its totality, both the built and the natural environment, 
natural resources 


Environmental psychology analyzes and characterizes 
people-environment interactions and/or transactions at these 
different environmental levels. These relations can best be 
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understood through perception, needs, opportunities, and 
means of control. 


Private Spaces 


Personal space and privacy are important for individual and 
community well-being and quality of life. Altman (1975, 
p. 18) defined privacy as the “selective control of access to 
the self or one’s group.” Thus, privacy implicates control 
over the immediate environment. It is important for the indi- 
vidual to be able to organize and personalize space. Privacy 
represents a dynamic process of openness and closeness to 
others (Altman & Chemers, 1980). Thus, privacy adjust- 
ments may be established with physical or even psychologi- 
cal barriers wherever individuals seek to isolate or protect 
themselves from the intrusion of others. This may be impor- 
tant in one’s home, but also in the work environment or dur- 
ing leisure activities (e.g., on the beach). Privacy involves not 
only visual but also auditory exclusivity (Sundstrom, Town, 
Rice, Osborn, & Brill, 1994). Steady or transitionally occu- 
pied places produce place attachment and are often accompa- 
nied with ties to personal objects such as furniture, pictures, 
and souvenirs that mark the appropriation (Korosec-Serfaty, 
1976). Appropriation can be defined as a particular affective 
relation to an object. The appropriated object may become 
part of the identity of the individual (Barbey, 1976). The ap- 
propriation of space has essentially a social function in the 
sense that the individual or the group marks control over the 
space (Proshansky, 1976), which in turn produces a feeling 
of security. When appropriation is not shared with others, or 
only with one’s group, control is absolute. 

The use of space in the home or the office environment has 
produced a variety of studies. The intended function of a 
room (e.g., kitchen, dormitory, etc.) implies a specific design 
and determines how the space will be used. There are consid- 
erable individual and cultural differences in the use of space 
in one’s home (Kent, 1991; Newell, 1998; Rapoport, 1969). 

Personal space is defined as the invisible boundary sur- 
rounding each individual into which others may not intrude 
without causing discomfort (Hall, 1966). Personal space reg- 
ulates interactions, and its extension depends on environmen- 
tal variables. Its functions are twofold: protection, in which it 
acts as a buffer against various interpersonal threats, and 
communication purpose, in which it determines which sen- 
sory communication-channel (touch, visual, or verbal) can 
and should be used. Thus, interpersonal distances are cues for 
understanding the specific relationship of two individuals. 
Research has looked at various social determinants of per- 
sonal space such as culture and ethnicity, age and gender 
(e.g., Aiello, 1987; Crawford & Unger, 2000), psychological 


factors (Srivastava & Mandal, 1990), and physical factors 
(Altman & Vinsel, 1977; Evans, Lepore, Shejwal, & Palsane, 
1998; Jain, 1993). 

In contrast to personal space, territoriality is visibly de- 
limited by boundaries and tends to be home or workplace 
centered. It is a demarcated and defended space and invari- 
ably is an expression of identity and attachment to a place 
(Sommer, 1969). Territories are controlled spaces that serve 
to enable the personalization and regularization of intrusion. 
Therefore, territoriality has an essential function in providing 
and promoting security, predictability, order, and stability in 
one’s life. Altman and Chemers (1980) identified three types 
or levels of territory: primary territories (e.g., home or office 
space), where control is permanent and high and personaliza- 
tion is manifest; secondary territories (e.g., the classroom or 
open plan office), where control, ownership, and personaliza- 
tion are temporary; and public territories (e.g., the street, the 
mall), where there is competition for use, intrusion is difficult 
to control, and personalization is largely absent. 


Public/Private Environments 
The Home Environment 


Analyses at this level deal with the immediate environment 
of the individual’s living space. These could be rows of 
houses or apartment blocks, the immediate neighborhood, the 
workplace, or the leisure areas in the immediate surroundings 
of the home (e.g., parks and green areas). These areas are re- 
ferred to as semipublic or semiprivate spaces, which means 
that the control over them is shared within a community. 

A great deal of research in environmental psychology 
concerns the immediate home environment. Concepts like 
attachment to place and sense of community contribute to 
our understanding of how individuals and groups create 
bonds to a specific place. Although the size of the habitable 
space is essential for residential satisfaction, other aspects of 
the living conditions modulate its importance as well. Resi- 
dents enhance the value of their neighborhood through the 
transactional relationships they establish with their place of 
residence. For those who have already acquired basic living 
conditions and who have an income that allows them to 
achieve a good quality of life, the agreeable character of the 
neighborhood has a modulating effect on satisfaction con- 
cerning available space in the dwelling. The affective rela- 
tionship with the dwelling and anchorage in childhood seem 
to play an important role. Giuliani (1991) found that affec- 
tive feelings toward the home were attributable to changing 
conceptions of the self in relation to the home over the life 
span. 


The feeling of being at home is closely connected to a 
feeling of well-being and varies with the extent of the spatial 
representation of the neighborhood. A spatially narrow repre- 
sentation is correlated with a weak affective investment in the 
neighborhood (Fleury-Bahi, 1997, 1998). The degree of sat- 
isfaction felt with three of a neighborhood’s environmental 
attributes (green spaces, aesthetics of the built framework, 
and degree of noise) has an effect on the intensity of the af- 
fectivity developed toward it, as well as feelings of well- 
being. The feeling of being at home in one’s neighborhood is 
linked to the frequency of encounters, the extent of the sphere 
of close relations, the nature of local relationships, and satis- 
faction with them. Low and Altman (1992) argued that the 
origin and development of place attachment is varied and 
complex, being influenced by biological, environmental, 
psychological, and sociocultural processes. Furthermore, the 
social relations that a place signifies may be more important 
to feelings of attachment than the place itself. 

Besides the home and neighborhood environments, other 
domains involve a problematic congruence between people 
and their environment (e.g., work, classroom, and institu- 
tional environments such as hospitals, prisons, and homes for 
children or the elderly). How can these environments be de- 
signed to meet the needs of their occupants? We illustrate this 
by examining one setting—the workplace. 


Environmental Psychology in the Workplace 


Increasing attention is being paid to the design of the work- 
place so that it matches more effectively the organization’s 
goals and cultural aspirations as well as employee needs and 
job demands and performance. There has been a long history 
of research into the workplace (Becker, 1981; Becker and 
Steele, 1995; Sundstrom, 1987; Wineman, 1986). Indeed, the 
famous Hawthorne effect first noted in the 1920s emerged 
from a study of the effect of illumination on productivity. 
Since then there have been many studies examining the am- 
bient work environment and investigating the impact of 
sound, light, furniture layout, and design on performance and 
job satisfaction. It is now recognized that the environment, 
space, and design can operate at a subtler level and have an 
impact on issues such as status, reward, and the promotion of 
corporate culture. 

Decisions about space use and design should be examined 
for their embedded assumptions regarding how they will en- 
hance or detract from the organization’s goals and values. In 
other words, whose assumptions underlie the design and 
management of space, and what are the implications of 
space-planning decisions? The relationship between the or- 
ganization’s culture, the physical planning of the buildings 
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or offices, and the feel, look, and use of the facilities becomes 
most apparent especially when there is a mismatch. A mis- 
match often occurs when a new building is planned according 
to criteria such as these: How many people should it accom- 
modate? How many square feet should it occupy? How much 
equipment should it have? How should it look to visitors? 
Questions typically posed and addressed by environmental 
psychologists have a different emphasis: Will the designs and 
space layout enhance or detract from the desired corporate 
work styles? Is the organization prepared to accept that em- 
ployees have different working styles and that these should 
be catered to in the provision of space and facilities? How 
much control does the organization currently exert over its 
employees’ use of time and space? To what extent are em- 
ployees permitted to modify their own environment so that it 
enables them to do their job more effectively? In what way, 
for whom, and how does the management and design permit, 
encourage, or enhance the following: personal and group 
recognition, environmental control (heating, lighting, venti- 
lation, amount and type of furniture, personalized space), so- 
cial integration and identity, communication within the 
working group, communication with other working groups, 
and appropriate levels of privacy? How are issues such as in- 
dividual and group identity; individual capacities, needs, and 
preferences; and working patterns reflected in space planning 
and the allocation of environmental resources? Is space and 
resource allocation used as a means of reflecting and re- 
warding status and marking distinctions between job clas- 
sifications? Is the organization prepared to redefine its 
understanding of equity and provide space and facilities on 
the basis of need rather than status? 

There are many ways of looking at the relationship be- 
tween corporate culture and physical facilities. The effective 
use of the organization’s resources lies not in fitting the staff 
to the workplace but in recognizing that there will be a trans- 
action between staff and workplace so that if the employee 
cannot or will not be forced into the setting, they will either 
attempt to modify the setting so that it does approximate more 
closely their working needs and preferences or become dis- 
satisfied, disaffected, and unproductive. For example, instead 
of assigning an employee just one space, consideration 
should be given to permitting if not encouraging. Instead of 
working in just one place (e.g., a desk), some companies are 
giving employees access to a number of spaces (e.g., hot 
desking) that will allow them to undertake their tasks and 
with more satisfaction and effectiveness. Within such an 
arrangement staff cannot claim territorial rights over specific 
spaces but are regarded as temporary lodgers for as long as 
they need that space: informal privacy spaces for talking to 
clients and colleagues; quiet, comfortable spaces for writing 
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reports; workstations for undertaking word processing and 
data analysis; meeting rooms for discussing issues with col- 
leagues; small refreshment areas for informal socializing; and 
quiet, private telephone suites for confidential matters. There 
are various possibilities—the type of spaces will depend on 
the type of work and how it can be undertaken effectively. 


Public Environments 


Cities are a human creation. They concentrate novelty, inten- 
sity, and choice more so than do smaller towns and villages. 
They provide a variety of cultural, recreational, and educa- 
tional facilities. Equally, it is argued that cities have become 
more dangerous because they concentrate all sorts of crime 
and delinquency and are noisy, overcrowded, and polluted. 
Three topics addressed at this environmental level are dis- 
cussed here: the negative effects of cities, the visual impact of 
buildings, and the restorative role of nature. 


The Negative Effects of Cities 


Living in metropolitan areas is considered to be stressful. The 
analysis of behavior in cities has concentrated on noise, den- 
sity, living conditions (difficulty of access to services), high 
crime, and delinquency rates. A series of conceptual consid- 
erations have been proposed to understand the consequences 
of these stressors for typical urban behavior, such as paying 
less attention to others and being less affiliative and less help- 
ful. Environmental overload, environmental stress, and be- 
havioral constraint all point to the potentially negative effects 
of living in cities as compared with living in small towns. En- 
vironmental conditions like noise and crowding not only af- 
fect general urban conditions but also have a specific effect 
on behavior. A comparison of behavior at the same site but 
under different environmental conditions (noisy-quiet, high- 
low density) shows a more marked negative effect in the 
case of high noise and high density (Moser, 1992). Higher 
crime and delinquency rates are commonly explained by the 
numerous opportunities that the city offers, along with dein- 
dividuation (Zimbardo, 1969). The probability of being rec- 
ognized is lower, and the criminal can escape without being 
identified. Fear of crime (which is not necessarily correlated 
with objective crime rates) restricts people’s behavior by 
making them feel vulnerable. It is exacerbated by an environ- 
ment that appears to be uncared for (e.g., through littering 
and vandalism). 

Whereas the effect of air pollution on health (e.g., respira- 
tory problems for children and the elderly) is well documented 
(Godlee & Walker, 1992; Lewis, Baddeley, Bonham, & 
Lovett, 1970), it has little direct effect on the behavior of urban 


residents. The relationship between exposure to air pollution 
and health is mediated by perceptions of the exposure (Elliot, 
Cole, Krueger, Voorberg, & Wakefield, 1999). The extent to 
which people feel that they can control the source of air pollu- 
tion, for instance, influences their response to this pollution. 
Perceptions of air pollution are also important because they 
influence people’s responses to certain strategies for air pollu- 
tion management. Whether people perceive air pollution as a 
problem is of course related to the actual existence of the prob- 
lem. Generally, people are more likely to perceive environ- 
mental problems when they can hear (noise), see (smoke), 
smell, or feel them. Another important source of information 
is the media because the media’s interpretation of pollution 
levels may have a social amplification effect and influence 
public perceptions and attitudes (Kasperson et al., 1988). Peo- 
ple believe that heavy-goods vehicles, commuters, and busi- 
ness traffic are the principal sources of urban air pollution. On 
the other hand, school traffic is often seen as one of the most im- 
portant causes of transport problems. It is often argued that 
reducing school trips by car would make a significant differ- 
ence to urban transportation problems. Paradoxically, al- 
though considered to be a major source of congestion, school 
traffic is not seen as a major source of pollution (Gatersleben & 
Uzzell, 2000). 


The Visual Impact of Buildings 


Most of us live in cities. The architecture that surrounds us is 
more than public sculpture. Research on the visual impact of 
buildings demonstrates perhaps more than any other area that 
different user groups perceive and evaluate the environment 
dissimilarly. The criteria used most widely by the public to as- 
sess the visual impact of a building is how contextually com- 
patible it appears to be with the surrounding environment 
(Uzzell, 2000b). Architects and their clients, however, tend to 
value more highly the distinctiveness and contrast of build- 
ings. Although there is a place for both, the indication is that 
there are diverging points of view on what constitutes a desir- 
able building between groups of people (Hubbard, 1994, 
1996). Groat (1994) found differences of opinion to be great- 
est between the public and architects and most similar be- 
tween the public and planners. Several studies (e.g., Purcell & 
Nasar, 1992; Nasar, 1993) have demonstrated that architects 
and educated laypeople differ in their preferences for build- 
ing styles and in the meanings that they infer from various 
styles. For example, Devlin and Nasar (1989) found that 
architects rated more unusual and distinctive residential ar- 
chitecture as more meaningful, clear, coherent, pleasant, and 
relaxing, whereas nonarchitects judged more conventional 
and popular residential architecture as such. Similarly, Nasar 


(1993) found that not only did architects differ from the pub- 
lic in their preferences and in the meanings that they inferred 
from different styles, but they also misjudged the preferences 
of the public. 

Individual design features such as color, texture, illumina- 
tion, and the shape and placement of windows can have a 
significant impact on evaluations. Overall, such research 
findings regarding order (including coherence, compatibility, 
congruity, legibility, and clarity) have been reasonably con- 
sistent; increases in order have been found to enhance the 
evaluative quality of cities (Nasar, 1979), downtown street 
scenes (Nasar, 1984), and residential scenes (Nasar, 1981, 
1983). 


The Restorative Role of Nature 


Despite city living, many urban residents desire a private 
house with garden or at least to be able to visit urban parks and 
recreational areas. Urban residents often seek nature, and 
research points consistently to its positive psychological 
function (Staats, Gatersleben, & Hartig, 1997; Staats, Hartig, 
& Kieviets, 2000). Green spaces and the natural environment 
can provide not only an aesthetically pleasing setting but also 
restorative experiences (Kaplan, 1995), including a positive 
effect on health (Ulrich, 1984; E. O. Moore, 1982). Gifford 
(1987) summarized this research and identified the following 
main benefits of nature: cognitive freedom, escape, the expe- 
rience of nature, ecosystem connectedness, growth, chal- 
lenge, guidance, sociability, health, and self-control. What 
seems to be important is the sense of freedom and control felt 
in nature, in contrast to an urban environment, which is per- 
ceived as constraining. 


The Global Environment 


Local agendas are increasingly informed by global perspec- 
tives and processes (Lechner & Boli, 1999). The interaction 
between the local and the global is crucial and is the essence 
of globalization (Bauman, 1998; Beck, 1999). Although en- 
vironmental issues are increasingly seen as international in 
terms of extent, impact, and necessary response, social psy- 
chological studies have traditionally treated many environ- 
mental problems as locally centered and limited to a single 
country. Thus they have been decontextualised in that not 
only has the local-global environmental dimension been min- 
imized, but perhaps more significantly the local-global social 
psychological effects have also been minimized. This is well 
illustrated by Bonaiuto, Breakwell, and Cano (1996), who 
examined the role of social identity processes as they mani- 
fest themselves in place (i.e., local) and national identity in 
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the perception and evaluation of beach pollution. It was 
found that subjects who were more attracted to their town or 
their nation tended to perceive their local and national 
beaches as being less polluted. 

Three phenomena—mass media coverage of environmen- 
tal issues, the growth in environmental organizations, and the 
placing of environmental issues on international political 
agendas—have, intentionally or unintentionally, emphasized 
the seriousness of global as opposed to local or even national 
environmental problems. On the other hand, it has been sug- 
gested that people are only able to relate to environmental 
issues if they are concrete, immediate, and local. Conse- 
quently, it might be hypothesized that people will consider 
environmental problems to be more serious at a local rather 
than global level. If this is the case, then what is the effect of 
the public’s perceptions of the seriousness of environmental 
problems on their sense of responsibility for taking action? In 
a series of cross-cultural studies undertaken in Australia, 
Ireland, Slovakia, and the United Kingdom, members of the 
public and environmental groups, environmental science stu- 
dents, and children were asked about the seriousness of vari- 
ous environmental problems in terms of their impact on the 
individual, the local area, the country, the continent, and the 
world (Uzzell, 2000b). It was consistently found that respon- 
dents were able to conceptualize problems at a global level, 
and an inverse distance effect was found such that environ- 
mental problems were perceived to be more serious the far- 
ther away they are from the perceiver. This phenomenon 
repeatedly occurred in each country for all groups. An in- 
verse relationship was also found between a sense of respon- 
sibility for environmental problems and spatial scale 
resulting in feelings of powerlessness at the global level. 

We are increasingly conscious of the effect of global envi- 
ronmental processes on local climate. The effects of extreme 
weather conditions—wind, heat or extreme cold—as, for ex- 
ample, investigated by Suedfeld and others in Antarctic survey 
stations, have demonstrated various impacts on individuals 
(Suedfeld, 1998; Weiss, Suedfeld, Steel, & Tanaka, 2000). The 
effect of seasonal daylight availability on mood has been de- 
scribed as seasonal affective disorder (Rosenthal et al., 1984). 
Likewise, sunlight has been found to enhance positive mood 
(Cunningham, 1979). 

The most significant topic analyzed at the level of global 
environment is without doubt individuals’ attitudes toward 
and support of sustainable development. A major challenge 
for environmental psychology is to enable the understanding 
and development of strategies to encourage environmentally 
friendly behavior. There is consistent field research in 
environmental psychology about the ways to encourage 
environmentally responsible behavior concerning resources 
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conservation (e.g., energy and water), littering, and recycling. 
Environmental education, commitment, modeling, feedback, 
rewards, and disincentives are on the whole effective only if 
such behavior is reinforced and if opportunities are provided 
that encourage environmentally friendly behavior. 

Growing ecological concern in our societies is attributed to 
a series of beliefs and attitudes favorable to the environment 
originally conceptualized by Dunlap (1980) and Dunlap and 
Van Liere (1984) as the new environmental paradigm and 
now superseded by the New Ecological Paradigm Scale (Dun- 
lap, Van Liere, Mertig, & Jones, 2000). But it is clear from re- 
search that proenvironmental attitudes do not necessarily lead 
to proenvironmental behaviors. Environmental problems can 
often be conceptualized as commons dilemma problems (Van 
Lange, Van Vugt, Meertens, & Ruiter, 1998; Vlek, Hendrickx, 
& Steg, 1993). In psychology this is referred to as a social 
dilemma. The defining characteristics of such dilemmas are 
that (a) each participant receives more benefits and less costs 
for a self-interest choice (e.g., going by car) than for a public 
interest choice (e.g., cycling) and (b) all participants, as a 
group, would benefit more if they all choose to act in the pub- 
lic interest (e.g., cycling) than if they all choose to act in self- 
interest (e.g., going by car; Gatersleben & Uzzell, in press). 
The social dilemma paradigm can explain why many people 
prefer to travel by car even though they are aware of the envi- 
ronmental costs of car use and believe that more sustainable 
transport options are necessary. It is in the self-interest of 
every individual to use cars. Nevertheless, it is in the common 
interest to use other modes of transport. However, single indi- 
viduals do not cause the problems of car use; nor can they 
solve them. They are typically collective problems. People 
therefore feel neither personally responsible for the problems 
nor in control of the solutions. 


THEORETICAL PERSPECTIVES 
ON KEY QUESTIONS IN 
ENVIRONMENTAL PSYCHOLOGY 


It was suggested at the beginning of this chapter that the 
context—the environment—in which people act out their 
lives is a critical factor in understanding human perceptions, 
attitudes, and behavior. Psychologists have largely ignored 
this context, assuming that most explanations for behavior 
are largely person centered rather than person-in-environ- 
ment centered. Because environmental psychologists are in a 
position to understand person-in-environment questions, the 
history of environmental psychology has been strongly 
influenced by the need to answer questions posed by the 


practical concerns of architects, planners, and other profes- 
sions responsible for the planning, design, and management 
of the environment (Uzzell, 2000a). These questions include 
the following: How does the environment stimulate behavior, 
and what happens with excessive stimulation? How does the 
environment constrain and cause stress? How do we form 
maps of the environment in our heads and use them to navi- 
gate through the environment? What factors are important in 
people’s evaluations of the built and natural environment, 
and how satisfied are they with different environments and 
environmental conditions? What is the influence of the envi- 
ronment or behavior setting on people? What physical prop- 
erties of the environment facilitate some behaviors and 
discourage others? Do we have a sense of place? What effect 
does this have on our identity? In this section we outline 
some of the approaches that have been taken to answer these 
questions. 

Typically, within environmental psychology these ques- 
tions have been addressed from one of three perspectives. The 
first is a determinist and essentially behaviorist perspective 
that argues that the environment has a direct impact on 
people’s perceptions, attitudes, and behaviors. The second 
approach has been referred to as interactionism: The environ- 
ment has an impact on individuals and groups, who in turn 
respond by having an impact on the environment. The third 
perspective is transactional in that neither the person nor the 
environment has priority and neither one be defined without 
reference to the other. Bonnes and Secchiaroli (1995) sug- 
gested that transactionalism has two primary features: the 
continuous exchange and reciprocity between the individual 
and the environment, and the primarily active and intentional 
role of the individual to the environment. 

It is impossible in a chapter of this length to discuss all the 
theories that have driven environmental psychology research. 
The varying scales at which environmental psychologists 
work, as we have seen, assume different models of man, make 
different assumptions about people-environment and envi- 
ronment-behavior relations, require different methodologies, 
and involve different interpretive frameworks. In this section 
we discuss the three principal approaches that have been em- 
ployed in environmental psychology to account for people’s 
behavioral responses to their environmental settings. 


Determinist and Behaviorist Approaches 


Arousal theory, environmental load, and adaptation level 
provide good illustrations of theories that are essentially 
behaviorist in their assumptions and determinist in their 
environment-behavior orientation. 
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Arousal Theory 


Arousal theory stipulates that the environment provides a cer- 
tain amount of physiological stimulation that, depending on 
the individual’s interpretation and attribution of the causes, 
has particular behavioral effects. Each particular behavior is 
best performed at a definite level of arousal. The relation be- 
tween levels of arousal and optimal performance or behavior 
is curvilinear (Yerkes-Dodson law). Whereas individuals 
seek stimulation when arousal is too low, too-high levels of 
arousal produced by either pleasant or unpleasant stimulation 
or experiences have negative effects on performance and be- 
havior. Anomic behavior in urban environments is attributed 
to high stimulation levels due to environmental conditions 
such as excessive noise or crowding (Cohen & Spacapan, 
1984). On the other hand, understimulation may occur in cer- 
tain environments such as the Arctic that cause unease and 
depression (Suedfeld & Steel, 2000). 


The Environmental Load or Overstimulation Approach 


According to this model people have a limited capacity to 
process incoming stimuli, and overload occurs when the in- 
coming stimuli exceed the individual’s capacity to process 
them. Individuals deal with an overloaded situation by concen- 
trating their attention on the most important aspects of a task or 
by focusing on a fixed goal, ignoring peripheral stimulation in 
order to avoid distraction. Paying attention to a particular task 
in an overloaded situation is very demanding and produces 
fatigue (Kaplan & Kaplan, 1989). Typical aftereffects of being 
exposed to an overload situation are, according to the over- 
load model, less tolerance to frustration, less attention, and 
reduced capacity to react in an adaptive way. Milgram (1970) 
attributed the deterioration of social life in cities to the wide va- 
riety of demands on citizens causing a reduced capacity to pay 
attention to others. The overload approach explains why cer- 
tain environmental conditions lead to undesirable behavioral 
consequences such as aggression, lack of helping behavior, 
and selfishness in urban environments. 


Adaptation Level Theory 


Adaptation level theory (Wohlwill, 1974) is in certain ways a 
logical extension of arousal theory and the overload 
approach. It assumes that there is an intermediate level of 
stimulation that is individually optimal. Three categories 
of stimulation can be distinguished: sensory stimulation, 
social stimulation, and movement. These categories can be 
described along three dimensions of stimulation: intensity, 
diversity, and patterning (i.e., the structure and degree of 


uncertainty of the stimulation). In ideal circumstances a stim- 
ulus has to be of average intensity and reasonably diverse, 
and it must be structured with a reasonable degree of uncer- 
tainty. The level of stimulation at which an individual feels 
comfortable depends on his or her past experience, or, more 
precisely, on the environmental conditions under which he or 
she has grown up. This reference level is nevertheless subject 
to adaptation when individuals change their life environ- 
ments. If rural people can be very unsettled by urban envi- 
ronments, they may also adapt to this new situation after a 
certain period of residence. Adaptation level theory postu- 
lates an active and dynamic relation of the individual with his 
or her environment. 


Interactionist Approaches 


Analyses of the individual’s exposure to environmental stres- 
sors in terms of control and of behavioral elasticity, on one 
hand, and environmental cognition (cognitive mapping, envi- 
ronmental evaluations, etc.), on the other hand, refer typi- 
cally to an interactionist rationale of individual-environment 
relations. 


Stress and Control 


Some authors (Proshansky et al., 1970; Stokols, 1978; 
Zlutnick & Altman, 1972) consider certain environmental 
conditions to be constraining to the individual. Similarly, oth- 
ers (Baum, Singer, & Baum, 1981; Evans & Cohen, 1987; 
Lazarus & Folkman, 1984) describe such situations as being 
stressful. Both approaches lead to conditions as being poten- 
tially constraining or stressful and introduce the concept 
of control. Individuals exposed to such situations engage in 
coping processes. Coping is an attempt to reestablish or gain 
control over the situation identified as stressing or constrain- 
ing. According to the psychological stress model, environ- 
mental conditions such as noise, crowding, or daily hassles 
provoke physiological, emotional, and behavioral reac- 
tions identified as stress (Lazarus, 1966). Three types of 
stressors can be distinguished: cataclysmic events (e.g., vol- 
canic eruptions, floods, earthquakes), personal life events 
(e.g., illness, death, family or work problems), and back- 
ground conditions (e.g., transportation difficulties, access to 
services, noise, crowding). Such conditions are potentially 
stressful according to their nature provided that the individ- 
ual identifies them as such (Cohen, Evans, Stokols, & Krantz, 
1986). 

An environment is constraining when something is limit- 
ing or prevents individuals from achieving their intentions. 
This may occur with environmental conditions or stressors 
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like noise or crowding, but also with specific environmental 
features like fences, barriers, or bad weather. The constraining 
situation is interpreted by the individual as being out of his or 
her control. The feeling of not being able to master the situa- 
tion produces psychological reactance (Brehm, 1966). Un- 
pleasant feelings of being constrained lead the individual to 
attempt to recover his or her freedom of action in controlling 
the situation. Having freedom of action or controlling one’s 
environment seems to be an important aspect of everyday life 
and individuals’ well-being. When people perceive control in 
a noisy situation, their performance is improved (Glass & 
Singer, 1972); they are less aggressive (Donnerstein & 
Wilson, 1976; Moser & Lévy-Leboyer, 1985); and they are 
more often helpful (Sherrod & Dowes, 1974). On the contrary, 
the perception of loss of control produced by a stressful situa- 
tion or constraints has several negative consequences on be- 
havior (Barnes, 1981) as well as on well-being and health. 

Confronted with a potentially stressful condition, the indi- 
vidual appraises the situation. Appraisals involve both as- 
sessing the situation (primary appraisal) and evaluating the 
possibilities of coping with it (secondary appraisal). The 
identification of a situation as being stressful depends on 
cognitive appraisal. Cognitive appraisal of a situation as 
being potentially disturbing or threatening or even harmful 
involves an interaction between the objective characteristics 
of the situation as well as the individual’s interpretation of 
the situation in light of past experience. The secondary 
appraisal leads to considering the situation as challenging 
with reference to a coping strategy. Coping strategies de- 
pend on individual and situational factors. They consist of 
problem-focused, direct action such as fleeing the situation, 
trying to stop, removing or reducing the identified stressor, or 
reacting with a cognitive or emotional focus such as reevalu- 
ating the threatening aspects of the situation. Reaction to a 
stressful situation may lead the individual to concentrate on 
the task, focus on the goals, or ignore or even deny the dis- 
tracting stimuli. Repeated or steady exposure to stressors 
may result in adaptation and therefore weaker reactions to 
this type of situation. If the threatening character of the situa- 
tion exceeds the coping capacities of the individual, this 
may cause fatigue and a sense of helplessness (Garber & 
Seligman, 1981; Seligman, 1975). 


The Stress-Adaptation Model 


In everyday life the individual is exposed to both background 
stressors and occasionally to excessive environmental stimu- 
lation. Consequently, the individual’s behavior can only be 
appreciated when considered in a context perceived and eval- 
uated by the persons themselves and in reference to baseline 


exposure (Moser, 1992). Any exposure to a constraining or 
disagreeable stimulus invokes a neuro-vegetative reaction. 
Confronted with such stimulation, the individual mobilizes 
cognitive strategies and evaluates the aversive situation with 
reference to her or his threshold of individual and situational 
tolerance, as well as the context in which exposure occurs. 
This evaluation creates a stimulation level that is judged 
against a personal norm of exposure. In response the individ- 
ual judges the stimulus as being weak, average and tolerable, 
or strong. Cognitive processes intervene to permit the indi- 
vidual to engage in adaptive behavior to control the situation. 
A situation in which the constraints are too high or in which 
stimulation is excessive produces increased physiological 
arousal, thereby preventing any cognitive intervention and 
therefore also control of the situation. 


Behavioral Elasticity 


This model introduces the temporal dimension of exposure to 
environmental conditions and refers to individual norms of 
exposure (Moser, in press). The influence of stressors is well 
documented, but the findings are rarely analyzed in terms of 
adaptation to long-term or before-after comparisons. Yet one 
can assume that where there are no constraining factors, indi- 
viduals will revert to their own set of norms, which are elab- 
orated through their history of exposure. The principle of 
elasticity provides a good illustration of individual behavior 
in the context of environmental conditions. Using the princi- 
ple of elasticity from solids mechanics to characterize the 
adaptive capacities of individuals exposed to environmental 
constraints, three essential behavioral specificities as a con- 
sequence of changing environmental contingencies can be 
distinguished: (a) a return to an earlier state (a point of refer- 
ence) in which constraints were not present, (b) the ability to 
adapt to a state of constraint as long as the constraint is per- 
manent, and (c) the existence of limits on one’s flexibility. 
The latter becomes manifest through reduced flexibility in 
the face of increased constraints, the existence of a breaking 
point (when the constraints are too great), and the progressive 
reduction of elasticity as a function of both continuous con- 
straints and of aging. 


Returning to an Earlier Baseline. While attention is 
mostly given to attitude change and modifying behavior in 
particular situations, the stability over time of these behaviors 
is rarely analyzed. Yet longitudinal research often shows that 
proenvironmental behavior re-sorts to the initial state before 
the constraints were encountered. This has been shown, for 
instance, in the context of encouraging people to sort their 
domestic waste (Moser & Matheau, in press) or in levels of 
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concern about global environmental issues (Uzzell, 2000b). 
Exposure to constraints creates a disequilibrium, and the in- 
dividual, having a tendency to reincorporate initial behavior, 
reverts to the earlier state of equilibrium. 


Adaptation: The Ability to Put Up With a Constraining 
Situation in so Far as It Is Continuous. Observing behav- 
ior in the urban environment provides evidence of the con- 
straining conditions of the urban context. Residents of large 
cities walk faster in the street and demonstrate greater with- 
drawal than do those living in small towns: They look 
straight ahead, only rarely maintain eye contact with others, 
and respond less frequently to the various requests for help 
from other people. In other words, faced with an overstimu- 
lating urban environment, people use a filtering process by 
which they focus their attention on those requests that they 
evaluate as important, disregarding peripheral stimulation. 
The constant expression of this type of adaptive behavior 
suggests that it has become normative. The walking speed of 
inhabitants of small towns is slower that the walking speed of 
inhabitants in large cities (Bornstein, 1979). So we can assert 
that such behavior provides evidence of the individual’s ca- 
pacity to respond to particular environmentally constraining 
conditions. 


The Extent and Limits of Flexibility. The limits of flexi- 
bility and, more particularly, the breakdown following con- 
straints that are too great are best seen in aggressive behavior. 
The distinction between instrumental and hostile aggression 
(Feshbach, 1964) recalls the distinction between adaptive be- 
havior aimed at effectively confronting a threat and a reactive 
and impulsive behavior ineffectual for adaptation. Three lim- 
its of flexibility can be identified. First is reduced flexibility in 
the face of increased constraints. When exposure to accus- 
tomed constraints is relatively high, there is a lower proba- 
bility of performing an adaptive response, and therefore an 
increase in reactive behaviors. There is decreased flexibility 
in the face of constraint, more so if the constraint is added 
onto already-existing constraints affecting the individual. 
This is most clearly evident in aggressive behaviors (Moser, 
1984). People react more strongly to the same stimulation in 
the urban environment than in small towns. Hostile aggres- 
sion thus becomes more frequent. This results in a decrease in 
adaptive capacities and therefore of flexibility if additional 
constraints are grafted onto those already present. The second 
limit is the existence of a breaking point when the constraints 
are too great: Intervention by cognitive processes is pre- 
vented if stimulation produces a neuro-vegetative reaction 
that is too extreme (Moser, 1992; Zillmann, 1978). This is 
most evident with violent or hostile aggressive behavior. This 


involves nonadaptive reactive behavior that is clearly of a 
different order. As a consequence, breakdown and a limit on 
flexibility result. Contrary to what occurs when there is elas- 
ticity, however, this breakdown fortunately occurs only occa- 
sionally and on an ad hoc basis. The third limit is the 
progressive loss of elasticity as a function of the persistence 
of exposure to constraints: This has been examined under 
laboratory conditions in the form of postexposure effects. 
Outside the laboratory, the constant mobilization of coping 
processes, for example, for those living near airports pro- 
duces fatigue and lowers the capacity to face new stressful 
situations (Altman, 1975). One encounters, in particular, 
greater vulnerability and irritability as well as a significant 
decrease in the ability to resist stressful events. These effects 
demonstrate that there is a decreased tolerance threshold, and 
so a decreased flexibility following prolonged exposure to 
different environmental constraints. 

The elasticity model is an appropriate framework to illus- 
trate the mechanisms and limits of behavioral plasticity. It 
may perhaps stimulate the generation of a model of behav- 
ioral adjustments by placing an emphasis on the temporal di- 
mension and the cognitive processes governing behavior. 
Environmental cognition, cognitive mapping, and environ- 
mental appraisals are likely to fall within an interactionist 
framework. While they can be individualistic, they are in- 
variably set within a social context. Environmental cognition 
would be enriched by more research in terms of social repre- 
sentations (Moscovici, 1989) providing the opportunity to 
emphasize the role of cultural values, aspirations, and needs 
as a frame of reference for environmental behavior. 


Cognitive Mapping 


How do we form maps of the environment in our heads and 
use them to navigate through the environment? Cognition 
and memory of places produce mental images of our envi- 
ronment. The individual has an organized mental representa- 
tion of his or her environment (e.g., neighborhood, district, 
city, specific places), which environmental psychologist call 
cognitive maps. Cities need to be legible so that people can 
“read” and navigate them. The study of cognitive maps has its 
origin in the work of Tolman (1948), who studied the way in 
which rats find their ways in mazes. Lynch (1960), an urban 
planner, introduced the topic and a methodology to study the 
ways in which people perceive the urban environment. Lynch 
established a simple but effective method to collect and ana- 
lyze mental maps. He suggested that people categorize the 
city according to five key elements: paths (e.g., streets, 
lanes), edges (e.g., spatial limits such as rivers and rail 
tracks), districts (e.g., larger spatial areas or neighborhoods 
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that have specific characteristics and are typically named, 
such as Soho), nodes (e.g., intersections, plazas), and land- 
marks (e.g., reference points for the majority of people). 

Furthermore, one can distinguish sequential representa- 
tions Oe, elements that the individual encounters when trav- 
eling from one point of the city to another, rich in paths and 
nodes) and spatial representations emphasizing landmarks 
and districts (Appleyard, 1970). Cognitive maps will vary, 
for example, as a function of familiarity with the city and 
stage in the life cycle. Such maps can be used to characterize 
either an individual’s specific environment interests or pref- 
erences (Milgram & Jodelet, 1976) or the qualities and legi- 
bility of a particular environment (Gärling & Evans, 1991; 
Kitchen, Blades, & Golledge, 1997). Way finding is a com- 
plex process involving a variety of cognitive operations such 
as localization of the target and choosing the route and the 
type of transportation to reach the goal (Gärling, Böök, & 
Lindberg, 1986). Sketch maps often carry typical errors that 
point to the cognitive elaboration of the individual’s environ- 
mental representation: nonexhaustive, spatial distortions (too 
close, too apart), simplification of paths and spaces, and over- 
estimation of the size of familiar places. 


Environmental Evaluations 


What factors are important in people’s evaluation of the built 
and natural environment, and how satisfied are they with dif- 
ferent environments and environmental conditions? Some 
environmental evaluations, called the place-centered method, 
focus on the objective physical properties of the environment 
such as pollution levels or the amount of urban development 
over the previous 10 years. The aim is to measure the quali- 
ties of an environment by experts or by actual or potential 
users. Such evaluations are done without taking into account 
the referential framework of the evaluator (i.e. the values, 
preferences, or significations attached to the place). These 
kinds of appraisals are important, but when it is remembered 
that what may be an environmental problem for one person 
may be of no consequence to another, it is clear that environ- 
mental assessment has an important subjective dimension as 
well. This person-centered method focuses on the feelings, 
subjective appreciation of, and satisfaction with a particular 
environment (Craik & Zube, 1976; Russell & Lanius, 1984). 

Some environmental appraisals take the form of contrast- 
ing social categories such as architects versus the public 
(Groat, 1994; Hubbard, 1994) or scientists versus laypeople 
(Mertz, Slovic, & Purchase, 1998) or of categorizing people 
who hold particular attitudes (e.g., pro- vs. anticonservation; 
Nord, Luloff, & Bridger, 1998). The focus of attention is on 
the role the individual occupies or the attitudes held and the 


consequent effect that this has on environmental attitudes and 
behavior. 

Evaluations can be carried out either in the environment 
that is being evaluated or through simulations. Horswill and 
McKenna (1999) developed a video-based technique for 
measuring drivers’ speed choice, and their technique has the 
advantage of maintaining experimental control and ensuring 
external and ecological validity. They found that speed 
choice during video simulation related highly to real driving 
experiences. Research consistently confirms color pho- 
tographs as a valid measure of on-site response, especially 
for visual issues (Bateson & Hui, 1992; Brown, Daniel, 
Richards, & King, 1988; Nasar & Hong, 1999; Stamps, 
1990). Stamps (1990) conducted a meta-analysis of research 
that had previously used simulated environments to measure 
perceptions of real versus photographed environments (e.g., 
presented as slides, color prints, and black-and-white prints). 
He demonstrated that there is highly significant correlation 
between evaluations of real and simulated (photographed) 
environments. The advent of digital imaging means that it is 
now possible to manipulate photographs so that environ- 
ments can be changed in a systematic and highly convincing 
way in order to assess public preferences and reactions. The 
photographs in Figure 17.1 were manipulated with the inten- 
tion of assessing the impact of different traffic calming mea- 
sures on drivers’ estimates of speed (Uzzell & Leach, 2001). 

The research demonstrated that drivers clearly were able 
to discriminate between the different conditions presented in 
manipulated photographs. When estimated speeds were cor- 
related against actual speeds along the road as it exists at pre- 
sent, this suggested which design solutions would lead to an 
increase or decrease in speeding behavior. 


Transactional Approaches 


Three approaches are discussed here as examples of transac- 
tional approaches in environmental psychology: Barker’s 
behavior setting approach; affordances; and place theory, 
identity, and attachment. 


Barker’s Behavior Settings 


Barker’s behavior settings approach has both a theoretical 
and methodological importance because it provides a frame- 
work for analyzing the logic of behavior in particular set- 
tings. Barker (1968, 1990) considered the environment as a 
place where prescribed patterns of behavior, called programs, 
occur. There is a correspondence between the nature of the 
physical milieu and a determined number and type of collec- 
tive behavior taking place in it. According to ecological 
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Figure 17.1 Digitally manipulated photographs used to assess the impact of alternative traffic-calming 
measures on drivers’ estimates of speed. 


psychology, knowing the setting will provide information 
about the number of programs (i.e., behaviors) in it. Such 
programs are recurrent activities, regularly performed by per- 
sons holding specific roles. A church, for instance, induces 
behaviors like explaining, listening, praying, singing, and so 
on, but each type of activity is performed by persons endors- 
ing specific roles. According to his or her role, the priest is a 
performer and the congregation members are nonperformers. 
This setting also has a layout and particular furniture that fits 
that purpose and fixes the program (i.e., what type of behav- 
ior should happen in it). The so-called behavior setting (i.e., 
the physical place and the behaviors) determines what type of 
behavior is appropriate and therefore can or should occur. 
Patterns of behavior (e.g., worshipping) as well as settings 
(e.g., churches) are nevertheless independent: A religious of- 
fice can be held in the open air, and the church can be used for 
a concert. It is their role-environment structure or synomor- 
phology that create the behavior setting. Barker’s analysis 
supposes an interdependency between collective patterns of 
behavior, the program, and the physical space or milieu in 
which these behaviors take place. Behaviors are supposed to 
be unique in the specific setting and dependent on the setting 
in which they occur. Settings are delimited places such as 
within walls, fences, or symbolic barriers. They can be 


identified and described. Barriers between settings also de- 
limit programs. Knowing about the setting (e.g., its purpose 
or intention) infers the typical behaviors of the people in that 
setting. Barker’s conceptualization permits an understanding 
of environment-behavior relationships such that space might 
be organized in a certain way in order to meet its various pur- 
poses. Behavior settings are dynamic structures that evolve 
over time (Wicker, 1979, 1987). 

Staffing (formerly manning) theory completes Barker’s 
approach by proposing a set of concepts related to the num- 
ber of people that the behavior setting needs in order to be 
functional (Barker, 1960; Wicker & Kirkmeyer, 1976). Be- 
sides key concepts like performers who carry out the primary 
tasks and the nonperformers who observe, the minimum 
number of people needed to maintain the functioning of a be- 
havior setting is called the maintenance minimum, and the 
maximum is called its capacity. Applicants are people seek- 
ing to become part of the behavior setting. Overstaffing or 
understaffing is a consequence of too few or too many appli- 
cants for a behavior setting. The consequence of under- 
staffing is that people have to work harder and must endorse 
a greater range of different roles in order to maintain the 
functioning of the setting. They will also feel more commit- 
ted to the group and endorse more important roles. On the 
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other hand, overstaffing requires the fulfillment of adaptive 
measures to maintain the functioning, such as increasing 
the size of the setting. 

Behavior settings and staffing theory are helpful tools to 
solve environmental design problems and to improve the 
functioning of environments. Barker’s approach has been 
applied successfully to the analyses of work environments, 
schools, and small towns. It helps to document community 
life and enables the evaluation of the structure of organiza- 
tions in terms of efficiency and responsibility. 


Affordance Theory 


Gibson (1979) argued that, contrary to the orthodox view 
held in the design professions, people do not see form and 
shape when perceiving a place. Rather, the environment can 
be seen as offering a set of affordances; that is, the environ- 
ment is assessed in terms of what it can do for us. The design 
professions are typically taught that the building blocks of 
perception comprise shape, color, and form. This stems from 
the view that architecture and landscape architecture are 
often taught as visual arts rather than as ways of providing 
functional space in which people can work, live, and engage 
in recreation. Gibson argues that “the affordances of the en- 
vironment are what it offers the animal, what it provides or 
furnishes either for good or ill” (p. 127). Affordances are eco- 
logical resources from a functional point of view. They are an 
objectively specifiable and psychologically meaningful tax- 
onomy of the environment. The environment offers opportu- 
nities for use and manipulation. How we use the environment 
as children, parents, or senior citizens will vary depending on 
our needs and interests, values, and aspirations. 

This perspective suggests that the degree to which built or 
natural environments are utilized changes as people’s roles, 
relationships, and activities in the environment change. 
Therefore, the environment can be seen to have a develop- 
mental dimension to it. As people develop their cognitive, 
affective, and behavioral capacities, the resources that the 
environment offers change. Furthermore, the environment 
can be designed to facilitate, support, and encourage this. 
Heft (1988) argued that utilizing Gibson’s theory of affor- 
dances allows us to describe environmental features in terms 
of their functional significance for an individual or group. He 
postulated that to arrive at a functional description of an 
environment, one requires three sorts of information: the 
characteristics of the person, the characteristics of the envi- 
ronment, and the behavior of the individual in question. Heft 
(1988) was interested in children’s environment-behavior 
interactions, with the aim of creating a taxonomy that would 
describe the functionally significant properties of children’s 


environments. Based on his analysis of three significant books 
on children’s use of their environment (Barker & Wright, 
1951; Hart, 1979; R. Moore, 1986), Heft created a functional 
taxonomy of children’s outdoor environments in terms of the 
environmental features and activities that they afford the 
child. The 10 environmental features were flat, relatively 
smooth surface; relatively smooth slope; graspable/detached 
object; attached object; nonrigid, attached object; climbable 
feature; aperture; shelter; moldable materials; and water. 

Heft also pointed out that as there is a developmental as- 
pect to the taxonomy, the value of the environment will 
change for the developing child. As children move from pre- 
teenagers through to adolescence, so the affordances of dif- 
ferent types of environments change in response to their need 
for social interaction and privacy (Woolley, Spencer, Dunn, & 
Rowley, 1999). Clark and Uzzell (in press) found that the use 
of the neighborhood for interaction decreased with age 
and that by the time the young people had reached 11 years 
old the number of affordances was significantly lower than 
for those aged 7 years old. There was no decrease in the use 
of the neighborhood for retreat. Therefore, the neighborhood 
retains its importance for retreat behaviors. 

Exemplifying the assertion by Bonaiuto and Bonnes 
(1996) that the experience of small- and large-city living 
is notably different, Kyttä (1995) examined children’s activi- 
ties in the city, in a small town, and in a rural area in Finland. 
Using the affordance approach but including categories on 
social affordances and nature, Kyttä found that the number of 
positive affordances was highest in the rural area and lowest 
in the city. However, when the quality of affordances was an- 
alyzed, there were no differences between the areas for 8 out 
of the 11 affordance categories. The attitudes of parents play 
a significant role in how children perceive affordances. Chil- 
dren with a limited autonomy over their spatial range, due to 
parental restrictions through fears about safety, see little of 
the environment and therefore of its affordances. 


Theories of Place, Place Identity, and Place Attachment 


One of the earliest theories of place was proposed by Canter 
(1977), whose conceptual, as opposed to behavioral, model 
proposed that the cognitive system contains information 
about where places are, what is likely to happen there, and 
who is likely to be present. Canter defined place as a unit of 
environmental experience and postulated that the unit of 
place was the result of the relationships between actions 
(i.e., behavior is associated or anticipated), conceptions, and 
physical attributes. 

A second influential theory of place is the transactional 
theory of Stokols and Shumaker (1981), who defined place 


as the entity between aspects of meaning, physical proper- 
ties, and relative activity. This is not so dissimilar from Can- 
ter’s notions of actions, conceptions, and physical properties. 
Stokols and Shumaker emphasize the collective perceptions 
of place and propose that a place has a social imageability. 
This imageability is the collectively held social mean- 
ings that the place has among its occupants or users. Within 
social psychology these would be called social representa- 
tions (Farr & Moscovici, 1984; Moscovici, 1989). Stokols 
proposed that three dimensions contribute to a group’s social 
imageability of place: functions, goals, and evaluations. Func- 
tions are individual or group activities that occur within the 
place regularly and include the norms associated with the ac- 
tivities and the identity and social roles of the occupants/users 
of the place; goals can be either personal or collective and 
relate to the purpose of the place; evaluations include the 
occupants, physical features, and social functions associated 
with the place. 

Thus, Stokols and Shumaker concluded that the perceived 
social imageability of a place is the result of the functional, 
motivational, and evaluative meanings conveyed by the envi- 
ronment. Stokols places particular emphasis on the functional 
dimension of place and the need to explore the affective and 
motivational processes in the relationship between people 
and place. As Bonnes and Secchiaroli (1995) pointed out, to 
live in an environment does not mean structuring experiences 
only with respect to its physical reality. Places carry a role in 
the fulfillment of biological, cultural, psychological, and so- 
cial needs of the person in the many situations that they will 
face over their lifetimes. 

One such role is their contribution to personal and group 
identity. Place has been related to identity in two ways. The 
first could be referred to as place identification. This refers to 
a person’s expressed identification with a place. For example, 
a person from London may refer to himself as a Londoner. In 
this sense, "Londoner" can be considered to be a social cate- 
gory that is subject to the same rules as is a social identifica- 
tion within social identity theory. Hogg and Abrams (1988) 
suggested that social identity comprises different social iden- 
tifications, any one of which will become salient depending 
on the context. Taking this position suggests that the concept 
of place identity is subsumed into and becomes a part of 
social identity. 

The second way in which place has been related to iden- 
tity is through the term place identity, a construct promoted 
by Proshansky, Fabian, and Kaminoff (1983; Proshansky, 
1987) that calls for a more radical reevaluation of the con- 
struct of identity. Proshansky et al. (1983) proposed that 
place identity is another aspect of identity comparable to so- 
cial identity that describes the person’s socialization with the 
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physical world. This understanding sets place identity along- 
side and independent of self-identity, rather than subsumed 
within it. 

Although it may be possible to discuss the relationship 
between the physical environment and identity without refer- 
ence to a group, to have two forms of identity would focus 
discussion on whether identity was more “social” or more 
“place.” This would not seem to be useful in explanatory 
terms. In addition, it contradicts environmental psycholo- 
gists’ transactional perspective on place (Saegert & Winkel, 
1990). Although we agree with Proshansky that self theorists 
have neglected the physical environment, we would suggest 
that rather than there being a separate part of identity con- 
cerned with place, all aspects of identity will, to a greater or 
lesser extent, have place-related implications. Although place 
identity is seen to be a crucial part of the relationship between 
self and environment, Proshansky never really operational- 
ized the concept. Breakwell’s (1986) identity process model, 
with its constructs of distinctiveness, continuity, self-esteem, 
and self-efficacy, provides such an investigatory and analyti- 
cal framework. Although these constructs have a particularly 
social orientation in Breakwell’s formulation, they neverthe- 
less would seem to have useful transfer relevance to other di- 
mensions of identity, including place (Bonaiuto, Breakwell, & 
Cano, 1996; Uzzell, 1995). For example, distinctiveness and 
continuity are essential elements in Korpela (1989) and Lalli’s 
(1992) conceptualizations of place identity. 

One important mechanism through which place identity is 
supported is place attachment. Spencer and Woolley (2000), 
for example, argued that children gain their personal identity 
through place attachment. Place attachment refers to an emo- 
tional bonding between individuals and their life spaces, 
which could be the home, the neighborhood, or places and 
spaces at a larger scale (Altman & Low, 1992; Giuliani, 1991; 
Giuliani & Feldman, 1993). 


TIME, SPACE, AND THE FUTURE OF 
ENVIRONMENTAL PSYCHOLOGY 


Needs and Rights in Environmental Psychology 


The emphasis of much environmental psychology has been 
on identifying and then assisting in the process of providing 
for and satisfying people’s needs. It is assumed within the 
philosophy of Brandt and Bruntland that environmental 
needs should be defined by those in power (i.e., the West), not 
by the people whose needs are supposedly being satisfied. 
This form of donor benevolence as a strategy for tackling 
environmental deficits operates at the local, national, and 
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international level. Thus, it is argued, we need to prevent pol- 
lution and conserve the rainforests, wildlife, energy, and 
water supplies. The West finds it difficult to understand why 
those experiencing environmental degradation—but also suf- 
fering poverty, malnutrition, poor housing, unemployment, 
and high mortality rates—have different priorities. The 
needs-based approach is often carried through to be an as- 
sumption that guides environmental psychology research. 

An alternative approach focuses on environmental rights 
in which those without power define their needs themselves 
and try to secure the rightful access to resources to satisfy 
those needs. There is a difficulty with trying to integrate a 
bottom-up rights approach with a top-down needs-driven 
approach because one is faced with the problem of who sets 
the agenda. Groups will have difficulties asserting their rights 
when the allocation processes and agendas are structured by 
others. A rights approach does not mean that neither help nor 
resources are required or given. Clearly it is essential that the 
haves of the world continue to provide for the have nots—but 
within a context of participation, self-determination, trans- 
parency in decision-making, and accountability by all con- 
cerned. The essential factor is that the starting point for 
discussing the allocation of resources is different. 

Long-term change and development will come about only 
through informed community action, rather than a depen- 
dency relationship on experts and technological-fix solutions. 
The development of environmental consciousness and capac- 
ities without the simultaneous development of opportunities 
for action leads to a feeling of powerlessness (Uzzell, 1999). 
For this reason cooperation between all agencies and institu- 
tions is necessary in order to secure action opportunities. Psy- 
chologists in general and environmental psychologists in 
particular have the expertise and experience to play an impor- 
tant role in this process. It is here that we can see the value of 
research in suggesting prescriptive roles and functions for an 
environmental psychology that should be taken seriously by 
policy makers and practitioners alike. Some have suggested 
that the implementation of sustainable development through, 
for example, Local Agenda 21 initiatives will be possible only 
with local community consensus (Robinson, 1997). Petts 
(1995) argued that traditional participatory approaches have 
been reactive in that the public is expected simply to respond 
to previously formulated plans. The trend now is for proac- 
tive, consensus-building approaches that attempt to involve 
people in the decision-making process itself. 


Cultural Differences and Temporal Processes 


Environmental psychology, like other areas of psychology, 
has focused almost exclusively on topics, theories, and 


methodologies that have been oriented toward Western 
assumptions and worldviews. Two topics seem to have been 
neglected in environmental psychology as they have in other 
areas of psychology: cultural differences and temporal 
processes. Both approaches are even more important at the 
beginning of the twenty-first century because on the one hand 
the processes of globalization have the effect of destroying 
cultural differences, and on the other hand, sustainable de- 
velopment is seen as a way of ensuring the long-term in- 
tegrity of biocultural systems. 

By defining sustainable development as “development 
that meets the need of the present without compromising 
the ability of future generations to meet their own needs,” 
Bruntland (1987) opened the way to concerns related to qual- 
ity of life. The reference to needs allows not only the require- 
ment that development be harmonious toward and respectful 
of the environment, but equally for the recognition of the 
individual’s own well-being. Of course, the issue just posed 
requires us to consider whether we should be thinking in 
terms of needs or rights, and, indeed, whose needs and whose 
rights. 

Globalization and its corollary, global trade and com- 
munications, create pressure toward cultural uniformity in 
lifestyles. The progressive deployment of globalization has 
brought on, with reason, fear of a standardization of values 
and increased anonymity threatening both individual and 
group identity. It gives rise to movements demanding recog- 
nition of local, regional, and national priorities and cultural 
differences and therefore also specific needs. This search 
for identity finds its expression spatially. Furthermore, the 
increase in regional, national, and international forced or vol- 
untary mobility (e.g., political refugees and asylum seekers, 
economic migration of job-seeking populations, and execu- 
tives dislocated by their companies) exacerbates confronta- 
tions between cultures with different needs, values, and 
customs. Globalization provides the impetus to situate envi- 
ronmental psychology in a more globally—and, at the same 
time, culturally—relative framework. The traditional con- 
cepts of local community, environmental appropriation, and 
identity take on new meanings in the context of sustainable 
development and globalization. 


The Cultural Dimension 


Quality of life standards are culturally determined. Needs 
concerning personal space, social life in the neighborhood, 
and urban experience are different from one culture to an- 
other. Furthermore, acting in sustainable ways depends on 
culturally marked values concerning the environment. From 
a globalization perspective, how universal is the need for 


personal space and privacy? Are they the same everywhere? 
Research in environmental psychology has taught us that, for 
instance, spatial needs vary both from one culture to another 
and also on one’s stage in the life cycle (Altman, 1975; 
Sundstrom, 1978). Some studies, such as Nasar and Min 
(1984), show that people living in the Mediterranean region 
and in Asia react very differently to confined spatial arrange- 
ments. But many such studies are conducted in a culturally 
homogeneous environment and therefore allow only for con- 
clusions concerning interpersonal differences related to the 
cultural origins of the research participants (see, e.g., Loo & 
Ong, 1984). We need more longitudinal research and inter- 
cultural studies such as those undertaken to study reactions to 
density and spatial needs. 

The norms, needs, and strategies for adapting to condi- 
tions very different from our own are likely to provide us 
with insights on the dynamics of how people relate to the 
physical and social dimensions of both their and our environ- 
ments. Such studies should be able to answer these questions 
more systematically. Privacy may signify and represent very 
different conditions not only at the individual level, but also 
between different cultures (Altman & Chemers, 1980). Indi- 
vidual versus collective housing preferences, as well as 
the use of different facilities inside and around the dwelling, 
are all culturally defined. While individual dwelling units ap- 
pear as an ideal in Anglo-Saxon cultural settings, in some 
Latin American societies there is a stronger preference for 
collective housing units, particularly in Brazil, mainly for 
reasons of increased security. More systematic research in 
this area should be able to provide guidelines for architects 
and designers, allowing them to take account of culturally 
dependent needs beyond the simplistic notions of conception 
and layout (e.g., kitchens clearly separated from dining 
rooms). Kent (1991) proposed a classification of different 
cultural groups according to their use of domestic space. 
Such a distinction is particularly relevant to the functional 
segmentation of spatial arrangements. Kent noted that occu- 
pants remodel their domestic environment to fit their own 
cultural imperatives if they find themselves in an environ- 
ment that fails to correspond to their own cultural standards. 
Well-being has different meanings in different cultures, and 
instead of imposing Western standards, environmental psy- 
chology should contribute more to identifying culturally spe- 
cific standards to enable the construction of modular spaces 
to satisfy diversified needs. This becomes more important 
than ever in the context of an increasingly mobile (forced or 
voluntary) society. 

At the neighborhood level, well-being depends on how the 
immediate environment is able to satisfy the specific needs of 
culturally different people, thereby providing opportunities 
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for appropriation. Currently there is a preference for homo- 
genization of populations within neighborhoods. Arguably, 
however, such a strategy may pose more risks for the future 
than encouraging a process of heterogeneity in terms of 
the impact on how we perceive others and how we perceive 
space occupied by foreigners. These are classic lessons to be 
learned from social psychology (Tajfel, 1982). Neighbor- 
hoods not directly controlled or appropriated by the individual 
can lead to antagonism between culturally different commu- 
nities. More sociocultural research on living in areas with het- 
erogeneous populations and transcultural relations should be 
undertaken in order to identify barriers to integration. 

Environmental psychology has repeatedly pointed to the 
negative consequences of living conditions in large urban 
centers: anonymity, insecurity, indifference to others, and ex- 
posure to various types of stress (Moser, 1992). This presents 
a rather dark portrait of urban living conditions. An environ- 
mental psychology has emerged that has deprecated urban 
centers and lauded the virtues of supposedly more attrac- 
tive suburban residential environments (Lindberg, Hartig, 
Garvill, & Gärling, 1992). Taking the Anglo-Saxon single- 
family house as its model (Cooper, 1972; Thorne, Hall, & 
Munro-Clark, 1982), this approach has failed to account for 
what is happening in cities such as Paris where the city cen- 
ter is invariably highly valued as a thriving, attractive, and 
lively residential as well as commercial and cultural environ- 
ment. Two thirds of those living in the Paris region indicate 
that they would prefer to live within Paris proper, whereas 
one fifth would prefer to live in a small provincial town and 
only 15% show a preference for the Parisian suburbs (Moser, 
Ratiu, & Fleury-Bahi, 2002). Such results are in direct con- 
trast to those found in the United States. The American expe- 
rience cannot be taken as the norm; unfortunately, this is 
often the case in environmental psychology and other 
branches of psychology. These differences go beyond merely 
the characteristics of urban and suburban environments and 
raise questions concerning the aspirations and needs of city 
dwellers and the processes that are generating the transfor- 
mation of cities. Inhabitants of large cities are increasingly 
culturally diverse; as a consequence, so are their needs. How 
do cities manage the influx of foreign populations, some of 
them culturally very different? What are the conditions of ter- 
ritorial appropriation of ethnic and cultural minorities, and 
what is the territorial behavior of these populations (e.g., seg- 
regation, assimilation, or integration in respect of the wider 
community)? 

Over the last few years environmental psychologists have 
made tentative steps toward building models of the condi- 
tions necessary for generating behavior favorable to the 
global environment, as a function of both values and human 
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well-being (Vlek, Skolnik, & Gattersleben, 1998). How are 
intercultural differences, particularly with respect to values, 
compatible with proenvironmental benefits for future genera- 
tions? Many studies point to individualistic behavior in the 
face of limited resources (i.e., “the tragedy of the commons”; 
Hardin, 1968; Thompson & Stoutemyer, 1991), which can be 
interpreted in more familiar social psychological terms as a 
social dilemma problem (Van Lange et al., 1998). Other stud- 
ies focus on the different ways of envisaging our relationship 
with the environment, such as the new environmental para- 
digm (Arcury & Christianson, 1990; Dunlap et al., 2000). 
Perception, attitudes, and behavior concerning the environ- 
ment differ from one culture to another to the extent that they 
are modulated by environmental variations, the resources 
available, and the societal context, including values, regula- 
tions, infrastructure, and opportunities for action (Lévy- 
Leboyer, Bonnes, Chase, Ferreira-Marques, & Pawlik, 1996). 
For instance, the different cultural representations of water 
form interpretative filters of the objective conditions and nor- 
mative references orienting individual and collective behav- 
ior (Moser, in press). The resolution of the dilemma between 
individual short-term behavior and collective action that is 
common in these types of problematic situations depends on 
cultural values, accessibility to resources, and the perception 
of these resources. The representation of water is shaped by 
the values attached to water: Affective and aesthetic values 
lead to a dynamic, global-ecological vision, whereas func- 
tional values and spatial and temporal proximity constitute a 
limited representation of the same phenomenon. 


The Temporal Dimension 


There has been a growing interest in recent years in the his- 
toricity of psychological processes (Gergen & Gergen, 1984). 
Too often in psychology, time, like the environment, has been 
treated as noise rather than as a valid process in itself. Even 
in areas that have an integral temporal dimension (e.g., social 
representations), little account is given of either the origins 
or the development of the representations (Herzlich, 1973; 
Moscovici, 1976; Uzzell & Blud, 1993). There are clearly 
difficulties in accessing the past from a psychological point 
of view (Lowenthal, 1985; Uzzell, 1998). Social structures 
and social processes change over time, and this in turn has an 
effect on spatial structures and processes. If psychological 
processes are molded and influenced by their social context, 
then changing social structures and regulatory mechanisms 
will affect those processes and have a consequent effect on 
the individual, the group, and the environment. Although 
environmental psychology often hints at the temporal dimen- 
sion of people-environment relations with the physical and 


social environment, the temporal dimension has in general 
been neglected (Altman & Rogoff, 1987; Proshansky, 1987; 
Werner, Altman, & Brown, 1992). 

First, the temporal dimension intervenes in different ways 
in terms of spatial anchoring and individual well-being. 
Anchoring is always a process that occurs within a time di- 
mension. It reflects the individual’s motivations, social sta- 
tus, family situation, and projects for the future. Well-being 
has to be set within a time reference, within a time horizon 
and the life cycle. 

Second, the temporal dimension intervenes as a reference 
in the individual’s construction of his or her own identity. 
Appropriating one’s place of residence is conditioned by the 
individual’s residential history. A sense of neighborliness in 
the immediate environment can compensate for mediocre liv- 
ing conditions, but such compensation does not occur if the 
person looks back with nostalgia to his or her childhood resi- 
dence (Lévy-Leboyer & Ratiu, 1993; Ratiu & Lévy-Leboyer, 
1993). Furthermore, environmental appropriation revolves 
around forming social and interpersonal relationships that 
depend largely on the duration of the person’s residence. 
Those who make emotional investments in their neighbor- 
hood and develop a sense of well-being tend to be more sat- 
isfied with their interpersonal relations in their neighborhood. 
This takes the form of relationships that go beyond simple 
politeness (Fleury-Bahi, 1997, 1998). On the other hand, the 
lack of free time available to people living in suburbs has an 
impact on residents’ relationships with neighbors (Moser, 
1997). 

Third, how do interindividual differences, and particularly 
gender differences, express themselves in relation to the tem- 
poral dimension in terms of spatial investment and environ- 
mental needs? How are these two variables interrelated? 
What is their impact on our perceptions, needs, and behav- 
iors? The division of time between leisure and nonleisure 
activities (e.g., activities involving imposed time constraints 
and activities) is fundamentally different when we compare 
urban and non-urban settings. Commuting time, due to 
the greater distance between home and work, reduces the free 
time of commuters in large urban areas in an obvious way. 
This has not been systematically considered with respect to 
its impact on the appropriation of space. One might assume 
that people who appropriate their environment and feel at 
home where they live will also care more about the environ- 
ment in general and exhibit more frequent ecologically bene- 
ficial behaviors as predicted in the Cities, Identity, and 
Sustainability model (Pol, Guardia, Valera, Wiesenfeld, & 
Uzzell, 2001; Uzzell, Pol, & Badenes, 2002). 

The cognitive and affective evaluation of the environment 
is contingent on temporal, historical, and cultural factors. 


Analyses of the perception, evaluation, and representations 
of the environment, both built and natural, generally only 
make implicit reference to the cultural and temporal dimen- 
sions. It has been found, for example, that the cognitive 
image of the city of Paris not only develops and is condi- 
tioned by the culture of origin and the sociospatial familiarity 
but also goes through well-defined representational stages 
before becoming more or less stable (Ramadier & Moser, 
1998). 

Increasing population mobility also raises questions con- 
cerning the rhythm of life and its consequential territorial im- 
plications. All places have a life rhythm. For some it may be 
shortlived—a period of high-intensity use either by day, 
week, or season. Many leisure settings fall into this category. 
Others may be 24-7 environments such as shopping malls 
and airports that are open and used every hour of the day, 
every day of the year. What differentiates the rhythm is the 
different types of groups that occupy the spaces for different 
reasons at different times. We know from research on leisure 
and recreation that what makes a recreation place is the social 
meanings ascribed to the recreational setting rather than the 
particularities of the activities undertaken (Cheek, Field, & 
Burdge, 1976). An integral component of this is time. With 
the development of new technologies, the notion of proxim- 
ity takes on new meanings that have not been fully explored 
by environmental psychologists. Finally, the temporal dimen- 
sion resurfaces in the context of the preservation of the envi- 
ronment and natural resources. One of the conditions for 
adopting proenvironmental behaviors is the ability to project 
oneself into the future and to step outside one’s own life cycle 
and act in the interests of future generations. 

Both temporal and cultural dimensions have to be taken 
into account when addressing quality of life issues. Well- 
being depends on the satisfaction of culturally determined 
needs. Environmental anchoring and appropriation leading to 
identity are progressive processes and are essential for indi- 
vidual and group behavior in respect of a sustainable devel- 
opment. The relationship to the environment (at every spatial 
level—home, neighborhood, city, nation, planet) is mediated 
by the individual’s and the group’s sense of control. Each in- 
dividual has a personal history, a representation of the past, 
and an anticipatory representation of the future (Doise, 1976) 
that condition how he or she relates to the environment. This 
means abandoning the atemporal orientation of environmen- 
tal psychology in favor of a more dynamic approach. Analy- 
ses of proenvironmental behavior have demonstrated the 
importance of a temporal horizon, yet few research studies 
explicitly incorporate this dimension. It is only by refocusing 
analysis on the person and the social group and their relation 
with the environment in its spatial, cultural, and temporal 
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dimensions that the discipline will be able to develop its own 
metatheories. It is in this context that the perspectives of sus- 
tainable development and the consequences of globalization 
can give a new impetus to environmental psychology and 
help to generate theories with wider applications. 


CONCLUSION: APPLYING 
ENVIRONMENTAL PSYCHOLOGY 


Gärling and Hartig (2000) suggested that one of the short- 
comings of environmental psychology is that environmental 
psychologists have only been able to provide general princi- 
ples in response to the specific needs of practitioners. In 
short, it is suggested that there is an applications gap. While 
this may be a valid criticism of science in general, its validity 
in relation to environmental psychology should be chal- 
lenged. If there is a gap, is it because environmental psychol- 
ogists have failed to communicate with or convince other 
scientists and practitioners of the value of their work? Or is it 
because environmental psychologists have not delivered the 
kind of answers that practitioners such as architects and de- 
signers have required or were expecting or wanted? Perhaps 
environmental psychologists have been asking the wrong 
questions? Or does environmental psychology suffer from a 
shortage of data? Some might argue that we need better theo- 
retical ways of understanding the data that we have already. It 
may also be that those who have the task of drawing upon and 
implementing the results of environmental psychological and 
other behavioral science research become frustrated at the 
amount of time, financial resources, and effort that go into 
generating marginal increases in the amount of variance 
explained in a set of data. Increasing the amount of variance 
explained from 33% to 35% is important, but we really need 
to be far more imaginative in our theoretical and conceptual 
approaches in order to make serious inroads into the 65% of 
the variance unaccounted for. 

Gifford (2000) argued that we need more challenging, 
bolder theories. Environmental psychology has an important 
role to play in providing conceptual guidelines of how to look 
at and analyze a given setting with reference to its contextual 
framework. As we suggested at the outset, the essence of en- 
vironmental psychology is the context. Context is an insepa- 
rable part of the explanation of people’s transactions with the 
environment. One way of responding to Gifford’s plea for 
bolder theories is to extend our understanding of context. In 
the last section we argued that the cultural and temporal di- 
mension of people-environment relations needs to be incor- 
porated into our analytical framework. There is every reason 
to argue that this should be the new thrust in environmental 
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psychology research because the study of globalization and 
sustainable development—two crucial issues that we have 
identified in this chapter— with their implications for people- 
environment relations will necessitate the incorporation of 
cross-cultural and temporal analyses if we are to find solu- 
tions to the challenges that they pose. 
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friendships, romantic relationships, marriages, and family 
relationships. 


DEFINING QUALITY RELATIONSHIPS 


What constitutes a good, high-quality friendship, dating rela- 
tionship, marriage, or family relationship? What differenti- 
ates a high-quality close relationship from one of lower 
quality? Surprisingly, until quite recently most social and 
even clinical psychologists had not tackled this question. We 
attempt to do so in this chapter. First, though, because rela- 
tionship quality often has been equated with relationship 
stability, with relationship satisfaction, or with the lack of 
conflict in a relationship, we begin with arguments against 
using those relationship characteristics as indexes of the 
overall quality of a relationship. 


Stability Is Not Enough 


Although at first blush equating relationship stability and 
relationship quality seems reasonable, making this general 
assumption is unwise. After all, many stable relationships are 
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characterized by unhappiness. Interdependence theorists pro- 
vide straightforward explanations as to why this is sometimes 
the case (Kelley & Thibaut, 1978; Rusbult, Arriaga, & 
Agnew, 2001; Rusbult & Van Lange, 1996). Satisfaction, 
they point out, is just one determinant of commitment to stay 
in relationships. Other powerful determinants of relationship 
stability can keep people in relationships despite unhappiness 
with that relationship. 

First, the more one has invested in a relationship, the 
less likely one is to leave that relationship (Rusbult, 1983). 
Investments include such things as joint memories, financial 
investments, friends, possessions, and children. Second, the 
poorer one’s alternatives to a relationship, including the 
alternative of being on one’s own, the less likely one is to 
leave the relationship (Kelley & Thibaut, 1978; Rusbult & 
Martz, 1995). A woman might stay in an abusive relationship 
if she perceives her alternatives to be worse, including the 
option of being alone with no job skills and no financial 
resources. Finally, personal and social prescriptives against 
leaving relationships can keep a person within a relationship 
in which satisfaction is low (Cox, Wesler, Rusbult, & Gaines, 
1997). A person may have a quite miserable relationship with 
his or her child yet stay due to very strong personal and soci- 
etal beliefs that one should never abandon one’s child. 


Satisfaction Is Not Enough 


What about satisfaction? Are relationship members’ ratings 
of their own satisfaction with their relationship valid indexes 
of the existence of a good relationship? Such ratings often 
have been used in this way, and we do believe that these are 
better indexes of the existence of a good relationship than is 
relationship stability. Problems remain, however, and interde- 
pendence theorists again provide us with good reasons not to 
accept satisfaction as the sine qua non of a good relationship. 

They point out that satisfaction is only partially deter- 
mined by the rewards and costs associated with our rela- 
tionships. A person’s comparison level for a relationship is 
another important determinant of satisfaction (Kelley & 
Thibaut, 1978). A person’s comparison level for a particular 
relationship is what that person expects (or feels he or she de- 
serves) from that relationship. It is the person’s set of stan- 
dards for the relationship. If a person has had poor-quality 
relationships in the past, a current relationship that objective 
observers judge to be a bad relationship might, to that person, 
seem quite good compared to his or her expectations. In con- 
trast, if a person has had terrific-quality relationships in the 
past, a current relationship that objective observers judge to 
be quite good might seem, to that person, to be quite unsatis- 
factory by comparison. 


Another reason satisfaction ratings are not terrific mea- 
sures of relationship quality is that they are generally collected 
from a single individual or, at best, from each member of a re- 
lationship independently. However, relationship quality is the 
characteristic of a dyad. It is certainly possible for one person 
to report being very satisfied with a relationship and his or her 
partner to report being very unsatisfied with the relationship. 
What would we then say the quality of the relationship was? 
For these reasons we do not believe that member satisfaction 
is the ideal way to judge the quality of a relationship. 


Lack of Conflict Is Not Enough 


Although many researchers have used the absence of conflict 
as an index of high-quality relationships and the presence of 
conflict as an index of low-quality relationships, we believe 
that such measures are flawed for two reasons. First, it is cer- 
tainly possible for a relationship to be characterized by low 
conflict and, simultaneously, by low mutual sharing of con- 
cerns and low mutual support. We would not consider this to 
be a high-quality relationship. For instance, two spouses may 
lead largely independent lives while sharing the same home. 
Each may go about his or her business with little or no re- 
liance on the other. Conflict in such a relationship would be 
quite low, but so too would mutual sharing of concerns, com- 
fort, and support. Indeed, many researchers define interde- 
pendence as the very essence of a relationship. They would 
not view such a relationship as being much of a relationship 
at all, much less a high-quality one. 

Second, we would not consider all conflict to be bad for 
relationships. Conflict often arises when one person in a rela- 
tionship feels that his or her needs have been neglected. Rais- 
ing this as a concern and working it out with a partner may 
give rise to conflict. However, at the same time, if the conflict 
is resolved to both persons’ satisfaction, the relationship is 
likely to have been improved relative to what it had been prior 
to the conflict. This logic suggests that the presence of some 
conflict in a close relationship (as long as it is dealt with in a 
constructive fashion) may actually be a positive indicator of 
relationship quality. 


Good Relationships Foster Members’ Well-Being 


Having rejected stability and satisfaction as valid indexes of 
good relationships, we suggest that high-quality relationships 
are ones in which members behave in such a manner as to 
foster the well-being of their partners. We define well-being, 
in turn, as each member’s good physical and mental health 
and each member’s being able to strive toward and reach 
desired individual and joint goals. 


We further suggest that the best way to define such rela- 
tionships is in terms of the interpersonal processes (and their 
impact on individual well-being) that characterize relation- 
ships. By identifying interpersonal processes likely to foster 
well-being in relationships, not only can we define high- 
quality relationships, but, simultaneously, we can also come 
to understand just why such relationships are of high quality. 

This said, recent research suggests that good relationships 
are those in which each member (a) feels an ongoing respon- 
sibility for the other member’s welfare and acts on that feel- 
ing by noncontingently meeting the needs of the partner and 
(b) feels comfortable and happy about that responsibility; in 
addition, in most mutual, adult, equal-status relationships 
each member (c) firmly believes that his or her partner feels a 
similar sense of responsibility for his or her own welfare and 
relies on that feeling by turning to the other for support with- 
out feeling obligated to repay and (d) believes that the other 
feels comfortable and happy about that responsibility. 

Members of high-quality mutual friendships, romantic rela- 
tionships, and family relationships trust each other, feel secure 
with each other, and derive satisfaction from nurturing each 
other. They understand, validate, and care for each other. 
They keep track of each other’s needs (Clark, Mills, & 
Powell, 1986), help each other (Clark, Ouellette, Powell, & 
Milberg, 1987), and feel good about doing so (Williamson 
& Clark, 1989, 1992). They feel bad when they fail to help 
(Williamson, Pegalis, Behan, & Clark, 1996). They respond to 
one another’s distress and even anger with accommodation and 
support (Finkel & Campbell, 2001; Rusbult, Verette, Whitney, 
Slovik, & Lipkus, 1991) rather than with reciprocal expres- 
sions of distress and anger or with defensiveness (Gottman, 
1979). They express their emotions to their partners (Clark, 
Fitness, & Brissette, 2001; Feeney, 1995, 1999). They turn to 
one another for help (Simpson, Rholes, & Nelligan, 1992). 
They are willing to forgive one another’s transgressions 
(McCullough, 2000). Further, members of such relationships 
are likely to hold positive illusions about partners that, in turn, 
bring out the best in those partners (Murray & Holmes, 1997; 
Murray, Holmes, & Griffin, 1996a, 1996b; Murray, Holmes, 
Dolderman, & Griffin, 2000) and to possess cognitive struc- 
tures in which even their partner’s apparent faults are linked to 
virtues (Murray & Holmes, 1993). Finally, members of such re- 
lationships appear ready to engage in some active relationship- 
protecting processes such as viewing their own relationship as 
being better than those of others (Johnson & Rusbult, 1989; 
Simpson, Gangestad, & Lerma, 1990; Van Lange & Rusbult, 
1995). All these things contribute to a sense of intimacy be- 
tween partners (Reis & Patrick, 1996; Reis & Shaver, 1988) 
and relationship members’ having the sense that their relation- 
ship is a safe haven (Collins & Feeney, 2000). 
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Relationship researchers do not have a single name for 
what we are describing as a high-quality relationship. Rather, 
several terms currently in use describe different aspects of 
such a relationship. We have called relationships in which 
people assume responsibility for another’s well-being and 
that benefit that person without expecting repayments com- 
munal relationships (Clark & Mills, 1979, 1993). However, 
assuming responsibility for another person’s needs and striv- 
ing to meet those needs on a noncontingent basis does not 
necessarily imply that one is competent or successful at so 
doing. Other terms in the literature for relationships imply 
success at following such norms. For example, relationships 
in which members successfully attend to, understand, vali- 
date, and effectively care for one another have been called 
intimate relationships by Reis and Shaver (1988; Reis & 
Patrick, 1996). Relationships in which members view the 
other as one who does care for their welfare and themselves 
as worthy of such are have been called secure relationships 
by attachment researchers (Ainsworth, Blehar, Waters, & 
Wall, 1978; Collins & Allard, 2001; Hazan & Shaver, 1987; 
Simpson et al., 1992). From our perspective, the exact ter- 
minology is not that important—an understanding of the in- 
terpersonal processes characterizing these relationships is 
important. 


Agreement on Levels of Responsibility Matters 


It is not sufficient just to characterize high-quality friend- 
ships, romantic relationships, marriages, and family relation- 
ships as those in which members assume responsibility for a 
partner’s welfare. It is also important, in our opinion, for 
members to assume the “right” levels of such responsibility. 
Of course, it is possible and easy to understand that a rela- 
tionship might be of low quality because members of a 
relationship do too little to foster the other’s welfare. That 
seems obvious. A parent who fails to feed his or her child 
adequately clearly does not have a high-quality relationship 
with that child. Spouses who ignore one another’s needs 
clearly do not have a high-quality relationship. Less obvi- 
ously, it is also possible for members to do too much to foster 
the other’s welfare. A person who receives an extravagant, 
expensive present from a casual friend is likely to feel quite 
uncomfortable and indebted and is unlikely to describe the 
relationship as high quality. A very young child might feel as 
if he must comfort his constantly distressed mother and do so. 
Objective observers would not consider this to be a sign of a 
high-quality relationship. Indeed, they are likely to consider 
this to be a sign of poor parenting. So, how are we to under- 
stand what degree of responsiveness to another’s needs is 
right for a relationship? 
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We suspect that almost everyone has a hierarchy of what 
we call communal relationships. By this we mean that people 
have a set of relationships with others about whose needs 
they believe they ought to care and to whose needs they be- 
lieve they ought to strive to be responsive in a noncontingent 
fashion (Clark & Mills, 1993; Mills & Clark, 1982). These 
relationships vary from weak to strong, with strength refer- 
ring to the degree of responsibility the person believes he or 
she ought to assume for the other’s welfare. One end of the 
hierarchy is anchored by relationships in which the person 
feels a very low degree of responsibility for the partner’s 
needs (e.g., a relationship with an acquaintance for whom the 
person might provide directions or the time of day with no 
expectation of compensation). The other end of the hierarchy 
is anchored by relationships in which the person assumes 
tremendous responsibility for the other’s needs (e.g., a 
parent-child relationship in which the parent would do just 
about anything at any cost to ensure the child’s welfare.) 

Figure 18.1 depicts one hypothetical person’s hierarchy 
of communal relationships. Communal relationships, from 
weak to strong, are depicted on the x-axis. The costs one is 
willing to incur to meet the other’s needs (noncontingently) 
are depicted on the y-axis. The dashed line in the figure 
depicts the costs the person is willing to incur in order to ben- 
efit the other on a communal basis. 

Beneath the implicit cost line benefits will be given and 
accepted on a communal, need, basis. Thus, for instance, 
strangers give one another the time of day, neighbors take in 
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Figure 18.1 The costs one hypothetical person is willing to incur to meet 
the needs of members of his or her social network on a communal basis. 


one another’s mail on a temporary basis, friends throw birthday 
parties for one another and travel to one another’s weddings, 
and parents spend years raising children and tremendous 
amounts of money to support those children. 

Above the cost line benefits are generally not given or 
even considered. When they are given, they are given on an 
exchange basis. Consider, for instance, a relationship partner 
who needs a car. This is a costly benefit and one that falls 
above the cost line for most relationships, such as those with 
acquaintances, neighbors, or friends. Under most circum- 
stances this means that this benefit will not be given (or asked 
for) in such relationships. The topic simply will not come up. 
However, a person might sell his car to a friend (an economic 
exchange in which the parties agree that the money and the 
care are of equal value). Neighbors might agree to provide 
each other’s child with rides to and from soccer practice fol- 
lowing a rule of equality (half the days one person drives, 
half the days the other drives), and so forth. 

Recognizing the existence of hierarchies of communal 
relationships should help to understand the nature of high- 
quality personal (communal) relationships. As we said earlier, 
these relationships are characterized by assumed, noncontin- 
gent responsibility for a partner’s needs. Here we add that the 
level of responsibility actually assumed on the part of a care- 
giver or expected on the part of a person in need (in the ab- 
sence of true emergencies) ought also to be appropriate to the 
location of that relationship in its members’ hierarchy of 
relationships. If the costs involved in meeting the need fall 
beneath the implicit cost boundary shown in Figure 18.1, the 
responsiveness ought to be present. If costs exceed the bound- 
ary, benefits should not be given, except for emergencies or in 
instances in which both members wish to strengthen the com- 
munal nature of the relationship. Indeed, giving a benefit that 
falls above the implicit cost boundary might harm the quality 
of the relationship. So too may asking for too costly a benefit 
or implying the existence of too strong a communal relation- 
ship by self-disclosing too much (Chaiken & Derlega, 1974; 
Kaplan, Firestone, Degnore, & Morre, 1974) be likely to hurt 
the relationship. 

This should help to explain why responsiveness must be 
within appropriate bounds even though responsiveness to 
needs is a hallmark of good relationships. A casual friend 
should not give one an extravagant present. It exceeds the 
appropriate level of responsiveness to needs. A young child is 
not supposed to assume a great deal of communal responsi- 
bility for his or her parent. Thus, a child consistently com- 
forting a troubled parent is not a sign of a high-quality 
relationship. In contrast, a parent is supposed to assume great 
communal responsibility for his or her child. Thus, a parent 


consistently comforting his or her troubled child is a sign of a 
high-quality relationship. 


NECESSARY ABILITIES AND FORTITUDES 


Having a hierarchy of communal relationships in which one 
believes one should behave communally (up to an implicit 
cost level) is one thing. Actually pulling off the task of ap- 
propriately and skillfully attending to one another’s needs in 
such relationships is quite another thing. For mutually sup- 
portive, trusting, secure, and intimate communal relation- 
ships to exist and to thrive, members must have three distinct 
sets of skills. One set allows for responding to one’s partner’s 
needs effectively. A second set allows for eliciting a partner’s 
attention to one’s own needs. The third set involves being able 
to distinguish successfully when one ought to behave in ac- 
cord with communal rules and when the application of such 
rules is socially inappropriate. 


Responding Effectively to a Partner’s Needs 


Skills and fortitudes necessary to respond effectively to a part- 
ner’s needs include empathic accuracy (Ickes, 1993) and the 
ability to draw out one’s partner’s worries and emotional states 
(Miller, Berg, & Archer, 1983; Purvis, Dabbs, & Hopper, 
1984). Many studies support the idea that understanding a 
spouse’s thoughts, beliefs, and feelings is linked with good 
marital adjustment (e.g., Christensen & Wallace, 1976; Noller, 
1980, 1981; Gottman & Porterfield, 1981; Guthrie & Noller, 
1988). Another skill important to meeting a partner’s needs is 
knowing when and how to offer help in such a way that it will 
not threaten the potential recipient’s self-esteem or make the 
potential recipient feel indebted, but will be accepted. Still an- 
other skill important to meeting a partner’s needs is the ability 
to give help that the partner (not the self) desires and from 
which the partner (not the self) will benefit. To do so requires 
accurate perception of differences in needs between the self 
and the partner. Many parents go wrong in this regard. They 
may impose their needs on the child and may be seen by out- 
siders as living “through their child,” often to the detriment of 
the child. 

Some of these abilities require learning, practice, and in- 
telligence (e.g., the ability to draw a partner out, empathic ac- 
curacy, and provision of emotional support). The keys to 
others may lie more in emotional fortitudes. A person may 
wish to express empathy or offer help but fail to do so out of 
fear of appearing awkward or being rejected. One’s history of 
personal relationships in general and one’s history within the 
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particular relationship in question provide explanations for a 
lack of emotional fortitude in providing help. If one’s past 
partners (or current partner) have not been open to accepting 
help in the past, then the person is likely to be reluctant to 
offer care. A lack of fortitude may also stem from temporary 
factors. When temporarily stressed or in a bad mood, people 
may not feel that they have the energy to help because they 
may be especially likely to anticipate that negative outcomes 
will be associated with helping (Clark & Waddell, 1983). 


Alerting Partners to Your Needs 


Next consider skills and fortitudes necessary for eliciting 
needed support for the self from one’s partner. In this regard, 
freely expressing one’s own need states to the partner through 
self-disclosure and emotional expression should be impor- 
tant. After all, a partner cannot respond to needs without 
knowing what they are. Given this, it is not surprising to us 
that self-disclosure has been found to increase positive affect 
(Vittengl & Holt, 2000), liking (Collins & Miller, 1994), and 
satisfaction in dating relationships (Fitzpatrick & Sollie, 
1999), marriages (Meeks, Hendrick, & Hendrick, 1998), 
and sibling relationships (Howe, Aquan-Assee, Bukowski, 
Rinaldi, & Lenoux, 2000). Of course, one ought also to be 
able to ask outright for help and accept it when it is offered. 
Perhaps less obviously, possessing the ability to say “no” to 
requests from the partner that interfere with one’s needs 
ought to be crucial to the partner’s being attentive and re- 
sponsive to one’s needs. It should also be important that, over 
time, one demonstrates that one does not exaggerate needs or 
constantly seek help when it is not needed (Mills & Clark, 
1986). This ought to increase a partner’s sense that one is 
appropriately, and not overly, dependent. 

Although help-seeking skills might seem easy, enacting 
them requires certain emotional fortitudes. In particular, ex- 
ercising all these skills probably requires having the firm 
sense that one’s partner truly cares for one and will, indeed, 
meet one’s needs to the best of his or her ability. Otherwise, 
self-disclosure, emotional expression, and asking for help 
seem inadvisable. Under such circumstances, one risks being 
rebuffed, rejected, or evaluated negatively. The partner may 
even use information to mock or exploit the other. Negative 
assertion on one’s own behalf may also be frightening, as it 
too many provide a basis for rejection. Thus it may seem best 
not to seek help and not to assert oneself. However, if one 
does not do so, keeping the relationship on a communal basis 
becomes difficult. It is for just these reasons that we believe 
that a sense of trust and security in relationships is key to 
following communal norms. 
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Knowing When to Be Communal 


Applying communal rules effectively within appropriate 
bounds requires the skills and fortitudes just mentioned. 
Avoiding their use in nonemergency situations outside those 
bounds may require additional fortitudes. One must be able to 
detect whether the other desires a communal relationship and, 
if so, at what strength. Being too anxious for intimate com- 
munal relationships may lead one to behave communally in 
inappropriate situations. Work by attachment researchers sug- 
gests that this is something that anxious, ambivalent, or pre- 
occupied people often do (Hazan & Shaver, 1987; Simpson & 
Rholes, 1998). 


LINKING RELATIONSHIP AND 
JUSTICE RESEARCH 


Having reviewed some relationship work suggesting what 
interpersonal processes and interpersonal skills make a per- 
sonal relationship such as a friendship, romantic relationship, 
marriage, or family relationship a high-quality relationship, 
we turn to linking this work to work on the use of distributive 
justice rules. In this regard we have already made clear that 
we believe that benefits are ideally distributed according to 
needs (and not inputs) in relationships such as friendships, 
romantic relationships, marriages, and family relationships. 
We have also made clear that we believe such responsiveness 
should be noncontingent. 

Our views fit well with some past work on distributive jus- 
tice. Specifically, our views fit well with work supporting the 
idea that use of a needs-based norm governing the giving and 
receiving of benefits is preferred to using other distributive 
justice norms in personal relationships (Clark et al., 1986; 
Clark et al., 1987; Deutsch, 1975, 1985, for family relation- 
ships; Lamm & Schwinger, 1980, 1983). At the same time, 
our views conflict with the arguments of many other distribu- 
tive justice researchers who have claimed that following other 
rules—rules such as equity (Walster, Walster, & Berscheid, 
1978; Sabatelli & Cecil-Pigo, 1985; Sprecher, 1986; Utne, 
Hatfield, Traupman, & Greenberger, 1984) or equality 
(Austin, 1980; Deutsch, 1975, 1985, for friendships)—are 
best for friendships, romantic relationships, and family rela- 
tionships. It also conflicts with the view that an individual 
solely watching out for his or her own welfare is best in 
such relationships (Cate, Lloyd, & Henton, 1985; Huston & 
Burgess, 1979). 

Are we right? Is following a noncontingent, responsiveness- 
to-needs rule best for personal relationships? Is it better than 
tules of equality or equity? If so, why? We think it is best, and 


we make the following theoretical and empirical case for this 
viewpoint. 


Following Communal Norms Affords Security; Following 
Contingent Norms Undermines Security 


The reason we believe that following a communal rule is ideal 
for ongoing intimate relationships is that it is the only rule 
that can afford members of the relationship the sense that the 
other truly cares for their welfare. If another responds to one’s 
needs on a noncontingent basis, the logical inference is that 
the other truly cares for oneself. This, in turn should heighten 
trust in the other and promote a sense of security. Instances in 
which the other benefits a person at some cost to him- or her- 
self should be especially likely to heighten trust (Holmes & 
Rempel, 1989). 

Note that, by definition, contingent distributive justice 
norms (equity, equality, exchange) involve receiving benefits 
as conditions of benefiting a person. In contrast, a need-based 
or communal norm dictates noncontingent giving and accep- 
tance of benefits. Thus, communal responsiveness should be 
uniquely valuable in terms of providing recipients of care 
with a sense of being valued and cared for—two of the com- 
ponents Reis and Shaver (1988) pointed out as essential for 
attaining a sense of intimacy in relationships. 

Looking at this from the perspective of the person who 
gives help also provides insight into the importance of 
following a communal norm in friendships, romantic rela- 
tionships, and family relationships. At the same time that 
noncontingent provision of benefits should cause a recipient 
to feel valued and cared for, so too should it cause the donor 
of the benefit to see him- or herself as a nurturant, caring 
individual. This is simply a matter of self-perception. Both 
feeling cared for and judging oneself to be a nurturant indi- 
vidual are, we suspect, deeply satisfying. It is just these feel- 
ings, we believe, that form the essence of what people desire 
from their friendships, family relationships, and romantic 
relationships. 


People Advocate and Follow Communal Norms 


Not only do we believe that—ideally and often in practice— 
people follow a communal rule in their intimate relationships 
and do not keep track of individual inputs and outcomes from 
a relationship, participants in our studies share our belief 
(Grote & Clark, 1998; Clark & Grote, 2001). What we did to 
examine this is straightforward: We asked people. First, we 
came up with prototype descriptions of a number of ways in 
which people might choose to distribute benefits within their 
intimate relationships. That is, we made up descriptions of 


communal rule, an exchange rule, an equity rule, and an equal- 
ity rule. Then we had people in a number of different types of 
close, personal relationships (i.e., friendships, dating relation- 
ships, marriages) rate the extent to which they viewed these 
rules to be ideal for their relationship (from —3 indicating “not 
at all ideal” to +3 indicating “extremely ideal”). They also 
rated each rule according to the extent to which they thought it 
was realistic on a similar scale. 

In each case the communal rule was rated as ideal for 
these relationships and as substantially more ideal than were 
any of the remaining rules (the ratings generally fell on the 
“not ideal” ends of the scales). In each case the communal 
rule also was rated as being on the realistic side of the scales 
and as being more realistic than any of the remaining rules. 


Use of Communal Norms and Relationship Satisfaction 


Some evidence that a tendency to follow contingent, record- 
keeping norms is associated with lower marital satisfaction 
comes from studies by Murstein, Cerreto, and MacDonald 
(1977) and by Buunk and VanYperen (1991). Murstein et al. 
(1977) measured the “exchange orientation” of one member 
of a group of married couples with a scale including items 
such as, “If I do dishes three times a week, I expect my 
spouse to do them three times a week.” They also adminis- 
tered a marital adjustment scale to research participants. 
Among both men and women, an exchange orientation to- 
ward marriage was negatively correlated with marital adjust- 
ment. (We would note, however, that they did not find an 
analogous negative correlation between exchange orientation 
and satisfaction in friendships, perhaps because the friend- 
ships were weak ones.) 

Buunk and VanYperen (1991) had individuals fill out an 
eight-item measure of exchange orientation and a Global 
Measure of Equity (see Walster et al., 1978). The latter mea- 
sure asks, “Considering what you put into your relationship 
relative to what you get out of it and what your partner puts 
in compared to what he gets out of it, how does your rela- 
tionship ‘stack up’?” Respondents could indicate that they 
were getting a much better or better deal, an equitable deal, or 
a much worse or worse deal than their partner. Buunk and 
VanYperen also measured satisfaction with the relationship 
with an eight-item Likert-type scale that measures the fre- 
quency with which the interaction with the partner in an inti- 
mate relationship is experienced as rewarding and not as 
aversive. 

As these researchers expected, perceiving oneself to be 
over- or underbenefited relative to one’s spouse was linked 
with lower relationship satisfaction among those high in ex- 
change orientation but not among those low in exchange 
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orientation. More important for the present point, however, 
there was a main effect of being high in exchange orientation 
on marital satisfaction. Those high in exchange orientation 
reported substantially lower marital satisfaction than did 
those low in exchange orientation. They did so regardless of 
whether they reported being underbenefited, equitably bene- 
fited, or overbenefited (Buunk & VanYperen, 1991). 


Can We Follow Contingent Rules Anyway? 


Still another reason we believe that people do not keep track 
of inputs and calculate fairness on some sort of contingent 
basis in well-functioning close relationships is simply that 
following any contingent rule in relationships in which levels 
of interdependence are high is virtually impossible. Even to 
make a substantial effort to do so day to day and week to 
week would be so effortful as to be tremendously irritating 
and painful to the relationship members involved. Consider 
the impossibility of accurately keeping track of benefits first. 

Think of the sheer number and variety of benefits that are 
likely to be given and received in an intimate relationship for 
example, between a husband and wife living together in the 
same home. Each day a very large number of household tasks 
(e.g., making beds, doing laundry, picking up clutter, prepar- 
ing food, shopping for food, putting groceries away, vacuum- 
ing, dusting, taking the mail in, feeding pets, changing light 
bulbs and toilet paper, etc.) are done. So too are a variety of 
nonhousehold services (e.g., dropping a spouse off at work, 
picking up take-out food, dropping off dry cleaning, hav- 
ing something framed, visiting relatives, etc.). Then there 
are benefits that fall within the categories of verbal affec- 
tion, physical affection, information, instructions, and goods 
that are given and received. Furthermore, things such as re- 
strained behavioral impulses might be considered benefits. 
How in the world can two people in a relationship accurately 
track these things and still accomplish anything else in their 
lives? The answer is, we think, that they cannot. 

To make matters worse, one must keep in mind that track- 
ing the equality or equity of benefits given and benefits re- 
ceived involves far more than simply keeping track of what 
has been given by each member of a relationship. To compute 
equality or equality one must place values or weights on the 
diverse benefits given and received and compute the equality, 
equity, or evenness of repeated specific exchanges. What is 
taking the garbage out worth? Does it matter if it is cold and 
rainy outside? How does it compare with another simple ser- 
vice such as putting the laundry in the machine or unloading 
the dishwasher? Tougher yet, how does it compare with giv- 
ing a hug (and does the hug get discounted because both 
people benefit)? 
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To push this even further, consider these questions. How 
does the ability and enjoyment of giving a benefit figure into 
the calculations? If one partner enjoys doing laundry and the 
other does not, is laundry done by the latter weighted higher 
than laundry done by the former? If one partner does not care 
if the living room is cluttered but the other one does, does it 
count at all if the latter person cleans up the clutter? These 
questions are difficult, and they probably seem silly. We 
suggest that the reason they may seem silly is precisely that 
people simply do not try to calculate these things in their 
day-to-day lives, primarily because in good times issues of 
fairness do not occur to them. Moreover, in times of more 
stress, when people may have some desire to compute such 
things, they realize the futility of trying to compute objective 
equality or equity across diverse domains of inputs and out- 
comes. (Later we address what we suspect they actually do in 
times of stress.) 


Even If We Could Follow Contingent Norms, Do We 
Have Access to the Necessary Information? 


Imagine that one did have the cognitive capacity to keep 
track of all benefits given and received in a relationship. Does 
one have access to all the relevant input? We do not think so. 
Again, consider a husband and wife who live together—a 
husband and wife who can surmount the obstacles to record 
keeping just discussed. We still think it would be an impossi- 
ble task to track everything that ought to be tracked simply 
because each person has better access to contributions that 
he or she has made to the relationship than to contributions 
that the other has made for a number of reasons. The most 
straightforward reason is that many contributions one partner 
makes to the relationship are made in the absence of the other 
partner. 

Picture the husband arriving home prior to the wife. He 
stops at the mailbox and brings the mail into the house. 
He throws out the junk and leaves the rest on the table. He 
notices that the cat has tipped over a plant and cleans up 
the mess. He listens to three solicitation messages left on the 
answering machine and deletes them. Although tired, he 
chats pleasantly when his mother-in-law calls in order to 
make her feel good and keep her company. He starts dinner. 
His wife arrives. She notices the mail on the table and the 
dinner cooking, but does she know anything about the other 
contributions to the relationship that her spouse has made? 
No, and he may well not mention them. The general point is 
that because of the lopsided accessibility of information 
about contributions to a relationship, there will always be a 
bias to perceive that the self has made more contributions 
than the partner has made. 


ARE CONTINGENT RULES EVER USED 
IN CLOSE RELATIONSHIPS? 


To recap, we argued that the ideal norm for giving and re- 
ceiving benefits in close relationships (within an implicit cost 
boundary) is a need-based, or communal, norm. We further 
noted that when such relationships are functioning well, is- 
sues of fairness tend not to arise. This is not, however, to say 
that complaints and distress never arise. A need may be 
neglected, and the neglected person may become distressed 
and complain. Ideally, the partner responds to that distress 
and complaint in such a manner as to address the need at 
hand, soothe the partner, and maintain the relationship on an 
even, communal keel. However, perhaps the need will not be 
addressed. It is then, we contend, that processes leading to 
concerns about fairness may begin to unfold. 

Imagine that a person neglects his or her partner’s needs; 
the latter complains, but the former does not respond by ade- 
quately addressing the need. Even then, we suspect, the situ- 
ation may unfold in such a manner that issues of fairness do 
not arise. Specifically, sometimes the partner will respond 
with a benign interpretation of the behavior. For instance, that 
partner may respond by blaming unstable, situational causes 
rather than the partner (cf. Bradbury & Fincham, 1990), and 
the behavior may simply be tolerated. Rusbult et al. (1991) 
described this action as accommodation, generally, and as an 
instance of reacting with loyalty, more specifically. 

For instance, consider a woman who lives far from her 
family of origin and misses them terribly. She tells her hus- 
band of her desire to visit them during their next vacation. He 
refuses, countering that he would rather take a relaxing trip, 
perhaps one to the beach. She then suggests that they could 
go for just a weekend, and he refuses again, saying that he re- 
ally wants her to stay home and get some work done and that 
he really needs her company. In other words, he does not re- 
spond to her needs. In the face of this his wife may interpret 
his behavior benignly by attributing it to the situation (“He’s 
very stressed. It’s not that he doesn’t love me; he just needs to 
relax”). She may then behave constructively by continuing 
on with her own communal behavior (acting loyally). She 
may even go beyond benignly attributing her partner’s be- 
havior to the nonstable, situational factors and actually con- 
nect her partner’s faults (as evidenced by the poor behavior) 
to virtues, as Murray and Holmes (1993) observed. For in- 
stance, his reluctance to visit relatives and his desire to be 
with her alone on vacation or at home might be taken as evi- 
dence of his love for her and of his sensible nature—he does 
not want to take too much on. In any case, she continues on, 
maintaining her faith in the overall communal nature of their 
relationship. 


But sometimes spouses do not respond in such an accom- 
modating manner or by connecting their partner’s faults to 
virtues. Instead, they may conclude that their partner really is 
not a good partner or that they themselves are not worthy of 
care. In such instances, people may well experience an incli- 
nation to switch to contingent rules of distributive justice, 
and they may actually do so. Doing so, we believe, is trig- 
gered by the judgment that one’s partner has not met one’s 
need combined with a judgment that this is due to a lack of 
true caring for the self. One might say that trust in the partner 
has evaporated. At such times, the adoption of a contingent 
distributive justice rule in place of a communal rule is likely 
to seem adaptive. It seems adaptive, we contend, because it is 
judged to be a more effective means of getting what one 
needs from one’s partner than is trusting that partner to be 
noncontingently responsive to one’s needs. 

Consider once again the woman who lives far from her 
family of origin and misses them terribly. This time, after she 
suggests that they could go just for a weekend and he refuses 
again, saying that he really wants her to stay home and that he 
needs her company, she becomes increasingly distressed at 
her husband’s refusal to respond in any way to her needs. She 
may attribute his behavior to himself rather than to the situa- 
tion (“He’s selfish”). Alternatively, or perhaps additionally, 
she may attribute his behavior to herself (“I’m not loveable”). 
His faults may also bring other faults to mind (“He’s selfish; 
he’s often inconsiderate”). He is really not very insightful or 
intelligent. In general, he is an embarrassment to be around. 

In any case, the wife may conclude that the only way her 
husband is going to respond to her needs is if he must do so in 
order to receive benefits himself (a contingent, exchange per- 
spective). Thus, she may counter his responses by thinking, 
“Well, OK, if that’s the way he’s going to be,” and saying, 
“Look, if you’re not willing to visit my family, then you cer- 
tainly can’t expect me to go visit yours next May when we 
were planning on going. I’m only going to go if you do the 
same for me.” 

This threat may well work in that the spouse agrees to go 
visit her family. Unfortunately, we propose, it works with 
some costs. Switching from a communal to an exchange 
norm sacrifices important things. First, the donor of a benefit 
will no longer be able to derive the same sense of nurturing 
the other. He or she must attribute at least part (or maybe all) 
of their motivation to their own selfish interests. Second, the 
recipient of the benefit no longer derives the same sense of 
being cared for and security from acquiring the benefit. He or 
she must attribute at least part (or maybe all) of the donor’s 
motivation to the donor’s own self-interest rather than to 
the donor’s sense of caring for them. Trust is also likely to 
deteriorate. 
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When Will People Switch? 


We already suggested that switches from communal to ex- 
change norms are likely to be triggered when a person feels 
that his or her needs have not been met. We have also sug- 
gested, however, that this will not always occur. Thus, an 
important question becomes when it will and will not occur. 
We have two answers to this question, one having to do with 
the situation in which a person finds him- or herself and one 
having to do with the personality of the person whose needs 
have been neglected and who is, therefore, vulnerable to 
switching. 


The Situation Matters 


Our first answer is straightforward. We predict that people 
will be more likely to switch from communal to exchange 
equality or equity norms when they perceive that their needs 
are being neglected. This may occur because a partner who 
has normally been quite responsive to the person’s needs 
ceases to be so responsive to that person’s needs. This could 
occur because the partner has become interested in someone 
or something else or because the partner is under consider- 
able stress or is distracted from the partner’s needs. It also 
may occur because the person who needs help has experi- 
enced a large increase in needs that the partner cannot meet. 
If this is the case, and if the partner continues to neglect needs 
despite any attempts on the person’s part to rectify the situa- 
tion, the person might switch to an exchange norm. 

Of course, there are other options available to the person 
who has lost faith in the communal norm. The person could 
leave the relationship altogether or switch immediately to 
simply watching out for his or her own needs without adopt- 
ing a norm such as equity or equality. 

How are decisions between these options made? We can 
only speculate at this point, but it seems to us that certain 
variables that have long been discussed by interdependence 
theorists are relevant to making these decisions. If there 
are few barriers to leaving (i.e., in interdependence terms, if 
the person has good alternative options, such as being alone 
or forming alternative relationships), if investments in the 
relationship are low, and if there are few social or personal 
prescriptives to leaving, the person whose needs are being 
neglected might simply leave. A switch to contingent, record- 
keeping norms might never take place. We suspect this often 
happens in friendships. If a friend neglects one’s needs, peo- 
ple usually have other friends (or potential friends) to whom 
they can turn. There are typically not great social or personal 
prescriptives against letting a friendship lapse, and invest- 
ments in friendships tend to be lower than those in other close 
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relationships (e.g., romantic relationships, marriages, or 
parent-child relationships). 

On the other hand, sometimes there are considerable bar- 
riers to leaving a close relationship. Investments may be high, 
alternatives may not seem attractive, and there may be strong 
social and personal pressures working against a person’s 
leaving the relationship. We suspect that when such barriers 
are high, people whose needs have been seriously neglected 
will stay in the relationship but switch from adherence to a 
communal norm to adherence to a contingent record-keeping 
norm such as equity, equality, or exchange. This may happen 
in many marriages in which investments that cannot be 
recouped have been made (e.g., children, a joint house, finan- 
cial success, joint friends), alternatives seem poor (being 
poorer, living alone, leaving the house), and strong prescrip- 
tions against leaving exist (one’s church or parents would dis- 
approve). In such circumstances, the best option may seem to 
be to continue relationship but to switch the basis on which 
benefits are given in such a way that one feels more certain 
that one’s needs will be met. This may seem most workable 
even if one has to sacrifice a sense of being nurtured and of 
nurturing. 


Individual Differences Matter 


We believe that it is the situation that triggers people to 
switch from communal to contingent, record-keeping distrib- 
utive justice norms, but we also believe that personality mat- 
ters. People differ from one another in terms of their chronic 
tendencies to believe that others will be responsive to their 
needs and that they are worthy of such responsiveness. This 
has been a major theme in recent relationship literature. It is 
especially evident in the attachment theory and the empirical 
work that has been based on that theory. Secure individuals 
are assumed to view close others as likely to respond to their 
needs on a consistent basis, and they feel comfortable de- 
pending on others for support. Insecure people do not. How- 
ever, attachment theorists are not the only ones who have 
emphasized differences in how people tend to view their part- 
ners. Others have talked about people differing in their 
chronic tendencies to trust other people in close relationships 
without necessarily referring to attachment theory (Holmes & 
Rempel, 1989), or about how chronic levels of self-esteem 
may relate to views of, and reactions to, close partners 
(Murray, Holmes, MacDonald, & Ellsworth, 1998). For our 
own part, we have discussed chronic individual differences in 
communal orientation, which refers to the tendency to re- 
spond to the needs of others and to expect others to respond 
to one’s own needs on a noncontingent basis (Clark et al., 
1987). 


We suspect that these chronic individual differences that 
people bring to their close relationships will be important 
determinants of switching from communal to exchange 
norms in the face of evidence that one’s partner is neglecting 
one’s needs. We suspect that almost everyone (regardless of 
attachment style, trust, self-esteem, or communal orientation) 
understands communal norms as we have discussed them. 
Moreover, we would assert that almost everyone believes that 
communal norms are ideal for friendships, romantic relation- 
ships, and marriages and that people start off such relation- 
ships following such norms. Indeed, it is by following such 
norms in the first place that people signal to potential partners 
that they want a friendship or romantic relationship with an- 
other person. 

However, we also suspect that people who are insecure, 
have low trust in others, are low in self-esteem, or are low in 
chronic communal orientation (variables that we suspect co- 
occur) will be especially vulnerable to switching from a com- 
munal to an exchange norm in the face of real or imagined 
evidence that the other is neglecting their needs. They are the 
people, we assert, who react to the slightest evidence of such 
neglect with conclusions that the evidence indicates that the 
other is selfish and does not care for them or that they are un- 
worthy of care. Further, we suggest that such conclusions, in 
turn, lead them to back away from the relationship. Alter- 
natively (perhaps because they also are likely to perceive 
that they have fewer good alternatives than others do), these 
insecure individuals might be led to switch to a contingent, 
record-keeping norm such as equality, equity, or exchange as 
a basis for giving and receiving benefits in their relationship. 


Evidence 


The arguments we have just made suggest something that, 
to date, has not received attention in the distributive justice 
literature. Researchers in that tradition have typically advo- 
cated that there is one real rule that governs the giving and re- 
ceiving of benefits within close relationships. Some suggest it 
is equality (Austin, 1980; Deutsch, 1975, 1985, for friend- 
ships); some suggest it is equity (Sabatelli & Cecil-Pigo, 
1985; Utne et al., 1984; Walster et al., 1978); and some say it 
is a need-based rule (Deutsch, 1975, 1985, for family rela- 
tionships; Lamm & Schwinger, 1980, 1983; Mills & Clark, 
1982). Whatever rule they advocate, though, it has tended to 
be a single rule, and they have suggested that people in close 
relationships generally follow that rule. If a person does a 
good job following the particular rule, all is well. If the rule is 
violated, unhappiness results, and either the distress must be 
resolved or the relationship may end. We are suggesting 
something quite different. 


We are suggesting that people in general start off their 
close relationships believing in a communal norm and doing 
their best to follow it. Such a norm requires mutual respon- 
siveness to needs. However, it is inevitable that needs will be 
neglected. When this happens or when it is perceived to have 
happened, distress will occur, as we have argued. The dis- 
tress, in turn, sets the stage for a possible switch to a contin- 
gent, record-keeping norm such as equity or exchange. Thus, 
it is likely that it is distress that—in most relationships some 
of the time and in some relationships very often—leads to 
record keeping. Such record keeping will, however, necessar- 
ily be retrospective at first. As such, it is very likely that it will 
be biased in such a manner as to result in evidence of inequity. 
Perceptions of inequity will, in turn, lead to judgments of un- 
fairness. Then, in an iterative fashion, these perceptions will 
lead to further distress. Note that this is the reverse of what 
has typically been argued in the past, which is that record 
keeping and calculations of equity come first and that distress 
results when inequities are detected (Walster et al., 1978). 

Is there any evidence for our proposal that distress pre- 
cedes perception of unfairness in close relationships (rather 
than vice versa)? The answer is yes (Grote & Clark, 2001). In 
a recent study we tracked both conflict and perceptions of 
unfairness in a sample of about 200 married couples. These 
couples were enrolled in the study at a time when the wife 
was in the third trimester of her first pregnancy. Marital con- 
flict (distress) was tapped at that time, again a few months 
after the baby was born, and a third time when the baby was 
about 1 year old. We also asked many questions about the di- 
vision of household labor at all three points in time and about 
how fair the husband and wife felt that division of labor to be. 
(Notably, the division of labor was almost always judged to 
be unfair, with the wife performing more whether she stayed 
at home, worked part time or worked full time, and with both 
spouses agreeing that this was unfair.) 

The longitudinal panel design of this study allowed us to 
conduct path analyses on the data in order to ascertain 
whether conflict at Time 1 predicted perceptions of unfair- 
ness at Time 2 (controlling for perceptions of unfairness at 
Time 1). Our theoretical position led us to the prediction that 
it would. We were also able to test whether perceptions of un- 
fairness at Time 1 would predict conflict at Time 2 (control- 
ling for conflict at Time 1). Traditional perspectives would 
lead to the prediction that it would. However, our theoretical 
perspective led us to predict that that would not necessarily 
be the case. That is, we believed that the division of labor 
could be inequitable and could be judged to be unfair when a 
social scientist came along and asked about it but still might 
not disrupt the relationship if both partners felt that their 
needs were being met and did not feel stressed. 
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Figure 18.2 Links between relationship conflict and perceptions of fair- 
ness in relationships across time. 


The results, which are shown in Figure 18.2, were as we 
expected. Conflict at Time 1 (which we felt was indicative of 
situations in which at least one person was feeling that his or 
her needs were not being met) prospectively and significantly 
predicted perceptions of unfairness at Time 2 controlling for 
perceptions of unfairness at Time 1. Perceptions of unfairness 
at Time 1, however, were not significant prospective predic- 
tors of conflict at Time 2 controlling for perceptions of con- 
flict at Time 1. This occurs, we assert, because in low-stress 
times when partners’ needs are being met (as we suspected 
was the case for most couples prior to the birth of an eagerly 
anticipated first child), people are not keeping track of inputs 
and outcomes day to day and are not calculating fairness. 
Whereas they can report on inequities in housework when a 
social scientist asks them to do so, we believe that most of 
our couples were not doing this on their own. That is why our 
measures of perceived unfairness did not predict conflict. In 
contrast, the early measures of conflict, we suspect, did pick 
up on those couples including at least one member who felt 
that his or her needs were being neglected. It is among these 
couples, we suspect, that record keeping (much of it ret- 
rospective and biased) emerged, resulting in perceptions of 
unfairness. 

Once record keeping does emerge and unfairness is per- 
ceived, we have predicted that those perceptions of unfair- 
ness will increase unhappiness further. Evidence for this 
subsequent process emerged in the Grote and Clark data as 
well. Specifically, when changes in the patterns of data from 
Time 2 until Time 3 were examined, it was found that per- 
ceptions of unfairness at Time 2 (shortly after the baby had 
been born) until Time 3 (when the baby was about 1 year old) 
did significantly predict increases in conflict, controlling for 
conflict at Time 2. This occurred, we believe, because once 
couples were stressed and record keeping commenced, find- 
ing evidence of inequities increased distress still further. One 
interesting result was that conflict measured at Time 2 did 
not predict further increases in perceptions of unfairness as 
Time 3 (controlling for perceptions of unfairness at Time 2). 
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Thus, we have acquired and reported evidence consistent 
with the notion that the existence of inequities in a marriage 
will not necessarily lead to distress. We have also acquired 
and reported evidence consistent with the notions that dis- 
tress might be what triggers contingent record keeping and 
perceptions of unfairness. We do not yet have hard empirical 
evidence that there are individual differences in people’s ten- 
dencies to feel that their needs have been neglected and, in 
turn, to switch from adherence to a communal norm to adher- 
ence to some sort of record keeping, contingent, distributive 
justice norm. However, we are currently collecting and be- 
ginning to analyze data relevant to just that question. 


Permanent or Temporary Switches? 


A final issue we wish to address in this chapter is whether the 
change will be permanent or temporary once people switch 
from a communal to a contingent, record-keeping norm for 
distributing benefits within their relationship. We propose 
that most such switches will be temporary. These changes 
will occur when a person is dissatisfied with how a relation- 
ship is going, wishes to ensure that his or her needs are met 
by the partner, and (not incidentally) wishes to signal his or 
her distress to the partner. Indeed, communicating displea- 
sure may be just as important a motivator of the switch as is 
ensuring that one gets what one wants. Once the switch has 
been made and communicated, the protest function of having 
done so is largely accomplished. So, too, may the person 
have accomplished the short-term goal of having one imme- 
diate need addressed. 

However, once a contingent, record-keeping distributive 
justice norm begins to be used, all the disadvantages of fol- 
lowing such a norm will emerge. That is, record keeping will 
have to be done. It is tedious; it is virtually impossible to do 
competently; and given all the sorts of biases already dis- 
cussed in this chapter, there will inevitably be disagreements 
over whether equity, equality, or fair exchange have been 
achieved. Moreover, the advantages of following a communal 
norm will evaporate. The recipient of benefits will not feel that 
the other cares for him or her, and the donor of benefits will not 
derive satisfaction for having nurtured a partner. These things 
combined with the strong societal norm that communal rules 
ought to characterize marriages and other close relationships 
will combine to push couples back to following a communal 
norm. Moreover, stresses in relationships themselves will 
often dissipate, and reminders of a partner’s true caring atti- 
tudes will reemerge. Thus, we would predict that couples will 
often bounce back to using communal norms. 

On the other hand, there should also be cases when cou- 
ples do not bounce back. Chronic neglect of at least one 


partner’s needs by the other may predict this. So too may 
either partner’s long-term, pessimistic views of the likelihood 
of the other being caring (and of the self being worthy of 
care) predict such a lack of resilience. These two things, in 
combination, may be especially likely to predict that a switch 
to contingent norms will be longer term. Such a switch, as we 
have already noted, is unlikely to constitute a satisfying solu- 
tion. Therefore, we believe that it will likely be followed by a 
further switch to purely self-interested behavior or to the dis- 
solution of the relationship. Whether the relationship persists 
long term (and perhaps happily), given the use of contingent 
record-keeping norms, or whether it ends will depend on the 
presence or absence of the sorts of barriers to leaving that in- 
terdependence theorists have discussed. That is, having poor 
alternatives, high investments, and feeling prescriptions 
against leaving are factors likely to keep couples together de- 
spite giving and receiving benefits on what we consider to be 
nonoptimal bases. Good alternatives, low investments, and 
low prescriptions to leaving are likely to predict relationship 
dissolution. 


CONCLUSIONS 


In this chapter we described what we believe to be the char- 
acteristics of a high-quality friendship, dating relationship, 
marriage, or family relationship. We suggested that quality 
ought not be defined in terms of stability, satisfaction, or con- 
flict but rather in terms of the presence of interpersonal 
processes that facilitate the well-being of its members. We 
also suggested that members ought to agree implicitly on the 
degree of responsiveness to needs that is expected in the rela- 
tionship and that relationships can go bad not only if respon- 
siveness to needs is not present when expected but also if it is 
present when it is not called for. 

Next we pointed out that viewing close relationships in this 
way suggests taking a new approach to understanding the use 
(and nonuse) of contingent, record-keeping distributive jus- 
tice norms in intimate relationships. In well-functioning inti- 
mate relationships people should respond to one another’s 
needs in a noncontingent fashion as those needs arise. Record 
keeping should not be an issue, and fairness should not be dis- 
cussed. Fairness simply should not be a salient issue for people 
in such relationships. (Of course, if some social scientist 
comes along and asks participants to judge the fairness of the 
giving and receiving of benefits in that relationship, we have 
no doubt that members will come up with such ratings. We just 
do not think they do this spontaneously on their own.) Mem- 
bers of such relationships appear to be following a communal 
rule, and we believe that following such a rule promotes a 
sense of intimacy, security, and well-being in the relationship. 


However, we argued further, record keeping may become 
an issue in relationships that members would like to see op- 
erate on a communal basis. It becomes an issue if and when 
needs are perceived to have been neglected and attributions 
of a lack of caring are made. In such cases, partners in a rela- 
tionship may switch to record-keeping norms such as ex- 
change, equity, or equality in an effort to ensure that their 
needs are met. Once this is done, it is very likely that unfair- 
ness will be perceived (whether it objectively exists or not), 
and distress is likely to increase. However, we do believe that 
many couples are resilient and will “bounce back” to follow- 
ing communal norms with time. Others will not be resilient, 
and members of such relationships will continue, often times 
unhappily, to use record-keeping rules to give and receive 
benefits or even to rely on pure self-interest. Depending 
on barriers to exiting the relationship, it may or may not 
dissolve. 
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The word prosocial does not appear in most dictionaries; it 
was created by social scientists as an antonym for antisocial. 
Prosocial behavior covers the broad range of actions 
intended to benefit one or more people other than oneself— 
behaviors such as helping, comforting, sharing, and cooper- 
ating. The word altruism has at times been used to refer to a 
subset of these behaviors—for example, self-sacrificial help- 
ing or helping in the absence of obvious, external rewards. 
Such usage seems inappropriate, however, because altruism 
is a motivational concept. Altruism is the motivation to in- 
crease another person’s welfare; it is contrasted to egoism, 
the motivation to increase one’s own welfare (MacIntyre, 
1967). There is no one-to-one correspondence between 
prosocial behavior and altruism. Prosocial behavior need not 
be motivated by altruism; altruistic motivation need not pro- 
duce prosocial behavior. 


WHY DO—AND DON’T—PEOPLE 
ACT PROSOCIALLY? 


Addressing the question of why people act prosocially may 
seem natural and necessary for social psychologists. Indeed, 
in the field’s first text William McDougall (1908) made this 
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question focal: “The fundamental problem of social psychol- 
ogy is the moralization of the individual by the society into 
which he is born as a creature in which the non-moral and 
purely egoistic tendencies are so much stronger than any altru- 
istic tendencies” (p. 16). When Kurt Lewin, his students, and 
his colleagues ushered in modern social psychology in the 
1930s and 1940s, however, other questions took precedence. 
These were the pressing social-problem questions provoked 
by the rise of Nazism, two world wars, the Holocaust, the 
advent of the nuclear age, the Cold War, and racial injustice. 
Attention was directed to totalitarian and autocratic leadership, 
conformity and obedience to authority, aggression, prejudice, 
ethnocentrism, interpersonal and intergroup conflict, propa- 
ganda, persuasion, and attitude formation and change. 

The 1960s brought the question of why people act proso- 
cially to the fore once again. This question did not replace the 
social-problem questions; it was added to the list. Several 
shocking cases in which bystanders failed to help persons in 
desperate need raised concern about the breakdown of social 
structure and social decency, especially in urban environ- 
ments. Best known is the case of Kitty Genovese, whose bru- 
tal stabbing and eventual death was witnessed by 38 of her 
neighbors in the Kew Gardens area of Queens, New York. 
Her murder took more than half an hour, and despite her 
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pleading screams, no one intervened; no one even called the 
police. More heartening were the courageous acts of Freedom 
Riders and other civil rights workers, Black and White, who 
suffered beatings, imprisonment, and in some cases death to 
further the cause of racial equality in the American South. 
Youth were in the streets to protest the Vietnam War and to 
proclaim the dawning of the Age of Aquarius. The times they 
were a-changin’. Social psychologists were asked, Why do— 
and don’t—people act prosocially? 

Before attempting to offer an answer to this question, one 
should probably inquire of the questioner, “Why do you ask?” 
This response is necessary because the question has been 
asked for two very different reasons. Some have asked in 
order to reach the practical goal of encouraging prosocial be- 
havior; others, in order to challenge currently dominant theo- 
ries of social motivation. The dominant motivational theories 
in psychology, sociology, economics, and political science 
are firmly founded on assumptions of universal egoism 
(Mansbridge, 1990; Wallach & Wallach, 1983). Can one ac- 
count for all prosocial behavior in terms of egoism, or must 
one make room for altruism as well? Might there be other 
forms of prosocial motivation besides egoism and altruism? 

These two reasons for asking why people act prosocially 
beg for very different answers. So, if one is not clear which 
reason lies behind the question, the answer provided may 
appear irrelevant and the research on which it is based mis- 
guided. To avoid such confusion, this chapter addresses the 
two concerns in turn—first the practical, then the theoretical. 


VARIANCE-ACCOUNTED-FOR 
EMPIRICAL ANALYSIS 


Psychologists pursuing the practical concern of promoting 
prosocial behavior usually employ one of two strategies: (a) a 
variance-accounted-for empirical analysis or (b) application 
and extension of existing social psychological theory. One 
view of science that has long been popular among psy- 
chologists, especially psychologists with an applied orien- 
tation, is empirical prediction and control. From this 
perspective, promoting prosocial behavior requires, first, 
identification of its most powerful predictors. Then one can 
engage in social engineering, creating an environment that 
optimizes these predictors and, thereby, prosocial behavior. 
The logic seems straightforward. Its apparent simplicity has, 
however, proved deceptive. 


Dispositional Versus Situational Determinants 


Operating with an implicit variance-accounted-for model, sev- 
eral investigators around 1970 attempted to determine whether 


dispositional or situational factors were better predictors of 
prosocial behavior. The dispositional variables studied include 
anomie, authoritarianism, autonomy, deference, intelligence, 
Machiavellianism, nurturance, religiosity, self-esteem, social 
desirability, social responsibility, submissiveness, and succo- 
rance. Not one of these, by itself, was a clear predictor. In con- 
trast, situational factors—ambiguity of need, severity of need, 
physical appearance of victim, similarity to victim, friendship, 
number of bystanders, location (urban vs. rural), cost of help- 
ing, and so on—seemed powerful. These results led several 
reviewers (e.g., Huston & Korte, 1976; J. A. Piliavin, Dovidio, 
Gaertner, & Clark, 1981) to conclude that situational variables 
are better predictors of prosocial behavior than are disposi- 
tional variables. 

Soon, however, this conclusion was challenged as part of 
the general counterattack by personality researchers against 
situationist critiques. Staub (1974) found that an aggregate 
dispositional measure, a prosocial orientation index (combin- 
ing measures of feelings of personal responsibility, social 
responsibility, moral reasoning, prosocial values, and a low 
level of Machiavellianism), was a reasonably good predictor 
of helping across several different measures. Rushton (1980) 
reanalyzed previous research (notably, the classic studies by 
Hartshorne and May in the late 1920s) by computing aggre- 
gate measures of prosocial behavior and found far better evi- 
dence of cross-situation consistency than had analyses based 
on individual measures. 

Other researchers pointed to the greater predictive poten- 
tial of dispositional factors for the higher cost, nonsponta- 
neous, longer term helping that occurs in the natural stream of 
behavior outside the psychological laboratory. For example, 
Oliner and Oliner (1988) conducted a major study using inter- 
views and questionnaires to identify predictors of acting to 
rescue Jews in Nazi Europe. They claimed evidence for the 
predictive power of three dispositional factors: (a) a proclivity 
to feel empathy for those in need, (b) sensitivity to normative 
pressure from social groups, and (c) adherence to inclusive, 
universal moral principles such as justice or care. Presumably, 
better prediction is possible outside the laboratory because the 
more reflective decision process involved in planned (non- 
spontaneous) helping permits more chance for personal val- 
ues, attitudes, and dispositions to come into play. 

Still other researchers argued that it was an oversimplifi- 
cation to expect a personality variable to relate to helping in 
all situations. Many pointed to the greater success of predict- 
ing prosocial behavior using disposition-situation interac- 
tions (e.g., Romer, Gruder, & Lizzardo, 1986). For example, 
self-confidence and independence seem to correlate with 
helping in emergency situations, especially dangerous ones, 
but not in response to a request to contribute to the United 
Way (Wilson, 1976). Snyder and Ickes (1985) suggested that 


the predictive power of dispositional factors should be mani- 
fest only when situational pressure is weak, not when it is 
strong. Carlo, Eisenberg, Troyer, Switzer, and Speer (1991) 
claimed support for this distinction between weak and strong 
pressure when predicting prosocial behavior. Within these 
more recent studies, then, dispositional predictors have 
fared better than in earlier work. Still, correlations between 
personality measures and prosocial behavior—however 
measured—rarely rise above .30 to .40, leaving 85% to 90% 
of the variance unaccounted for. 

At the same time that dispositional predictors were being 
revived, the health of situational predictors took a turn for 
the worse: Their ecological validity was questioned (Bar-Tal, 
1984). Could one expect a situational predictor of single-act 
helping by college students in a controlled laboratory experi- 
ment to be equally powerful in predicting naturally occurring 
prosocial behavior outside the lab, such as volunteerism 
(Clary & Snyder, 1991)? 


Proliferating Predictors and Predictions 


Since 1970, proposed predictors of prosocial behavior have 
proliferated well beyond the initial dichotomy between 
dispositional and situational factors. Krebs and Miller (1985) 
presented an interlocking three-tier classification. Most distal 
from the specific prosocial behavior are biological and cul- 
tural predictors (see also Fiske, 1992). These predictors com- 
bine to produce enduring dispositional characteristics, which 
are more proximal. Dispositional factors then combine with 
situational factors to produce cognitive and affective reac- 
tions, which are considered the most proximal predictors of 
prosocial behavior. Within each of these broad classes, nu- 
merous specific variables can be identified. 

In additions to proliferating predictors, there are also many 
different forms of prosocial behavior to be predicted, and the 
variables that predict one form may not predict another. For 
example, within the domain of helping are rescuing, donating, 
assisting, volunteering, and giving social support (Pearce & 
Amato, 1980). Moreover, each of these categories includes a 
wide range of specific behaviors. One can assist by holding a 
door, answering a request for directions, splinting a broken 
leg at the scene of an automobile accident, securing false pa- 
pers for a Jew in Nazi Europe, or enabling a suicide. One can 
volunteer to serve on the board of directors for the local sym- 
phony, to call potential blood donors, to be a buddy for some- 
one who has AIDS, or to join the rescue squad. Critics 
claim—and research supports the claim (Levine, Martinez, 
Brase, & Sorenson, 1994; Omoto & Snyder, 1995)—that 
variables accounting for variance in one form of prosocial be- 
havior in one setting are not likely to account for the same 
amount of variance (if any) in other forms of behavior or in 
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other settings. Talk of prediction based on interactions among 
person, situation, and behavior has become common (e.g., 
Bandura, 1991; Carlo et al., 1991). 

One need not pursue this logic very far—adding predic- 
tors, behaviors to be predicted, situations in which prediction 
can be made, and populations for which predictions can be 
made—to realize that a general variance-accounted-for an- 
swer to the question of why people act prosocially is impos- 
sible. All one can hope for is the identification of predictors 
that account for a specific prosocial behavior in a specific 
situation for a specific population at a specific time (Snyder, 
1993). Although useful to address some applied questions, 
such research is apt to become ideographic rather than nomo- 
thetic (Allport, 1961), with very little generalizability. 


APPLICATION AND EXTENSION 
OF EXISTING THEORY 


Well aware of the limited, ad hoc nature of a variance- 
accounted-for approach, Lewin (1951) reminded us, “There 
is nothing so practical as a good theory” (p. 169). In opposi- 
tion to the Aristotelian approach to science that guides the 
variance-accounted-for strategy, in which the scientist’s goal 
is to identify essential features to predict outcomes, Lewin 
advocated a Galilean approach. Galileo’s goal was to identify 
underlying genotypic (conditional-genetic) constructs and 
the highly general—even universal—relations among them 
that account for observable phenotypic events. Lewin was 
convinced that explanatory theories developed and tested 
following Galileo are of far more practical value than are 
explanations developed following Aristotle, even though the 
Galilean model relies on contrived laboratory experiments 
rather than on direct, real-world observation. 

Psychologists approaching the study of prosocial behavior 
from Lewin’s Galilean perspective are not likely to look to 
empirical research to identify predictors accounting for the 
most variance. They are likely instead to look to existing the- 
ory about genotypic psychological processes, using research 
to illustrate and document the relevance of these processes to 
understanding prosocial behavior. At least seven broad theo- 
retical perspectives have been applied in this way: social 
learning, tension reduction, norms and roles, exchange or 
equity, attribution, esteem enhancement/maintenance, and 
moral reasoning. Let us briefly consider each of these. 


Social Learning 


Social learning theory suggests that if you want to know 
why people act prosocially, you should consider their learn- 
ing history. You should consider not only the rewards and 
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punishments received following helping (or not), but also the 
relative rewards—the benefits minus the costs. You should 
consider observational learning or modeling that comes from 
watching the actions of others. You should consider self- 
rewards. Much research has supported a social learning expla- 
nation of prosocial behavior (for reviews, see Bandura, 1977; 
Rushton, 1980). Integrating and coordinating social learning 
principles, Cialdini, Baumann, and Kenrick (1981) proposed a 
three-step developmental sequence: (a) In the young child 
prosocial behavior is a product of material rewards and pun- 
ishments; (b) in the preadolescent it is a product of social as 
well as material rewards and punishments; and (c) in the ado- 
lescent and adult it is a product of internalized self-reward, as 
well as social and material rewards and punishments. 


Mood Effects 


Building on the idea that helping can be a basis for self- 
reward, Cialdini, Darby, and Vincent (1973) proposed a 
negative-state relief hypothesis: that adults are more likely to 
help when they feel bad. The reason is that adults have 
learned that they can reward themselves for helping and so 
feel better. 

Not only does helping have reward value for people who 
feel bad, but it also seems rewarding for people who feel 
good. Indeed, the effect is even clearer for good mood. 
Across a range of studies (e.g., Isen & Levin, 1972; Weyant, 
1978), people induced to feel good have been more likely to 
give help to good causes. 

What accounts for this pervasive reward value of helping 
for people in a good mood? One possibility is a desire to 
maintain the good mood. Seeing another person in need can 
throw a wet blanket on a good mood, so one may help in 
order to shed this blanket and maintain the mood (Wegener & 
Petty, 1994). Isen, Shalker, Clark, and Karp (1978) suggested 
a second possibility: Being in a good mood may bias one’s 
memories about and attention to the positive and negative as- 
pects of various activities, including helping. When in a good 
mood, a person is more likely to recall and attend to positive 
rather than negative aspects of life. Applied to helping, a 
good mood makes people more likely to remember and 
attend to the positive, rewarding features and less likely to 
attend to the negative features, such as the costs involved. 


General Assessment 


Social learning theory finds itself in an awkward position in 
contemporary social psychology. There seems little doubt 
that the theory is in large measure correct. However, perhaps 
because of its relatively straightforward explanation of 


behavior, without the ironic twists and the revelations of sub- 
tle faux pas for which cognitive explanations have become 
renown, social learning theory generates little excitement. 
The direct focus on behavior and reinforcement history 
seems almost unpsychological in its lack of nuance. Even 
with the added emphasis on self-reward, cognitive represen- 
tation, self-regulation, and reciprocal determinism (Bandura, 
1977, 1991), social learning theory seems bland. Still, were 
one forced to choose a single theory to explain why people 
do—and do not—act prosocially, social learning theory 
should almost certainly be the choice. “As Einstein has em- 
phasized, the goal is to account for the most facts with the 
fewest principles” (Dollard & Miller, 1950, p. 6). Social 
learning theory has probably come closer to this goal than has 
any other theory in the history of social psychology. 


Tension Reduction 


Tension reduction has long been a popular explanation of 
why people help others in need, especially others in obvious 
pain or distress. The general idea is that people find it upset- 
ting to see another person suffer and that preferring not to be 
upset, they relieve the other’s suffering. 

Perhaps the best way to describe the relationship between 
tension reduction, which is a form of motivation, and social 
learning is to say that they are related by marriage. Social 
learning can exist without tension reduction, as in the pure 
operant theories descendant from Watson and Skinner. 
Tension reduction can exist without social learning, as in 
reactions to pain, extreme temperatures, hunger, thirst, and 
other physiological needs. Yet social learning and tension 
reduction lived together for many years in relative harmony, 
housed within Hull’s (1943) general learning theory and its 
descendants, including Dollard and Miller’s (1950) version 
of social learning theory. In response to the current cognitive 
zeitgeist, social learning theory has of late been less attached 
to tension reduction, showing more interest in cognitive 
processes (Bandura, 1977, 1991). Whether this philandering 
is grounds for divorce is hard to say. In any case, tension 
reduction has also been seen stepping out without operant 
processes by its side, most notably in dissonance theory—at 
least as originally conceived by Festinger (1957). 

Why should the suffering of others upset someone? Most 
straightforward is the answer proposed by J. A. Piliavin et al. 
(1981), among others. They suggested that witnessing an- 
other’s distress evokes vicarious distress that has much the 
same character as the victim’s distress, and the witness is mo- 
tivated to escape his or her own distress. One way to escape 
is to help because helping terminates the stimulus causing the 
distress. Of course, running away may enable the witness to 


escape just as well and at less cost, as long as the old adage 
“out of sight, out of mind” works. 

Variations on the theme of aversive-arousal reduction have 
been provided by Hornstein (1982), Reykowski (1982), and 
Lerner (1982). Focusing on the self-other relationship, 
Hornstein suggested that when certain others are in need— 
specifically, those whom one cognitively links to self as “us” 
and “we” rather than “them” and “they’”—one experiences a 
state of promotive tension in which one is “aroused by 
another’s needs almost as if they were one’s own” (Hornstein, 
1982, p. 230). Once so aroused, one is motivated to reduce this 
tension by aiding the fellow “we-grouper.” 

Reykowski’s (1982) proposed explanation, though quite 
different, also involves reduction of aversive tension: “The 
sheer discrepancy between information about the real or pos- 
sible state of an object and standards of its normal or desir- 
able state will evoke motivation” (p. 361). Reykowski 
applied this general principle to prosocial motivation as 
follows: If a person perceives a discrepancy between the cur- 
rent state and the expected or ideal state of another person 
(i.e., perceives the other to be in need), cognitive inconsis- 
tency and motivation to reduce this aversive inconsistency 
will result. Relieving the other’s need is one way to remove 
the inconsistency and escape the situation. Another, less 
prosocial way is to change one’s perception and decide that 
the other’s suffering is acceptable, even desirable. 

Lerner’s (1980, 1982) just-world hypothesis led him to an 
explanation similar to but more specific than Reykowski’s. 
Lerner suggested that most people believe in a just world—a 
world in which people get what they deserve and deserve 
what they get. The existence of a victim of innocent suffering 
is inconsistent with this belief. In order to reduce the arousal 
produced by this inconsistency, a person may help another in 
need. Alternatively, the person may derogate the innocent 
victim, making the suffering appear deserved. 

At first glance, Cialdini’s negative-state relief model may 
appear to be another example of aversive-arousal reduction. 
In fact, it is not. Although it too begins with the proposition 
that seeing someone in need evokes a negative affective state, 
from this common starting point the two explanations 
diverge. The negative-state relief explanation claims that the 
goal of helping is to obtain mood-enhancing self-rewards that 
one has learned are associated with helping; aversive-arousal 
reduction explanations claim that the goal of helping is to 
eliminate the mood-depressing stimulus. Negative-state re- 
lief is a social learning explanation that assumes that the in- 
creased need for some type—any type—of mood-enhancing 
reward motivates helping; aversive-arousal reduction expla- 
nations make no assumptions about prior learning history but 
focus instead on reduction of current tension. 
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Norms and Roles 


Theories that seek to explain prosocial behavior in terms of 
norms and roles often make heavy use of social learning prin- 
ciples. Yet norm and role theories are not direct descendants 
of classic learning theory and behaviorism. Instead, they 
trace their ancestry to symbolic interactionism and its 
analysis of social behavior using a dramaturgical metaphor 
(cf. Goffman, 1959; Mead, 1934). Within this metaphor, 
norms provide the script of the social drama, specifying what 
should be done and said when; roles are the parts to be 
played. (More formally, norms are a group’s written or un- 
written rules of appropriate behavior for those occupying 
particular roles; roles are behavior patterns that are character- 
istic, and expected, of a person who occupies a particular 
position in a social structure.) 

In both developmental and social psychology, norms and 
roles have been adopted into the social learning family; it is 
assumed that people learn the norms and roles appropriate to 
a given situation through social reinforcement and modeling. 
At the same time that people are learning that acting proso- 
cially can bring rewards, they are also learning the norms for 
prosocial behaviors that should be performed by individuals 
in various roles in different social situations. These norms 
dictate that one should help people in need—at least some 
people under some circumstances—to avoid social or self- 
administered sanctions. 


Reciprocity 


One prosocial norm that has been studied extensively is reci- 
procity. Gouldner (1960) suggested that this norm tells people 
both that they should help people who help them and that they 
should not injure these people. He believed that this norm was 
universal, an important part of the moral code of every culture. 
He also believed that the pressure on a person to comply with 
the norm of reciprocity depends on the circumstances under 
which the initial help was given—including (a) how badly one 
needed help, (b) one’s perception of how much the other 
person gave relative to his or her total resources, (c) one’s 
perception of the other person’s motives for helping (was it a 
bribe?), and (d) whether the other person helped voluntarily or 
was pressured into it. Much evidence supports the claim that 
people are motivated to comply with the norm of reciprocity 
(e.g., Wilke & Lanzetta, 1982). 


Social Responsibility 


A second norm that psychologists have suggested motivates 
helping is social responsibility. This norm dictates that one 
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person should help another in need when the latter is depen- 
dent on the former—that is, when others are not available to 
help and thus the second person is counting specifically on 
the first. Although this norm does seem to exist, its effect on 
helping has been surprisingly difficult to demonstrate. After 
more than a decade of research attempts to do so, Berkowitz 
(1972) concluded, “The findings do not provide any clear-cut 
support for the normative analysis of help-giving. . . . The 
potency of the conjectured ‘social responsibility norm’ was 
greatly exaggerated” (pp. 68, 77). 

Why has evidence that the norm of social responsibility 
leads to prosocial behavior been so elusive? Darley and 
Latané (1970) suggested that this norm may be at once too 
general and too specific. The norm may be too general in that 
everyone in our society adheres to it. If this is true, it cannot 
account for why one person helps and another does not. On the 
other hand, the norm may be too specific in that it comes with 
a complex pattern of exceptions, situations in which an indi- 
vidual may feel exempt from acting in accordance with the 
norm. The norm may be characterized not simply by arule that 
says, “If someone is dependent on you for help, then help,” but 
by amore complex rule that says, “If someone is dependent on 
you for help, then help, except when . . .” There may be indi- 
vidual differences in readiness to accept exceptions—that is, 
to deny responsibility (Schwartz, 1977). Moreover, excep- 
tions may vary for individuals in different roles and in differ- 
ent social situations. One advantage of remembering the 
dramaturgical roots of the concept of norms is that it makes 
explicit their role specificity. 

Darley and Latané (1970) also pointed out that in addition 
to norms for helping, there are norms for not helping. A per- 
son may be taught, “Help those in need,” and at the same time, 
“Mind your own business.” Which norm is the one to follow? 
If the former, one may help; if the latter, probably not. 


Effects of Race and Sex 


Exceptions to and conflicts among norms may account for the 
highly inconsistent effects on prosocial behavior of demo- 
graphic variables such as race and sex. It has sometimes been 
found that same-race helping is more frequent (e.g., Gaertner & 
Bickman, 1971), sometimes that cross-race helping is more 
frequent (Katz, Cohen, & Glass, 1975), and sometimes that the 
race of the victim or helper makes no difference (Wispé & 
Freshley, 1971). Similarly, sometimes men help more than 
women (West, Whitney, & Schnedler, 1975), sometimes 
women help more than men (Wegner & Crano, 1975), and 
sometimes the sex of the helper makes no difference (J. A. 
Piliavin & Piliavin, 1972). It does appear, however, that women 
are generally more likely to be helped than are men (Gruder & 
Cook, 1971). 


How can we account for these seemingly contradictory 
findings? One possibility is that given their different social 
roles in different situations, Blacks and Whites—and men 
and women—may feel more or less obligated to help a 
dependent other. For example, Black students on a predomi- 
nantly White campus, acutely aware of their minority status, 
may feel strong responsibility for helping a fellow Black stu- 
dent but very little responsibility for helping a White student; 
White students may be more likely to help a Black student 
when failure to do so clearly violates norms proscribing 
racial prejudice. Helping may be more normative for men 
than for women in one situation—for example, intervening in 
a potentially dangerous emergency. Helping may be more 
normative for women than for men in another situation—for 
example, providing sympathy and support after a friend’s 
breakup with her fiancé (Eagly & Crowley, 1986). A role- 
sensitive normative analysis renders the apparent inconsis- 
tencies comprehensible. 


Norm Salience 


Some researchers have suggested that the problem with 
social norms lies in norm salience and focus of attention. 
Only when attention is focused on the norm as a standard for 
behavior is concern about violating it likely to affect behav- 
ior (Cialdini, Kallgren, & Reno, 1991). Consistent with this 
suggestion, Gibbons and Wicklund (1982) found that if nor- 
mative standards of helpfulness were salient and thus a focus 
of attention, then focusing on oneself increased helping. 
Presumably, being self-focused when the norm was salient 
highlighted the threat of sanctions for failing to act in line 
with personal standards. In the absence of salient standards 
for helpfulness, however, self-focus led to less helping; 
it seemed to inhibit attention to others’ needs (see also 
Karylowski, 1984). 


Personal Norms 


Because broad social norms like social responsibility have 
limited ability to predict whether a person will help, 
Schwartz (1977) proposed a change of focus in thinking 
about norms. Rather than thinking about social norms, 
Schwartz suggested that we should think of more specific, 
personal norms. By personal norms he meant internalized 
rules of conduct that are socially learned, that vary among in- 
dividuals within the same society, and that direct behavior in 
particular situations. 

Applied to helping, a personal norm involves a sense of 
obligation to perform a specific helping act. For example, 
people may say (either publicly or to themselves), “I ought to 
give a pint of blood in the blood drive.” Such statements 


appear to be far more predictive of whether a person will give 
blood than are statements of agreement with broad social 
norms like the norm of social responsibility—at least if the 
person in question is one who believes in acting responsibly 
(Schwartz & Howard, 1981). Specific statements like this are 
particularly powerful as predictors when one also takes into 
account extenuating circumstances, such as whether an indi- 
vidual was in town during the blood drive, had no major 
scheduling conflicts, and was physically able to give blood 
(Zuckerman & Reis, 1978). At this level of specificity, how- 
ever, it is not clear whether the statement about giving blood 
reflects a sense of personal obligation stemming from an in- 
ternalized rule of conduct (i.e., a personal norm) or simply an 
intention to act in a particular way. 


Exchange or Equity 


Perhaps the most direct extension of social learning princi- 
ples into interpersonal relations is exchange or equity theory. 
When developing exchange theory, Homans (1961) explic- 
itly and proudly declared his agenda to be the reduction of 
social relations—including cooperation, helping, and other 
prosocial behaviors—to reinforcement principles operating 
within the individual. Equity theorists were not so reduction- 
ist. They considered social relations to have emergent prop- 
erties that were irreducible to the benefits and costs for the 
individuals involved. In their view, social learning teaches 
one to value equitable relations, in which the ratio of out- 
comes to inputs is equal for the relating individuals. Walster, 
Berscheid, and Walster (1973) claimed that equity theory was 
a general theory that subsumed social learning theory (and 
psychoanalytic theory). Although this may seem a myopic 
inversion, equity theory does add an important dimension to 
the understanding of prosocial behavior by introducing both 
social comparison and distributive justice. Needs and bene- 
fits are no longer defined by looking at the individual alone; 
the definition is broadened to include needs based on relative 
deprivation (Adams, 1965). 

Homans (1961) pointed out that if a recipient of help can- 
not return the favor in a tangible way, then he or she must re- 
turn esteem and deference. Otherwise, the relationship will 
not remain beneficial to both parties and thus will not con- 
tinue. Walster et al. (1973) argued that not only the relatively 
underbenefited but also the relatively overbenefited are moti- 
vated to restore equity (although they acknowledged that 
inequity in one’s favor is more tolerable than the reverse). 
Acting prosocially to redistribute resources more fairly is one 
way to restore equity—but only one. Equity may also be re- 
stored psychologically by enhancing the perceived inputs of 
the advantaged or devaluing the inputs of the disadvantaged, 
thereby justifying the difference in outcomes. 
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Attribution 


Attribution theory concerns inferences drawn about the 
causes of events (Heider, 1958; Jones & Davis, 1965). Attri- 
butions can affect prosocial behavior in two major ways. 
First, attributions about why a person is in need are made not 
only by potential helpers and bystanders but also by the per- 
son in need, with consequences for each. Second, attributions 
about the character of a person who helps are made not only 
by the helpers themselves but also by the persons helped, 
again with consequences for each. 


Attributing the Cause of Others’ Needs 


People are far more likely to help innocent victims than to 
help those who bring their troubles on themselves (Weiner, 
1980). Although this relationship is no surprise, the reason 
for it is not entirely clear. Perhaps causing one’s own need (or 
not working to prevent it) violates ingrained standards for 
self-sufficiency and prudence; perhaps causing one’s own 
need but not suffering the consequences violates our sense of 
justice; perhaps it seems inequitable to those who perceive 
themselves to have exerted effort to avoid need. In any case, 
people are less likely to help those who bring their troubles 
on themselves, even though the explanation for this behavior 
has never been carefully explored. 


Attributing the Cause of One’s Own Need 


People in need may be predisposed to attribute their need to 
situational causes, as something thrust upon them by un- 
avoidable circumstances and carrying no implications about 
personal ability or worth. This attribution may, however, be 
hard to sustain when the need is produced by failure on a task 
that one expected to perform successfully, especially when 
comparable peers succeed (Fisher, Nadler, & Whitcher- 
Alagna, 1982). To avoid an esteem-damaging dispositional 
attribution, the person in need may attempt to deny the failure 
and not seek or appreciate help (Nadler, 1991). 


Attributing the Cause of Help 


Helpers make attributions about the nature and cause not only 
of others’ needs but also of their own helping. A helper may 
ask, “Why did I help in this situation?” Possible answers in- 
clude the following: (a) because I am a kind, caring, helpful 
person—a dispositional attribution likely to be self-rewarding 
and encourage one to help in a range of situations in the 
future; (b) because I am the kind of person who helps in this 
particular situation (e.g., I am a blood donor; J. A. Piliavin, 
Callero, & Evans, 1982)—a dispositional attribution likely to 
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encourage one to help again in this situation; (c) because of 
situational pressure—a situational attribution not likely to in- 
crease helping in the future, at least not when situational pres- 
sure is absent; and (d) because I am a compliant schnook and 
a pushover who cannot say no—a dispositional attribution 
likely to be self-punishing and to discourage future helping. 
Grusec (1991) traced the development and demonstrated the 
prosocial benefits of children attributing their helping to a 
broad disposition to be helpful. 

An attributional analysis suggests a complicating limit on 
the effects of social learning. To the extent that subsequent 
helping is mediated by self-attributions of helpfulness, induc- 
ing help by providing material or social rewards in the form 
of incentives or salient models, norms, and so on may actu- 
ally diminish rather than increase subsequent helping, much 
as providing extrinsic incentives can diminish activity based 
on intrinsic motivation (Lepper, Greene, & Nisbett, 1973). 
Consistent with this possibility, research suggests that provid- 
ing incentives—whether money, models, or norms—reduces 
self-perceived altruism following helping (e.g., Thomas, 
Batson, & Coke, 1981). 

These results reveal a dilemma. One important source of 
motivation to help, the external reward that comes from pay- 
ment or praise for helping, actually undermines a second 
important source of motivation to help, the self-reward that 
comes from seeing oneself as a good, kind, caring person. 
Consider the long-term consequences. As self-reward is un- 
dermined, additional external pressure may be necessary to 
coerce the person to help. This additional external pressure 
further erodes the helper’s chances for self-reward. Over 
time, the result may be a slide toward a more and more cyni- 
cal self-concept, in which personal kindness plays an increas- 
ingly minor role and help is offered only for a price. 

The person helped is also likely to make attributions about 
why the helper acted. The most obvious and most frequently 
studied attributions for helping are that the helper acted 
(a) out of concern, with no strings attached, or (b) in order to 
indebt, control, or demean the recipient. Attributions of the 
second kind may be especially problematic when made by re- 
cipients of international aid. Research by Greenberg and his 
colleagues (e.g., Greenberg & Frisch, 1972) demonstrated, as 
expected, that aid is not appreciated to the degree that it is 
perceived as an attempt to control. In return, the benefactor 
is likely to receive hostility rather than gratitude (Tesser, 
Gatewood, & Driver, 1968). 


Esteem Enhancement/Maintenance 


Models of esteem enhancement/maintenance have been both 
popular and numerous in social psychology since about 1980. 


As an explanation for prosocial behavior, these models gen- 
erally assume that people act prosocially to enhance or re- 
cover self-esteem (Brown & Smart, 1991). 

One might expect perceptions of the esteem-enhancing 
potential of helping to follow the same three-step devel- 
opmental sequence outlined by Cialdini et al. (1981). For 
the young child, gaining material rewards for doing good en- 
hances esteem; for the middle child, social approval enhances 
esteem; by adolescence, self-directed and uncoerced—even 
anonymous—help may be necessary to feel good about 
oneself. 

Not only benefactors, but also recipients, may act and 
react with an eye to their self-esteem. Fisher et al. (1982) pro- 
posed an esteem-loss explanation for recipients’ negative 
reactions to receiving aid. Consistent with the comparative 
aspects of self-esteem, Nadler, Fisher, and Ben-Itzhak (1983) 
found that when individuals were having trouble on a task 
that reflected on their abilities, receipt of help from a friend 
produced more negative self-evaluation than did receipt of 
help from a stranger. 

DePaulo, Nadler, and Fisher (1983) pointed out that con- 
cern over loss of esteem both in others’ and in one’s own eyes 
may go a long way toward explaining reticence to seek help 
when in need. To seek help is to admit that you lack the com- 
petence, knowledge, or other valuable resources necessary to 
cope and, moreover, that the person from whom you seek 
help has these resources. Consistent with this analysis, peo- 
ple are less likely to seek help to the degree that they hold 
themselves in high esteem and do not anticipate a chance to 
reciprocate the help (Nadler, 1991). 

This analysis must be qualified by roles and norms, how- 
ever. For the young child, seeking help from his or her par- 
ents is not likely to be upsetting or damaging to self-esteem. 
For a middle-level executive who finds himself out of a job, 
the thought of applying for welfare assistance to feed his 
family may be devastating. 


Moral Reasoning 


Moral reasoning theories (also called cognitive developmen- 
tal or rational developmental theories of morality) build on 
the classic work of Piaget. Typically, they accept his account 
of intellectual development as a process of adaptation 
through assimilation and accommodation proceeding in an 
invariant developmental sequence from sensorimotor to pre- 
operational to concrete operational to formal operational 
thought (Piaget, 1926). They also accept Piaget’s (1932) 
application of this model of intellectual development to moral 
judgment. Moral reasoning theories, of which Kohlberg’s 
(1976) is the best known, treat situations in which one person 


might act to benefit another as problems or puzzles to be 
solved, much like the problems in volume conservation that 
Piaget gave his children. The key to prosocial action is the 
level of moral reasoning used to solve the puzzle or dilemma. 
In Kohlberg’s (1976) words, “To act in a morally high way 
requires a high stage of moral reasoning. . . . Moral stage is a 
good predictor of action” (p. 32). 

Kohlberg claimed to have identified a universal and in- 
variant sequence of six stages in moral reasoning, grouped in 
pairs into three levels: (a) preconventional (judgment based 
on immediate consequences for self), (b) conventional (judg- 
ment based on social norms, rules, and laws), and (c) post- 
conventional (judgment based on universal moral principles 
that at once transcend and undergird the moral conventions of 
society). The moral principle that Kohlberg considered most 
important was a neo-Kantian principle of justice whereby 
each individual is accorded equal rights and dignity in a 
Kingdom of Ends. 

Controversy has surrounded moral reasoning theories 
from the start. First, evidence that moral reasoning develops 
universally in the invariant sequence of stages that Kohlberg 
described is equivocal at best (Kurtines & Greif, 1974). Sec- 
ond, the link between level of moral reasoning and prosocial 
behavior is far less clear than one might expect (Blasi, 1980; 
Eisenberg, 1991). In defense, supporters of moral reasoning 
models have pointed out that (a) adequate measurement of 
moral reasoning is difficult and (b) in almost any moral 
dilemma one may justify a given course of action in different 
ways, using different levels of moral reasoning. Both points 
seem true, but they reduce the explanatory power of moral 
reasoning theories, casting doubt on Kohlberg’s claim that 
moral stage is a good predictor of prosocial action. Modified 
models of moral reasoning that incorporate social learning 
principles offer better explanatory power (e.g., Eisenberg, 
1986). One must ask of these models, however, whether the 
social learning principles do all the explanatory work. 

In addition to being challenged from outside by re- 
searchers who question the value of moral reasoning as a suf- 
ficient or even necessary explanation of prosocial behavior, 
Kohlberg’s focus on justice as the capstone of moral maturity 
has been challenged from inside the moral-reasoning camp. 
The most notable challenge has come from his former student 
and colleague Carol Gilligan. In addition to an ethic of justice 
and fairness, Gilligan (1982) called for recognition of an 
ethic of care. Although she believed that both men and 
women display reasoning based on justice and reasoning 
based on care, she claimed that the former is more character- 
istic of men and the latter more characteristic of women. She 
also claimed that Kohlberg’s exclusive focus on justice led 
to a perception that men are superior to women in moral 
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reasoning. Finally, she claimed that this apparent superiority 
will disappear if one listens to the moral voice of women, 
who speak more of care than of justice. 

Evidence for the claimed sex difference in use of per- 
spectives of justice and care has been limited and weak 
(Walker, 1991). But research has supported Gilligan’s claim 
that moral dilemmas can be approached from a perspective 
of care rather than justice (Gilligan, Ward, & Taylor, 1988; 
Walker, 1991). It remains unclear, however, what a care per- 
spective is. Is it (a) a reflection of Kohlberg’s conventional 
stage of morality, (b) an alternative mode of moral reasoning 
with its own developmental sequence, or (c) not a form of 
moral reasoning at all but an emotional reaction or bond? 
In sum, although the distinction between justice and care 
seems to have value, considerably more conceptual preci- 
sion is needed to know the nature and significance of this 
distinction. 


Amalgamated Models 


One need not rely on just one of these seven theoretical per- 
spectives to explain prosocial behavior. It is possible to in- 
voke more than one in a given situation or to invoke one in 
one situation and another in a different situation. It is also 
possible to combine perspectives into an amalgamated 
model. Sometimes, such an amalgamation has been created 
by the integration of different theoretical perspectives (e.g., 
social learning and norm theories); more often, it has resulted 
from arranging perspectives in sequence, adding boxes and 
arrows to a flowchart of steps that lead ultimately to prosocial 
behavior. The impetus for creating amalgamated models 
seems to be the desire to be comprehensive, a desire that 
stems from the same aspirations for prediction and control 
that underlie the more ad hoc variance-accounted-for ap- 
proach. But in amalgamated models, this desire takes advan- 
tage of existing theories to pull together and organize a range 
of explanations. 

Perhaps the best known and most enduring amalgamated 
model is the arousal/cost-reward model originally proposed 
by I. M. Piliavin, Rodin, and Piliavin (1969) and devel- 
oped and elaborated by J. A. Piliavin et al. (1981), Dovidio 
(1984), and Dovidio, Piliavin, Gaertner, Schroeder, and 
Clark (1991). Originally, this model combined a tension- 
reduction motivational component with a cost-reward assess- 
ment of the various behavioral means to reduce the tension. 
Over the years, norms, equity concerns, and attribution 
processes have been incorporated as well, producing a flow- 
chart with 8 boxes and 17 arrows that is too complex to 
describe here. Other amalgamated models include those de- 
veloped by Bar-Tal (1982), who relies most heavily on social 
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learning and moral-reasoning perspectives, and by Schwartz 
(1977), who relies most heavily on norms, especially per- 
sonal norms. 

Amalgamated models make three useful contributions. 
First, they remind us of the complexity of prosocial behavior 
and thereby caution against simplistic explanations. Second, 
they sketch a causal ordering of the various psychological 
processes assumed to be operating—although these orderings 
are rarely tested. Third, they provide a mnemonic for relevant 
psychological processes and theoretical perspectives. 

Amalgamated models have potential liabilities too. First, 
the desire to be comprehensive exerts pressure toward prolif- 
eration of boxes and arrows. As more intervening steps are 
added and more arrows are drawn, multiple paths connect 
postulated antecedent and consequent variables. This makes 
achieving clear causal prediction increasingly difficult. The 
models become less explanatory and more purely descrip- 
tive. It seems to be a general and ironic rule in science that the 
greater the number of different explanatory models com- 
bined, the less the resulting explanatory power (recall 
Einstein’s admonition to account for the most facts with the 
fewest principles). 

Second, having accepted the goal of making an amalga- 
mated model comprehensive, one can expend much energy 
trying to make anomalous data fit. The breadth and complex- 
ity of these models make success almost inevitable. With 
effort, data can be made to fit even when they do not. The 
consequence is that opportunities for new insight and under- 
standing are lost—or at least discouraged. This is a very seri- 
ous liability if, as we wish to suggest in the next section of 
this chapter, the anomalous aspects of prosocial behavior are 
what have contributed the most to psychology. 

The seven perspectives reviewed thus far reveal the scope 
and power of existing psychological theory available to 
explain why people act prosocially. Yet in spite of this scope 
and power, these existing theories sometimes seem inade- 
quate. Even after hearing their explanations, one may ex- 
perience a nagging sense of “yes, but” when faced with 
a dramatic display of concern for another’s welfare—or a 
dramatic display of callousness. Such displays have long 
intrigued and puzzled not only psychologists but also philoso- 
phers and other behavioral and social scientists. They call for 
a rethinking of our existing theories about why people do and 
do not act prosocially, even a rethinking of our assumptions 
about human nature. 

By attending to these anomalies, researchers have ex- 
tended and altered our theories of social motivation. Attempts 
to explain prosocial anomalies have not caused a total rewrite 
of our theories, of course, but they have caused some rewrit- 
ing, and likely there will be more. 


ANOMALOUS FAILURES TO ACT PROSOCIALLY 


The anomalous aspects of prosocial behavior have been of 
particular interest to those concerned with the theoretical 
rather than practical implications of why people do—and 
don’t—act prosocially. At times, a failure to act prosocially 
can be baffling. How can individuals who were raised in car- 
ing and nurturing homes, whose parents rewarded them for 
showing concern, who become upset when they hear about 
suffering in remote corners of the world, who have a well- 
developed sense of duty, justice, and social responsibility, 
and who are highly sensitive to how they look in others’ eyes 
as well as in their own fail to respond to the needs of others, 
even when it would cost little to do so? Given all the pressure 
that society brings to bear, failures to act prosocially can 
seem quite anomalous, almost amazing. Yet they happen. 

Let’s return to the murder of Kitty Genovese. At the time, 
explanations bandied about in the media focused on the 
breakdown in modern urban society of moral fiber, social 
norms, and sense of community. Her death was said to be a 
product of apathy, alienation, anomie, and angst. 


Effect of Others on Decisions Under Pressure 


Bibb Latané and John Darley (1970) came up with an inge- 
nious alternative to these dispositional explanations. Their 
explanation was based in part on existing psychological the- 
ory and in part on new theoretical insights. They observed 
that once we notice a possible emergency situation, we must 
make several decisions in order to help. We must decide that 
an emergency exists, that it is our personal responsibility to 
act, and that there is something we can do to help. To compli- 
cate matters, these decisions must be made under pressure; 
emergencies involve threat, ambiguity, urgency, and stress. 
The presence of other bystanders can influence this pressure- 
packed decision sequence at each step, tipping the scales 
toward inaction. 

Is a scream in the night a woman being attacked or harm- 
less high-spirited play? Uncertain, bystanders may turn to 
others present, seeking cues to help them decide. No one 
wishes to appear foolishly excited over an event that is not an 
emergency, so each individual reacts initially with a calm 
outward demeanor, while looking at other people’s reactions. 
Others do the same. No one appears upset, creating a state of 
pluralistic ignorance (Miller & McFarland, 1987). Everyone 
decides that since no one else is upset, the event must not 
be an emergency (Latané & Darley, 1968; Latané & Rodin, 
1969). 

Even if one decides that the situation is an emergency and 
that someone is in dire need of help, the presence of others can 


still discourage action. To explain how, Darley and Laang 
(1968) moved beyond existing theory and proposed a diffu- 
sion of responsibility. If others are available, each individual 
may feel less personal obligation to come forward and help. 
One call to the police is as helpful, if not more helpful, than 
20 calls. In the Kitty Genovese case, her neighbors may have 
seen lights in other windows and assumed that other neigh- 
bors had heard the screams and that someone else had already 
called. Some may have thought, “Something should be done, 
but why should I be the one to do it?” Thoughts like these, 
made possible by awareness of other bystanders without 
knowing what the others are doing, diffuses the responsibility 
to help among all the bystanders present and makes it less 
likely that any one bystander will help. 

Latané and Darley’s (1970) answer to the question of why 
none of the 38 witnesses to the murder of Kitty Genovese 
helped has stood up remarkably well to experimental test (see 
Latané & Nida, 1981). Still, the psychological process that 
underlies diffusion of responsibility remains unclear. Do the 
costs of helping lead to a motivated, optimistic redefinition 
of the situation (“I’m sure someone else has already helped, 
so there is no longer a need”)? Is there a recognition of con- 
tinuing need but denial of personal responsibility, either by 
reasoning that others present are better qualified to act 
(“Somebody’s got to do something, but not me; they’re the 
ones who know what to do”) or shifting from a prescriptive to 
a descriptive norm (“I can’t be blamed; no one else is doing 
anything either”)? Might some people fail to act out of defer- 
ence or modesty (“Ill let someone else be the hero”)? Each 
of these processes involves the effect of others on decision 
making under pressure, and they are often confounded in 
research; yet these processes are distinct. Any or all could 
operate, suggesting that more research is needed. 


Blaming the Victim 


Another important theoretical development stimulated by 
reflection on bystander “apathy” was Melvin Lerner’s (1970, 
1980) just-world hypothesis. The anomaly on which Lerner 
focused was not the failure to help victims of accidents, 
attacks, or other emergencies, but rather the more pervasive 
and pernicious tendency for the haves in society to be unre- 
sponsive to the needs of the have-nots. Lerner observed, as 
did Ryan (1971), that people often not only fail to notice need 
or to show concern for victims, but that they actively dero- 
gate and blame victims. 

To explain this apparent anomaly, Lerner turned to the 
seemingly prosocial principle of justice. He reasoned as fol- 
lows. If children are to delay gratification and pursue long- 
term goals, they must develop a belief that effort brings 
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results. For most of us, this belief in contingency leads in turn 
to a belief in a just world, a sense of appropriateness—that 
people get what they deserve (and deserve what they get)— 
necessary for trust, hope, and confidence in our future. 
Witnessing the suffering of innocent victims violates the be- 
lief in a just world. In order to reduce the discomfort produced 
by this threat, we may help. But there is an alternative: We 
may derogate or blame the victims (if they have less, they 
must deserve less; that is, they must be less deserving). Lerner 
and his associates provided extensive evidence that witness- 
ing an innocent victim suffer can lead to derogation (see 
Lerner, 1980, for a review). The insight that a natural—even 
noble—belief in justice, when carried into an unjust world, 
can itself become a source of injustice has proved major. 


ANOMALOUS PROSOCIAL ACTS 


In the 1960s, heightened social conscience focused attention 
on anomalous failures to act prosocially. In the broader sweep 
of Western thought, this focus is itself anomalous. Through 
the centuries, the puzzle that has intrigued those contemplat- 
ing the human condition has not been why people fail to care 
for others in need; the puzzle has been why people care. 

From Aristotle and Aquinas through Hobbes and Bentham 
to Nietzsche and Freud, the dominant view in Western 
thought has been that people are, at heart, exclusively self- 
interested. Given this view, what explains the enormous ef- 
fort and energy directed toward benefiting others? At times, 
what people do for others can be spectacular. Soldiers have 
thrown themselves on live grenades to protect their com- 
rades. Crews worked around the clock in extreme danger to 
free the trapped victims of the Oklahoma City bombing. 
Firemen died directing others to safety when the World Trade 
Center towers collapsed. Surviving an airline crash, Arland 
Williams lost his life in the icy waters of the Potomac be- 
cause he repeatedly gave others his place in the rescue 
helicopter. Mother Teresa dedicated her life to the dying of 
Calcutta, the poorest of the poor, bringing care and comfort to 
thousands. Rescuers of Jews in Nazi Europe, such as Miep 
Gies (1987), who helped hide Anne Frank and her parents, 
and Oskar Schindler, risked their own lives—and often the 
lives of their loved ones—day after day for months, or even 
years. 

How can we reconcile these actions with a view that 
people are exclusively self-interested? Could some people, to 
some degree, under some circumstances, be capable of hav- 
ing another person’s interest at heart? Is it possible for one 
person to have another person’s welfare as an ultimate goal 
(altruism), or is all helping simply an instrumental means of 
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obtaining one or another form of self-benefit (egoism)? This 
has been called the altruism question (Batson, 1991). 


The Altruism Question 


One easy answer to the altruism question that can quickly be 
laid to rest goes like this: Even if it were possible for a person 
to be motivated to increase another’s welfare, such a person 
would be pleased by attaining this desired goal, so even this 
apparent altruism would be a product of egoism. In the 
words of Tolman’s (1923) well-turned epithet, this argument 
is “more brilliant than cogent” (p. 203). Philosophers have 
shown it to be flawed by pointing out that it involves a confu- 
sion between two different forms of psychological hedonism. 
The strong form of hedonism asserts that the ultimate goal of 
human action is always the attainment of personal pleasure; 
the weak form asserts only that goal attainment always brings 
pleasure. The weak form is not inconsistent with the altruistic 
claim that the ultimate goal of some action is to benefit an- 
other rather than to benefit oneself; the pleasure obtained can 
be a consequence of reaching this goal without being the goal 
itself. The strong form of psychological hedonism is incon- 
sistent with the possibility of altruism, but to affirm this form 
is simply to assert that altruism does not exist, an empirical 
assertion that may or may not be true (see MacIntyre, 1967, 
for discussion of these philosophical arguments). 

More serious advocates of universal egoism argue that 
some specific self-benefit is always the ultimate goal of help- 
ing; benefiting the other is simply an instrumental goal on the 
way to one or another ultimately self-serving end. They point 
to all the self-benefits of helping: the material, social, and self- 
rewards received; the material, social, and self-punishments 
avoided; and aversive-arousal reduction. Advocates of altru- 
ism counter that simply because self-benefits follow from 
benefiting another, this does not prove that the self-benefits 
were the helper’s ultimate goal. These self-benefits may be 
unintended consequences of reaching the ultimate goal of 
benefiting the other. If so, the motivation would be altruistic, 
not egoistic. 

Advocates of altruism claim more than possibility, of 
course. They claim that altruistic motivation exists, that at 
least some people under some circumstances act with the 
ultimate goal of increasing another person’s welfare. 


The Empathy-Altruism Hypothesis 


Over the centuries, the most frequently proposed source of 
altruistic motivation has been an other-oriented emotional 
response congruent with the perceived welfare of another 
person—today usually called empathy (Batson, 1987) or 


sympathy (Wispé, 1986). If another person is in need, these 
empathic emotions include sympathy, compassion, tender- 
ness, and the like. The empathy-altruism hypothesis claims 
that these emotions evoke motivation with an ultimate goal of 
benefiting the person for whom the empathy is felt—that is, 
altruistic motivation. Various forms of this hypothesis have 
been espoused by Thomas Aquinas, David Hume, Adam 
Smith, Charles Darwin, Herbert Spencer, and William 
McDougall, as well as in contemporary psychology by 
Hoffman (1975), Krebs (1975), and Batson (1987). 

Considerable evidence supports the idea that feeling em- 
pathy for a person in need leads to increased helping of that 
person (see Batson, 1991; Eisenberg & Miller, 1987, for re- 
views). Observing an empathy-helping relationship, how- 
ever, tells us nothing about the nature of the motivation that 
underlies this relationship. Increasing the other person’s wel- 
fare could be (a) an ultimate goal, producing self-benefits as 
unintended consequences; (b) an instrumental goal on the 
way to the ultimate goal of gaining one or more self-benefits; 
or (c) both. That is, the motivation could be altruistic, egois- 
tic, or both. 


Egoistic Alternatives to the Empathy-Altruism Hypothesis 


Three general classes of self-benefits can result from helping 
a person for whom one feels empathy. Such help can (a) re- 
duce one’s empathic arousal, which may be experienced as 
aversive; (b) enable one to avoid possible social and self- 
punishments for failing to help; and (c) enable one to gain 
social and self-rewards for doing what is good and right. The 
empathy-altruism hypothesis does not deny that these self- 
benefits of empathy-induced helping exist. It claims that they 
are unintended consequences of the empathically aroused 
helper reaching the ultimate goal of reducing the other’s 
suffering. Proponents of egoistic alternatives to the empathy- 
altruism hypothesis disagree. They claim that one or more of 
these self-benefits are the ultimate goal of empathy-induced 
helping. In the past two decades more than 30 experiments 
have tested these three egoistic alternatives against the 
empathy-altruism hypothesis. 

The most frequently proposed egoistic explanation of the 
empathy-helping relationship is aversive-arousal reduction. 
This explanation claims that feeling empathy for someone 
who is suffering is unpleasant, and empathically aroused in- 
dividuals help in order to benefit themselves by eliminating 
their empathic feelings. Benefiting the victim is simply a 
means to this self-serving end. 

Over half a dozen experiments have tested the aversive- 
arousal reduction explanation against the empathy-altruism 
hypothesis by varying the ease of escape from further exposure 


to the empathy-evoking need without helping. Because em- 
pathic arousal is a result of witnessing the need, either termi- 
nating this need by helping or terminating exposure to it by 
escaping should reduce one’s own empathic arousal. Escape 
does not, however, enable one to reach the altruistic goal of 
relieving the victim’s need. Therefore, the aversive-arousal 
explanation predicts elimination of the empathy-helping rela- 
tionship when escape is easy; the empathy-altruism hypothesis 
does not. Results of these experiments have consistently 
patterned as predicted by the empathy-altruism hypothesis 
and not by the aversive-arousal reduction explanation, casting 
doubt on this popular egoistic account (see Batson, 1991, fora 
review). 

A second egoistic explanation invokes empathy-specific 
punishment. It claims that people learn through socialization 
that additional obligation to help, and thus additional shame 
and guilt for failure to help, is attendant on feeling empathy for 
someone in need. As a result, when people feel empathy, they 
are faced with impending social or self-censure beyond any 
general punishment associated with not helping. They say to 
themselves, “What will others think—or what will I think of 
myself—if I don’t help when I feel like this?” and then they 
help out of an egoistic desire to avoid these empathy-specific 
punishments. Once again, experiments designed to test this 
explanation have failed to support it; the results have consis- 
tently supported the empathy-altruism hypothesis instead 
(Batson, 1991). 

The third major egoistic explanation invokes empathy- 
specific reward. It claims that people learn through socializa- 
tion that special rewards in the form of praise and pride are 
attendant on helping a person for whom they feel empathy. 
As a result, when people feel empathy, they think of these 
rewards and help out of an egoistic desire to gain them. 

The general form of this explanation has been tested in 
several experiments and received no support (Batson et al., 
1988, Studies 1 & 5; Batson & Weeks, 1996), but two varia- 
tions have also been proposed. Best known is the negative- 
state relief explanation proposed by Cialdini et al. (1987). 
Cialdini et al. suggested that the empathy experienced when 
witnessing another person’s suffering is a negative affective 
state—a state of temporary sadness or sorrow—and the per- 
son feeling empathy helps in order to gain self-rewards to 
counteract this negative state. 

Although this egoistic alternative received some initial 
support (Cialdini et al., 1987; Schaller & Cialdini, 1988), sub- 
sequent research has revealed that this was likely due to pro- 
cedural artifacts. Experiments avoiding these artifacts have 
instead supported the empathy-altruism hypothesis (Batson 
et al., 1989; Dovidio, Allen, & Schroeder, 1990; Schroeder, 
Dovidio, Sibicky, Matthews, & Allen, 1988). It now seems 
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clear that the motivation to help evoked by empathy is not 
directed toward the egoistic goal of negative-state relief. 

A second interesting variation on an empathy-specific 
reward explanation was proposed by Smith, Keating, and 
Stotland (1989). They claimed that rather than helping to gain 
the rewards of seeing oneself or being seen by others as a 
helpful person, empathically aroused individuals help in order 
to feel joy at the needy individual’s relief: “It is proposed that 
the prospect of empathic joy, conveyed by feedback from the 
help recipient, is essential to the special tendency of empathic 
witnesses to help. . . . The empathically concerned witness to 
the distress of others helps in order to be happy” (Smith et al., 
1989, p. 641). 

Some early self-report data were supportive, but more 
rigorous experimental evidence has failed to support this 
empathic-joy hypothesis. Instead, experimental results have 
once again consistently supported the empathy-altruism 
hypothesis (Batson et al., 1991; Smith et al., 1989). The 
empathic-joy hypothesis, like other versions of the empathy- 
specific reward explanation, seems unable to account for the 
empathy-helping relationship. 


A Tentative Conclusion 


Reviewing the empathy-altruism research, as well as related 
literature in sociology, economics, political science, and biol- 
ogy, J. A. Piliavin and Charng (1990) concluded that 


There appears to be a “paradigm shift” away from the earlier 
position that behavior that appears to be altruistic must, under 
closer scrutiny, be revealed as reflecting egoistic motives. 
Rather, theory and data now being advanced are more compati- 
ble with the view that true altruism—acting with the goal of ben- 
efiting another—does exist and is a part of human nature. (p. 27) 


Pending new evidence or a plausible new egoistic explana- 
tion of the existing evidence, this conclusion seems correct. 
It appears that the empathy-altruism hypothesis should— 
tentatively—be accepted as true. 


Implications of the Empathy-Altruism Hypothesis 


If the empathy-altruism hypothesis is true, the implications 
are wide ranging. Universal egoism—the assumption that all 
human behavior is ultimately directed toward self-benefit— 
has long dominated not only psychology but other social and 
behavioral sciences as well (Campbell, 1975; Mansbridge, 
1990; Wallach & Wallach, 1983). If individuals feeling 
empathy act, at least in part, with an ultimate goal of increas- 
ing the welfare of another, then the assumption of universal 
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egoism must be replaced by a more complex view of motiva- 
tion that allows for altruism as well as egoism. Such a shift in 
our view of motivation requires, in turn, a revision of our 
underlying assumptions about human nature and human 
potential. It implies that we humans may be more social 
than we have thought—that other people can be more to us 
than sources of information, stimulation, and reward as we each 
seek our own welfare. To some degree and under some circum- 
stances, we can care about their welfare as an end in itself. 

The evidence for the empathy-altruism hypothesis also 
forces us to face the question of why empathic feelings exist. 
What evolutionary function do they serve? Admittedly spec- 
ulative, the most plausible answer relates empathic feelings 
to parenting among higher mammals, in which offspring 
live for some time in a very vulnerable state (de Waal, 1996; 
Hoffman, 1981; McDougall, 1908; Zahn-Waxler & Radke- 
Yarrow, 1990). Were parents not intensely interested in the 
welfare of their progeny, these species would quickly die out. 
Empathic feelings for offspring, and the resulting altruistic 
motivation, may promote one’s reproductive potential not by 
increasing the number of offspring but by increasing the 
chance of their survival. 

Clearly, however, empathic feelings extend well beyond 
one’s own children. People can feel empathy for a wide range 
of individuals (including nonhumans) as long as there is no 
preexisting antipathy (Batson, 1991; Krebs, 1975; Shelton & 
Rogers, 1981). From an evolutionary perspective, this exten- 
sion may be attributed to cognitive generalization whereby 
one “adopts” others, making it possible to evoke the primi- 
tive and fundamental impulse to care for progeny when these 
adopted others are in need (Batson, 1987; MacLean, 1973). 
Such cognitive generalization may be possible because of 
(a) human cognitive capacity, including symbolic thought, 
and (b) the lack of evolutionary advantage for sharp discrim- 
ination of empathic feelings in the small hunter-gatherer 
bands of early humans. In these bands, those in need were 
often one’s children or close kin, and one’s own welfare was 
tightly tied to the welfare even of those who were not close 
kin (Hoffman, 1981). 

The empathy-altruism hypothesis also may have wide- 
ranging practical implications. Given the power of empathic 
feelings to evoke altruistic motivation, people may some- 
times suppress or avoid these feelings. Loss of the capacity 
to feel empathy for clients may be a factor, possibly a central 
one, in the experience of burnout among case workers in the 
helping professions (Maslach, 1982). Aware of the extreme 
effort involved in helping or the impossibility of helping 
effectively, these case workers—as well as nurses caring for 
terminal patients, and even pedestrians confronted by the 
homeless—may try to avoid feeling empathy in order to 
avoid the resulting altruistic motivation (Shaw, Batson, & 


Todd, 1994; Stotland, Mathews, Sherman, Hansson, & 
Richardson, 1978). There seems to be, then, egoistic motiva- 
tion to avoid altruistic motivation. 

More positively, experiments have tested the possibility that 
empathy-induced altruism can be used to improve attitudes 
toward stigmatized out-groups. Thus far, results look quite 
encouraging. Inducing empathy has improved racial attitudes, 
as well as attitudes toward people with AIDS, the homeless, 
and even convicted murderers (Batson, Polycarpou, et al., 
1997; Dovidio, Gaertner, & Johnson, 1999). Empathy-induced 
altruism has also been found to increase cooperation in a com- 
petitive situation (a prisoner 's dilemma), even when one knows 
that the person for whom one feels empathy has acted compet- 
itively (Batson & Ahmad, 2001; Batson & Moran, 1999). 


Other Possible Sources of Altruistic Motivation 


Might there be sources of altruistic motivation other than 
empathic emotion? Several have been proposed, including 
an altruistic personality (Oliner & Oliner, 1988), principled 
moral reasoning (Kohlberg, 1976), and internalized pro- 
social values (Staub, 1974). There is some evidence that 
each of these potential sources is associated with increased 
prosocial motivation, but as yet, it is not clear whether this 
motivation is altruistic. It may instead be an instrumental 
means to the egoistic ultimate goals of (a) maintaining one’s 
positive self-concept or (b) avoiding guilt (Batson, 1991; 
Batson, Bolen, Cross, & Neuringer-Benefiel, 1986; Carlo 
et al., 1991; Eisenberg et al., 1989). More and better research 
exploring these possibilities is needed. 


Beyond the Egoism-Altruism Debate: 
Other Prosocial Motives 


Thinking more broadly, beyond the egoism-altruism debate 
that has been the focus of attention and contention for the 
past two decades, might there be other forms of prosocial 
motivation—forms in which the ultimate goal is neither to 
benefit oneself nor to benefit another individual? Two possi- 
bilities seem especially worthy of consideration: collectivism 
and principlism. 


Collectivism: Benefiting a Group 


Collectivism involves motivation to benefit a particular group 
as a whole. The ultimate goal is not to increase one’s own wel- 
fare or the welfare of the specific others who are benefited; the 
ultimate goal is to increase the welfare of the group. Robyn 
Dawes and his colleagues put it succinctly: “Not me or thee 
but we” (Dawes, van de Kragt, & Orbell, 1988). They also 
suggested that collectivist prosocial motivation is a product of 


group identity (Brewer & Kramer, 1986; Tajfel, 1981; Turner, 
1987). 

As with altruism, however, what looks like collectivism 
may actually be a subtle form of egoism. Perhaps attention to 
group welfare is simply an expression of enlightened self- 
interest. After all, if one recognizes that ignoring group needs 
and the common good in a headlong pursuit of self-benefit 
will only lead to less self-benefit in the long run, then one 
may decide to benefit the group as a means to maximize over- 
all self-benefit. Appeals to enlightened self-interest are often 
used by politicians and social activists trying to encourage 
prosocial response to societal needs. They warn of the long- 
term consequences for oneself and one’s children of pollution 
and squandering natural resources. They remind that if the 
plight of the poor becomes too severe, those who are well off 
may face revolution. Such appeals seem to assume that col- 
lectivism is simply a form of egoism. 

The most direct evidence that collectivism is independent 
of egoism comes from research by Dawes, van de Kragt, and 
Orbell (1990). They examined the responses of individuals 
who had been given a choice between allocating money to 
themselves or to a group. Allocation to oneself maximized in- 
dividual but not group profit, whereas allocation to the group 
maximized collective but not individual profit. 

Dawes et al. (1990) found that if individuals faced with 
this dilemma made their allocation after discussing it with 
other members of the group, they gave more to the group 
than if they had no prior discussion. Moreover, this effect 
was specific to the in-group with whom the discussion oc- 
curred; allocation to an out-group was not enhanced. Based 
on this research, Dawes et al. claimed evidence for collec- 
tivist motivation independent of egoism, arguing that their 
procedure ruled out the two most plausible egoistic explana- 
tions: (a) enlightened self-interest (by having no future con- 
tact and only one allocation round) and (b) socially instilled 
conscience (a norm to share, if evoked, should increase shar- 
ing with the out-group as well as the in-group). There is rea- 
son to doubt, however, that their procedure effectively ruled 
out self-rewards and self-punishments associated with con- 
science. The research on norms reviewed earlier suggests 
that norms can be more refined than Dawes and his cowork- 
ers allowed. We may have a norm that says “share with 
your buddies” rather than a norm that simply says “share.” 
So, although this research is important and suggestive, more 
and better evidence is needed to justify the conclusion that 
collectivist prosocial motivation is not reducible to egoism. 


Principlism: Upholding a Moral Principle 


Not only have most moral philosophers argued for the impor- 
tance of a prosocial motive other than egoism, but most since 
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Kant (1724-1804) have shunned altruism and collectivism as 
well. They reject appeals to altruism, especially empathy- 
induced altruism, because feelings of empathy, sympathy, and 
compassion are too fickle and too circumscribed. Empathy is 
not felt for everyone in need, at least not to the same degree. 
They reject appeals to collectivism because group interest is 
bounded by the limits of the group; it may even encourage 
doing harm to those outside the group. Given these problems 
with altruism and collectivism, moral philosophers have typ- 
ically advocated prosocial motivation with an ultimate goal of 
upholding a universal and impartial moral principle, such as 
justice (Rawls, 1971). We shall call this moral motivation 
principlism. 

Is acting with an ultimate goal of upholding a moral prin- 
ciple really possible? When Kant (1785/1898) briefly shifted 
from his analysis of what ought to be to what is, he was 
ready to admit that even when the concern we show for 
others appears to be prompted by duty to principle, it may 
actually be prompted by self-love (pp. 23-24). The goal of 
upholding a moral principle may be only an instrumental 
goal pursued as a means to reach the ultimate goal of self- 
benefit. If so, then principle-based motivation is actually 
egoistic. 

The self-benefits of upholding a moral principle are 
conspicuous. One can gain the social and self-rewards of 
being seen and seeing oneself as a good person. One can also 
avoid the social and self-punishments of shame and guilt for 
failing to do the right thing. As Freud (1930) suggested, soci- 
ety may inculcate such principles in the young in order to bri- 
dle their antisocial impulses by making it in their best 
personal interest to act morally (see also Campbell, 1975). 
Alternatively, through internalization (Staub, 1989) or devel- 
opment of moral reasoning (Kohlberg, 1976), principles may 
come to be valued in their own right and not simply as in- 
strumental means to self-serving ends. 

The issue here is the same one faced with altruism and 
collectivism. We need to know the nature of the underlying 
motive. Is the desire to uphold justice (or some other moral 
principle) an instrumental goal on the way to the ultimate 
goal of self-benefit? If so, this desire is a form of egoism. Is 
upholding the principle an ultimate goal, and the ensuing 
self-benefits merely unintended consequences? If so, princi- 
plism is a fourth type of prosocial motivation, independent of 
egoism, altruism, and collectivism. 

Recent research suggests that people often act so as to 
appear moral while, if possible, avoiding the cost of actually 
being moral (Batson, Kobrynowicz, Dinnerstein, Kampf, 
& Wilson, 1997; Batson, Thompson, Seuferling, Whitney, & 
Strongman, 1999). This research also suggests that if moral 
motivation exists, it is easily overpowered by self-interest. 
Many of us are, it seems, quite adept at moral rationalization. 
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We are good at justifying to ourselves (if not to others) why a 
situation that benefits us or those we care about does not 
violate our moral principles—for example, why storing our 
nuclear waste in someone else’s backyard is fair, why terrorist 
attacks by our side are regrettable but necessary evils whereas 
terrorist attacks by the other side are atrocities, and why we 
must obey orders even if it means killing innocent people. The 
abstractness of most moral principles, and their multiplicity, 
makes rationalization all too easy (see Bandura, 1991; 
Bersoff, 1999; Staub, 1990). 

But this may be only part of the story. Perhaps in some 
cases upholding a moral principle can serve as an ultimate 
goal, defining a form of motivation independent of egoism. If 
so, perhaps these principles can provide a basis for respond- 
ing to the needs of others that transcends reliance on self- 
interest or on vested interest in and feeling for the welfare of 
certain other individuals or groups. Quite an “if,” but it seems 
well worth conducting research to find out. 


Conflict and Cooperation of Prosocial Motives 


To recognize the range of possible prosocial motives makes 
available more resources to those seeking to produce a more 
humane, caring society. At the same time, a multiplicity of 
prosocial motives complicates matters. These different mo- 
tives for helping others do not always work in harmony. They 
can undercut or compete with one another. 

Well-intentioned appeals to extended or enlightened self- 
interest can backfire by undermining other prosocial motives. 
Providing people with money or other tangible incentives for 
showing concern may lead people to interpret their motiva- 
tion as egoistic even when it is not (Batson, Coke, Jasnoski, 
& Hanson, 1978). In this way, the assumption that there is 
only one answer to the question of why we act for the com- 
mon good—egoism—may become a self-fulfilling prophecy 
(Batson, Fultz, Schoenrade, & Paduano, 1987) and may cre- 
ate a self-perpetuating norm of self-interest (Miller, 1999; 
Miller & Ratner, 1998). 

Nor do the other three prosocial motives always work in 
harmony. They can conflict with one another. For example, 
altruism can—and often does—conflict with collectivism or 
principlism. We may ignore the larger social good, or we may 
compromise our principles, not only to benefit ourselves but 
also to benefit those individuals about whom we especially care 
(Batson, Batson, et al., 1995; Batson, Klein, Highberger, & 
Shaw, 1995). Indeed, whereas there are clear social sanctions 
against unbridled self-interest, there are not clear sanctions 
against altruism. Batson, Ahmad, et al. (1999) found that altru- 
ism can at times be a greater threat to the common good than is 
egoism. 


Each of the four possible prosocial motives that we have 
identified has its strengths. Each also has its weaknesses. The 
potential for the greatest good may come from strategies that 
orchestrate these motives so that the strengths of one can 
overcome the weaknesses of another. Strategies that combine 
appeals to either altruism or collectivism with appeals to prin- 
ciple seem especially promising. For example, think about the 
principle of justice. Upholding justice is a powerful motive, 
but it is vulnerable to rationalization. Empathy-induced altru- 
ism and collectivism are also powerful motives, but they are 
limited in scope. They produce partiality—special concern 
for a particular person or persons or for a particular group. If 
we can lead people to feel empathy for the victims of injustice 
or to perceive themselves in a common group with them, we 
may be able to get these motives working together rather than 
at odds. Desire for justice may provide perspective and rea- 
son; empathy-induced altruism or collectivism may provide 
emotional fire and a force directed specifically toward relief 
of the victims’ suffering, preventing rationalization. 

Something of this sort occurred, we believe, in a number 
of rescuers of Jews in Nazi Europe. A careful look at data col- 
lected by the Oliners and their colleagues (Oliner & Oliner, 
1988) suggests that involvement in rescue activity frequently 
began with concern for a specific individual or individuals for 
whom compassion was felt—often individuals known previ- 
ously. This initial involvement subsequently led to further 
contacts and rescue activity and to a concern for justice that 
extended well beyond the bound of the initial empathic con- 
cern. Something of this sort also lay at the heart of Gandhi’s 
and Martin Luther King’s practice of nonviolent protest. The 
sight on the TV news of a small Black child in Birmingham 
being literally rolled down the street by water from a fire hose 
under the direction of Police Chief Bull Connor, and the 
emotions this sight evoked, seemed to do more to arouse a 
concern for justice than did hours of reasoned argument and 
appeals for equal civil rights. 

Something of this sort also can be found in the writing of 
Jonathan Kozol. Deeply concerned about the “savage inequal- 
ities” in public education between rich and poor communities 
in the United States, Kozol (1991) does not simply document 
the inequity. He takes us into the lives of individual children. 
We come to care deeply for them and, as a result, about the 
injustice. 


RESEARCH METHOD MATTERS 


Efforts to explain prosocial behavior, especially its seemingly 
anomalous aspects, have raised thorny issues about research 
methods that, though not specific to this area, flourish here. 


Most of these issues are rooted in mire produced by two fea- 
tures. First, psychologists are not the only ones who care about 
prosocial behavior. Most research participants see themselves 
as good, kind, caring people, and they want to be seen that way 
by others. Second, although cool, cognitive analysis and in- 
ference are often involved, theory and research on prosocial 
behavior focuses on relatively hot, active processes—the 
interplay of values, emotions, motives, and behavior. These 
processes may not be accessible to cool introspection. 

To reap a fruitful harvest from the mire that these two 
features create, researchers need to avoid the pitfalls of 
demand characteristics, evaluation apprehension, social de- 
sirability, self-presentation, and reactive measures. Conse- 
quently, research on prosocial behavior still relies heavily on 
high-impact deception procedures of the sort made famous 
in the social psychology of the 1960s (Aronson, Brewer, & 
Carlsmith, 1985). The currently popular procedure of present- 
ing research participants with descriptions of hypothetical 
situations and asking them to report what they would do is of 
limited use when studying prosocial behavior. Commitment to 
actual behavior—if not the behavior itself—is almost always 
required (Lerner, 1987). Rather than relying heavily on self- 
reports, thought listing, or retrospective analysis to reveal me- 
diating psychological processes, we must often study these 
processes indirectly by designing research that allows the 
effect of mediators to be inferred from observable behav- 
ior. Typically, this means one must successfully deceive par- 
ticipants, run the experiments on each participant individually, 
use between-group designs, and so on. Clearly, such research is 
difficult. Equally clearly, it requires careful sensitivity to and 
protection of the welfare and dignity of participants. 

Deeming care and sensitivity insufficient, some universi- 
ties have instituted a blanket prohibition on the use of high- 
impact deceptions of the kind needed to address key research 
questions concerning prosocial behavior. It is ironic that the 
study of prosocial, ethical behavior is one of the areas to suf- 
fer most from restrictions imposed in response to concerns 
about research ethics. 

Few would disagree that society could benefit from in- 
creased prosocial behavior. Rage and hate crimes, terrorist 
attacks, child and spouse abuse, neglect of the homeless, the 
plight of people with AIDS, and the growing disparity be- 
tween rich and poor (and smug callousness toward the latter) 
provide all-too-frequent reminders of crying need. Given the 
societal importance of understanding why people act to ben- 
efit others, given the apparent necessity of using high-impact 
deception research to provide this understanding, and given 
the dangers of obtaining misleading information using other 
methods, it is not the use of these methods, but rather a blan- 
ket prohibition of them, that seems unethical. 
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CONCLUSION 


Over the past 30 years the practical concern to promote 
prosocial behavior has led to both a variance-accounted-for 
empirical approach and the application of existing psycho- 
logical theories. In addition, existing theory has been chal- 
lenged and new theoretical perspectives developed by a focus 
on anomalous aspects of why people do—and don’t—act 
prosocially. Research has challenged currently dominant the- 
ories of social motivation and even of human nature—views 
that limit the human capacity to care to self-interest. This 
research has raised the possibility of a multiplicity of social 
motives—altruism, collectivism, and principlism, as well as 
egoism. It also has raised important theoretical questions—as 
yet unanswered—about how these motives might be most 
effectively orchestrated to increase prosocial behavior. More 
broadly, research in this area takes exception to the currently 
dominant focus in social psychology on cognitive representa- 
tion of the social environment and processing of social infor- 
mation, calling for increased attention to motives, emotions, 
and values. 

Research on prosocial behavior provides evidence that in 
addition to our all-too-apparent failing and fallibilities, we 
humans are, at times, capable of caring, and caring deeply, 
for people and issues other than ourselves. This possibility 
has wide-ranging theoretical implications, suggesting that we 
are more social than even our most social theories have led 
us to believe. It also has wide-ranging practical implications, 
suggesting untapped resources for social change. At present, 
however, these theoretical and practical implications are 
only partly realized, providing a pressing—and daunting— 
agenda. 
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SOCIAL CONFLICT AND INTEGRATION 


Humans are fundamentally social animals. Not only is group 
living of obvious contemporary importance (see Spears, 
Oakes, Ellemers, & Haslam, 1997), but also it represents the 
fundamental survival strategy that has likely characterized 
the human species from the beginning (see Simpson & 
Kenrick, 1997). The ways in which people understand their 
group membership thus play a critical role in social conflict 
and harmony and in intergroup integration. This chapter 
examines psychological perspectives on intergroup relations 
and their implications for reducing bias and conflict and for 
enhancing social integration. First, we review social psycho- 
logical theories on the nature of individual and collective 
identities and their relation to social harmony and conflict. 
Then, we examine theoretical perspectives on reducing inter- 
group bias and promoting social harmony. Next, we explore 
the importance of considering majority and minority perspec- 
tives on intergroup relations, social conflict, and integration. 
The chapter concludes by considering future directions and 
practical implications. 
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INDIVIDUAL AND COLLECTIVE IDENTITY 


Perspectives on social conflict, harmony, and integration have 
reflected a variety of disciplinary orientations. For instance, 
psychological theories of intergroup attitudes have com- 
monly emphasized the role of the individual, in terms of 
personality and attitude, in social biases and discrimination 
(see Duckitt, 1992; Jones, 1997). Traditional psychological 
theories, such as the work on the authoritarian personality 
(Adorno, Frenkel-Brunswik, Levinson, & Sanford, 1950), 
have considered the role of dysfunctional processes in the 
overt expression of social biases. More contemporary ap- 
proaches to race relations, such as aversive racism and sym- 
bolic racism perspectives, have considered the contributions 
of normal processes (e.g., socialization and social cognition) 
to the expression of subtle, and often unconscious, biases 
(Dovidio & Gaertner, 1998; S. Gaertner & Dovidio, 1986; 
Kovel, 1970; Sears, 1988; Sears & Henry, 2000). In addition, 
the role of social norms and standards is emphasized in recent 
reconceptualizations of older measures, such as authoritarian- 
ism. Right-wing authoritarianism (Altemeyer, 1996, 1998) 
has been found to be associated with negative attitudes 
toward a number of groups, particularly those socially stig- 
matized by society (e.g., Altemeyer, 1996; Esses, Haddock, & 
Zanna, 1993). 

Recent approaches to intergroup relations within psychol- 
ogy have also considered the role of individual differences in 


486 Social Conflict, Harmony, and Integration 


representations of group hierarchy. Social dominance theory 
(Pratto & Lemieux, 2001; Pratto, Sidanius, Stallworth, & 
Malle, 1994; see also Sidanius & Pratto, 1999) assumes that 
people who are strongly identified with high-status groups 
and who see intergroup relations in terms of group competi- 
tion will be especially prejudiced and discriminatory toward 
out-groups. These biases occur spontaneously as a function of 
individual differences in social dominance orientation, in 
contexts in which in-group—out-group distinctions are salient 
(Pratto & Shih, 2000). Scales developed to measure social 
dominance orientation pit the values of group dominance and 
equality against each other (see Pratto et al., 1994; Sidanius & 
Pratto, 1999). People high in social dominance orientation 
believe that group hierarchies are inevitable and desirable, 
and they may thus see the world as involving competition 
between groups for resources. They endorse items such as, 
“Some groups of people are simply inferior to other groups” 
and “Sometimes other groups must be kept in their place.” In- 
dividuals high in social dominance orientation believe that 
unequal social outcomes and social hierarchies are appropri- 
ate and therefore support an unequal distribution of resources 
among groups in ways that usually benefit their own group 
(see Pratto et al., 1994; Sidanius, Levin, & Pratto, 1996). In- 
dividuals low in social dominance orientation, in contrast, are 
generally concerned about the welfare of others and are em- 
pathic and tolerant of other individuals and groups (Pratto 
et al., 1994). They tend to endorse items such as, “Group 
equality should be our ideal” and “We would have fewer 
problems if we treated people more equally.” 

Sociological theories, in contrast, have frequently empha- 
sized the role of large-scale social and structural dynamics in 
intergroup relations in general and in race relations in partic- 
ular (Blauner, 1972; Bonacich, 1972; Wilson, 1978). These 
theories have considered the dynamics of race relations 
largely in economic and class-based terms—and often to the 
exclusion of individual influences (see Bobo, 1999). 

Despite the existence of such divergent views, both socio- 
logical and psychological approaches have converged to rec- 
ognize the importance of understanding the impact of group 
functions and collective identities on race relations (see Bobo, 
1999). In terms of group functions, Blumer (1958a, 1958b, 
1965a, 1965b), for instance, offered a sociologically based 
approach focusing on defense of group position, in which 
group competition and threat were considered fundamental 
processes in the development and maintenance of social bi- 
ases. With respect to race relations, Blumer (1958a) wrote, 
“Race prejudice is a defensive reaction to such challenging of 
the sense of group position... . As such, race prejudice is a 
protective device. It functions, however shortsightedly, to pre- 
serve the integrity and position of the dominant group” (p. 5). 


From a psychological orientation, Sherif, Harvey, White, 
Hood, and Sherif (1961) similarly proposed that the func- 
tional relations between groups are critical in determining 
intergroup attitudes. According to this position, competition 
between groups produces prejudice and discrimination, 
whereas intergroup interdependence and cooperative interac- 
tion that result in successful outcomes reduce intergroup bias 
(see also Bobo, 1988; Bobo & Hutchings, 1996; Campbell, 
1965; Sherif, 1966). 

With respect to the importance of collective identity, 
psychological research has emphasized how the salience of 
group versus individual identity can influence the way in 
which people process social information. In particular, the 
operation of group-level processes has been hypothesized to 
be dynamically distinct from the influence of individual-level 
processes. Different modes of functioning are involved, and 
these modes critically influence how people perceive others 
and experience their own sense of identity. In terms of per- 
ceptions of others, for example, Brewer (1988) proposed a 
dual process model of impression formation (see also the 
continuum model; Fiske & Neuberg, 1990; see also Fiske, 
Lin, & Neuberg, 1999). The primary distinction in Brewer’s 
model is between two types of processing: person based and 
category based. Person-based processing is bottom up and 
data driven, involving the piecemeal acquisition of informa- 
tion that begins “at the most concrete level and stops at the 
lowest level of abstraction required by the prevailing pro- 
cessing objectives” (Brewer, 1988, p. 6). Category-based 
processing, in contrast, proceeds from global to specific; it is 
top-down. In top-down processing, how the external reality is 
perceived and experienced is influenced by category-based, 
subjective impressions. According to Brewer, category-based 
processing is more likely to occur than is person-based pro- 
cessing because social information is typically organized 
around social categories. 

With respect to one’s sense of identity, social identity 
theory (Tajfel & Turner, 1979) and self-categorization theory 
(Turner, 1985; see also Onorato & Turner, 2001) view the 
distinction between personal identity and social identity as a 
critical one (see Spears, 2001). When personal identity is 
salient, a person’s individual needs, standards, beliefs, and 
motives primarily determine behavior. In contrast, when 
social identity is salient, “people come to perceive them- 
selves as more interchangeable exemplars of a social cate- 
gory than as unique personalities defined by their individual 
differences from others” (Turner, Hogg, Oakes, Reicher, & 
Wetherell, 1987, p. 50). Under these conditions, collective 
needs, goals, and standards are primary. 

This perspective also proposes that a person defines or cat- 
egorizes the self along a continuum that ranges at one extreme 


from the self as a separate individual with personal motives, 
goals, and achievements to the self as the embodiment of a so- 
cial collective or group. At the individual level, one’s personal 
welfare and goals are most salient and important. At the group 
level, the goals and achievements of the group are merged 
with one’s own (see Brown & Turner, 1981), and the group’s 
welfare is paramount. At each extreme, self-interest fully is 
represented by the pronouns “T” and “We,” respectively. Inter- 
group relations begin when people think about themselves as 
group members rather than solely as distinct individuals. 

Illustrating the dynamics of this distinction, Verkuyten and 
Hagendoorn (1998) found that when individual identity was 
primed, individual differences in authoritarianism were the 
major predictor of the prejudice of Dutch students toward 
Turkish migrants. In contrast, when social identity Oe, 
national identity) was made salient, in-group stereotypes 
and standards primarily predicted prejudiced attitudes. Thus, 
whether personal or collective identity is more salient criti- 
cally shapes how a person perceives, interprets, evaluates, and 
responds to situations and to others (Kawakami & Dion, 1993, 
1995). 

Although the categorization process may place the person 
at either extreme of the continuum from personal identity to 
social identity, people often seek an intermediate point to 
balance their need to be different from others and their need 
to belong and share a sense of similarity to others (Brewer, 
1991). This balance enhances one’s feelings of connection 
to the group and increases group cohesiveness and social 
harmony (Hogg, 1996). However, social categorization into 
in-groups and out-groups also lays the foundation for the 
development of intergroup bias or ethnocentrism. In addition, 
intergroup relations tend to be less positive than interpersonal 
relations. Insko, Schopler, and their colleagues have demon- 
strated a fundamental individual-group discontinuity effect 
in which groups are greedier and less trustworthy than indi- 
viduals (Insko et al., 2001; Schopler & Insko, 1992). As a 
consequence, relations between groups tend to be more com- 
petitive and less cooperative than those between individuals. 
In general, then, the social categorization of others and oneself 
plays a significant role in prejudice and discrimination. 

Although social categorization generally leads to inter- 
group bias, the nature of that bias—whether it is based on in- 
group favoritism or extends to derogation and negative 
treatment of the out-group—depends on a number of factors, 
such as whether the structural relations between groups and 
associated social norms foster and justify hostility or contempt 
(Mummendey & Otten, 2001; Otten & Mummendey, 2000). 
However, different treatment of in-group versus out-group 
members, whether rooted in favoritism for one group or 
derogation of another, can lead to different expectations, 
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perceptions, and behavior toward in-group versus out-group 
members that can ultimately create a self-fulfilling prophecy. 
Initial in-group favoritism can also provide a foundation for 
embracing more negative intergroup feelings and beliefs that 
result from intrapersonal, cultural, economic, and political 
factors. In the next section we describe alternative, and ulti- 
mately complementary, theoretical approaches to intergroup 
conflict and integration. 


Perspectives on Intergroup Relations and Conflict 


In general, research on social conflict, harmony, and inte- 
gration has adopted one of two perspectives, one with an 
emphasis on the functional relations between groups and the 
other on the role of collective identities. 


Functional Relations Between Groups 


Theories based on functional relations often point to competi- 
tion and consequent perceived threat as a fundamental cause 
of intergroup prejudice and conflict. Realistic group conflict 
theory (Campbell, 1965; Sherif, 1966), for example, posits 
that perceived group competition for resources produces ef- 
forts to reduce the access of other groups to the resources. This 
process was illustrated in classic work by Muzafer Sherif and 
his colleagues (Sherif et al., 1961). In 1954 Sherif and his col- 
leagues conducted a field study on intergroup conflict in an 
area adjacent to Robbers Cave State Park in Oklahoma. In this 
study 22 12-year-old boys attending summer camp were ran- 
domly assigned to two groups (who subsequently named 
themselves Eagles and Rattlers). Over a period of weeks they 
became aware of the other group’s existence, engaged in a 
series of competitive activities that generated overt intergroup 
conflict, and ultimately participated in a series of cooperative 
activities designed to ameliorate conflict and bias. 

To permit time for group formation (e.g., norms and a 
leadership structure), the two groups were kept completely 
apart for one week. During the second week the investigators 
introduced competitive relations between the groups in the 
form of repeated competitive athletic activities centering 
around tug-of-war, baseball, and touch football, with the win- 
ning group receiving prizes. As expected, the introduction of 
competitive activities generated derogatory stereotypes and 
conflict among these groups. These boys, however, did not 
simply show in-group favoritism as we frequently see in lab- 
oratory studies. Rather, there was genuine hostility between 
these groups. Each group conducted raids on the other’s cab- 
ins that resulted in the destruction and theft of property. 
The boys carried sticks, baseball bats, and socks filled with 
rocks as potential weapons. Fistfights broke out between 
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members of the groups, and food and garbage fights erupted 
in the dinning hall. In addition, group members regularly ex- 
changed verbal insults (e.g., “ladies first”) and name-calling 
(e.g., “sissies,” “stinkers,” “pigs,” “bums, 
“communists”). 

During the third week, Sherif and his colleagues arranged 
intergroup contact under neutral, noncompetitive condi- 
tions. These interventions did not calm the ferocity of the 
exchanges, however. Mere intergroup contact was not suffi- 
cient to change the nature of the relations between the groups. 
Only after the investigators altered the functional relations 
between the groups by introducing a series of superordinate 
goals—ones that could not be achieved without the full 
cooperation of both groups and which were successfully 
achieved—did the relations between the two groups become 
more harmonious. 
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Sherif et al. (1961) proposed that functional relations 
between groups are critical in determining intergroup 
attitudes. When groups are competitively interdependent, the 
interplay between the actions of each group results in positive 
outcomes for one group and negative outcomes for the other. 
Thus, in the attempt to obtain favorable outcomes for them- 
selves, the actions of the members of each group are also 
realistically perceived to be calculated to frustrate the goals 
of the other group. Therefore, a win-lose, zero-sum competi- 
tive relation between groups can initiate mutually negative 
feelings and stereotypes toward the members of the other 
group. In contrast, a cooperatively interdependent relation be- 
tween members of different groups can reduce bias (Worchel, 
1986). 

Functional relations do not have to involve explicit compe- 
tition with members of other groups to generate biases. In the 
absence of any direct evidence, people typically presume that 
members of other groups are competitive and will hinder the 
attainment of one’s goals (Fiske & Ruscher, 1993). Moreover, 
feelings of interdependence on members of one’s own group 
may be sufficient to produce bias. Rabbie’s behavioral interac- 
tion model (see Rabbie & Lodewijkx, 1996; Rabbie & Schot, 
1990; cf. Bourhis, Turner, & Gagnon, 1997), for example, 
argues that either intragroup cooperation or intergroup com- 
petition can stimulate intergroup bias. Similarly, L. Gaertner 
and Insko (2000), who unconfounded the effects of catego- 
rization and outcome dependence, demonstrated that depen- 
dence on in-group members could independently generate 
intergroup bias among men. Perhaps as a consequence of 
feelings of outcome dependence, allowing opportunities for 
greater interaction among in-group members increases in- 
tergroup bias (L. Gaertner & Schopler, 1998), whereas in- 
creasing interaction between members of different groups 
(S. Gaertner et al., 1999) or even the anticipation of future 


interaction with other groups (Insko et al., 2001) decreases 
intergroup bias. 

Recently, Esses and her colleagues (Esses, Dovidio, 
Jackson, & Armstrong, 2001; Esses, Jackson, & Armstrong, 
1998; Jackson & Esses, 2000) have integrated work on re- 
alistic group conflict theory (Campbell, 1965; LeVine & 
Campbell, 1972; Sherif, 1966; see also Bobo, 1988) and 
social dominance theory (Pratto, 1999; Sidanius & Pratto, 
1999) within the framework of the instrumental model of 
group conflict. This model proposes that resource stress (the 
perception that access to a desired resource, such as wealth or 
political power, is limited) and the salience of a potentially 
competitive out-group lead to perceived group competition 
for resources. Several factors may determine the degree 
of perceived resource stress, with the primary ones including 
perceived scarcity of resources and individual or group sup- 
port for an unequal distribution of resources, which is closely 
related to social dominance orientation (Pratto et al., 1994). 
Moreover, resource stress is likely to lead to perceived group 
competition when a relevant out-group is present. Some 
groups are more likely to be perceived as competitors than are 
others. Out-groups that are salient and distinct from one’s 
own group are especially likely to stand out as potential com- 
petitors. However, potential competitors must also be similar 
to the in-group on dimensions that make them likely to take 
resources. That is, they must be interested in similar resources 
and in a position to potentially take these resources. 

The combination of resource stress and the presence of a 
potentially competitive out-group leads to perceived group 
competition. Such perceived group competition is likely to 
take the form of zero-sum beliefs: beliefs that the more the 
other group obtains, the less is available for one’s own group. 
There is a perception that any gains that the other group might 
make must be at the expense of one’s own group. The model is 
termed the instrumental model of group conflict because atti- 
tudes and behaviors toward the competitor out-group are hy- 
pothesized to reflect strategic attempts to remove the source of 
competition. Efforts to remove the other group from competi- 
tion may include out-group derogation, discrimination, and 
avoidance of the other group. One may express negative atti- 
tudes and attributions about members of the other group in an 
attempt to convince both one’s own group and other groups of 
the competitors’ lack of worth. Attempts to eliminate the 
competition may also entail discrimination and opposition to 
policies and programs that may benefit the other group. Limit- 
ing the other group’s access to the resources also reduces 
competition. Consistent with this model, Esses and her col- 
leagues have found that individuals in Canada and the United 
States perceive greater threat, are more biased against, and are 
more motivated to exclude immigrant groups that are seen as 


involved in a zero-sum competition for resources with nonim- 
migrants (Esses et al., 1998, 2001; Jackson & Esses, 2000). 

Discrimination can serve less tangible collective functions 
as well as concrete instrumental objectives. Blumer (1958a) 
acknowledged that the processes for establishing group posi- 
tion may involve goals such as gaining economic advantage, 
but they may also be associated with the acquisition of intan- 
gible resources such as prestige. Taylor (2000), in fact, sug- 
gested that symbolic, psychological factors are typically 
more important in intergroup bias than are tangible resources. 
Theoretical developments in social psychology, stimulated by 
social identity theory (Tajfel & Turner, 1979), further high- 
light the role of group categorization, independent of actual 
realistic group conflict, in motivations to achieve favorable 
group identities (“positive distinctiveness”) and consequently 
on the arousal of intergroup bias and discrimination. 


Collective Identity 


Human activity is rooted in interdependence. Group systems 
involving greater mutual cooperation have substantial sur- 
vival advantages for individual group members over those 
systems without reciprocally positive social relations (Trivers, 
1971). However, the decision to cooperate with nonrelatives 
(i.e., to expend resources for another’s benefit) is a dilemma of 
trust because the ultimate benefit for the provider depends on 
others’ willingness to reciprocate. Indiscriminate trust and al- 
truism that are not reciprocated are not effective survival 
strategies. 

Social categorization and group boundaries provide a basis 
for achieving the benefits of cooperative interdependence 
without the risk of excessive costs. In-group membership is a 
form of contingent cooperation. By limiting aid to mutually 
acknowledged in-group members, total costs and risks of non- 
reciprocation can be contained. Thus, in-groups can be de- 
fined as bounded communities of mutual trust and obligation 
that delimit mutual interdependence and cooperation. The 
ways in which people understand their group membership 
thus play a critical role in social harmony and conflict. 

Models of category-based processing (Brewer, 1988; see 
also Fiske et al., 1999) assume that “the mere presentation of a 
stimulus person activates certain classification processes that 
occur automatically and without conscious intent.... The 
process is one of ‘placing’ the individual social object along 
well-established stimulus dimensions such as age, gender, 
and skin color” (Brewer, 1988, pp. 5-6). We have further 
hypothesized that “a primitive type of categorization may also 
have a high probability of spontaneously occurring, perhaps in 
parallel process. This is the categorization of individuals as 
members of one’s ingroup or not” (Dovidio & Gaertner, 1993, 
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p. 170). Because of the centrality of the self in social per- 
ception (Higgins & Bargh, 1987; Kihlstrom et al., 1988), we 
propose that social categorization involves most fundamen- 
tally a distinction between the group containing the self (the 
in-group) and other groups (the out-groups) between the 
“we’s” and the “they’s” (see also Tajfel & Turner, 1979; Turner 
et al., 1987). This distinction has a profound influence on 
evaluations, cognitions, and behavior. 

Social identity theory (Tajfel & Turner, 1979) and, more 
recently, self-categorization theory (Turner, 1985; Turner 
et al., 1987) address the fundamental process of social catego- 
rization. From a social categorization perspective, when peo- 
ple or objects are categorized into groups, actual differences 
between members of the same category tend to be perceptually 
minimized (Tajfel, 1969) and often ignored in making deci- 
sions or forming impressions. Members of the same category 
seem to be more similar than they actually are, and more sim- 
ilar than they were before they were categorized together. In 
addition, although members of a social category may be dif- 
ferent in some ways from members of other categories, these 
differences tend to become exaggerated and overgeneralized. 
Thus, categorization enhances perceptions of similarities 
within groups and differences between groups—emphasizing 
social difference and group distinctiveness. This process is not 
benign because these within- and between-group distortions 
have a tendency to generalize to additional dimensions (e.g., 
character traits) beyond those that differentiated the categories 
originally (Allport, 1954, 1958). Furthermore, as the salience 
of the categorization increases, the magnitude of these distor- 
tions also increases (Abrams, 1985; Brewer, 1979; Brewer & 
Miller, 1996; Dechamps & Doise, 1978; Dion, 1974; Doise, 
1978; Skinner & Stephenson, 1981; Turner, 1981, 1985). 

Moreover, in the process of categorizing people into two 
different groups, people typically classify themselves into 
one of the social categories and out of the other. The insertion 
of the self into the social categorization process increases the 
emotional significance of group differences and thus leads to 
further perceptual distortion and to evaluative biases that 
reflect favorably on the in-group (Sumner, 1906), and conse- 
quently on the self (Tajfel & Turner, 1979). Tajfel and Turner 
(1979), in their social identity theory, proposed that a person’s 
need for positive identity may be satisfied by membership in 
prestigious social groups. This need also motivates social 
comparisons that favorably differentiate in-group from out- 
group members, particularly when self-esteem has been chal- 
lenged (Hogg & Abrams, 1990). For example, Meindl and 
Lerner (1984) found that experiencing an esteem-lowering 
experience (committing an unintentional transgression) moti- 
vated people to reject an opportunity for equal status contact 
between the in-group and an out-group in favor of interaction 
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that implied the more positive status of the in-group. Within 
social identity theory, successful intergroup discrimination 
is then presumed to restore, enhance, or elevate one’s self- 
esteem (see Rubin & Hewstone, 1998). 

As we noted earlier, the social identity perspective (see also 
self-categorization theory; Turner et al., 1987) also proposes 
that a person defines or categorizes the self along a continuum 
that ranges at one extreme from the self as the embodiment 
of a social collective or group to the self as a separate individ- 
ual with personal motives, goals, and achievements. Self- 
categorization in terms of collective identity, in turn, increases 
the likelihood of the development of intergroup biases and 
conflict (Schopler & Insko, 1992). As Sherif et al.’s (1961) 
initial observations revealed, intergroup relations begin to 
sour soon after people categorize others in terms of in-group 
and out-group members: “Discovery of another group of 
campers brought heightened awareness of ‘us’ and ‘ours’ as 
contrasted with ‘outsiders’ and ‘intruders,’ [and] an intense 
desire to compete with the other group in team games” (Sherif 
et al., 1961, p. 95). Thus, social categorization lays the foun- 
dation for intergroup bias and conflict that can lead to, and be 
further exacerbated by, competition between these groups. 

Additional research has demonstrated just how powerfully 
mere social categorization can influence differential thinking, 
feeling and behaving toward in-group versus out-group mem- 
bers. Upon social categorization of individuals into in-groups 
and out-groups, people spontaneously experience more posi- 
tive affect toward the in-group (Otten & Moskowitz, 2000; 
Otten & Wentura, 1999). They also favor in-group members 
directly in terms of evaluations and resource allocations 
(Mullen, Brown, & Smith, 1992; Tajfel, Billig, Bundy, & 
Flament, 1971), as well as indirectly in valuing the products of 
their work (Ferguson & Kelley, 1964). In addition, in-group 
membership increases the psychological bond and feelings of 
“oneness” that facilitate the arousal of promotive tension or 
empathy in response to others’ needs or problems (Hornstein, 
1976). In part as a consequence, prosocial behavior is offered 
more readily to in-group than to out-group members (Dovidio 
et al., 1997; Piliavin, Dovidio, Gaertner, & Clark, 1981). Peo- 
ple are also more likely to initiate “heroic” action on behalf of 
an in-group member than another person, for example by di- 
rectly confronting a transgressor who insults the person 
(Meindl & Lerner, 1983). Moreover, people are more likely to 
be cooperative and exercise more personal restraint when 
using endangered common resources when these are shared 
with in-group members than with others (Kramer & Brewer, 
1984), and they work harder for groups they identify more as 
their in-group (Worchel, Rothgerber, Day, Hart, & Butemeyer, 
1998). 


In terms of information processing, people retain more 
information in a more detailed fashion for in-group members 
than for out-group members (Park & Rothbart, 1982), have 
better memory for information about ways in which in-group 
members are similar and out-group members are dissimilar to 
the self (Wilder, 1981), and remember less positive informa- 
tion about out-group members (Howard & Rothbart, 1980). 
Perhaps because of the greater self-other overlap in represen- 
tations for people defined as in-group members (E. R. Smith & 
Henry, 1996), people process information about and make 
attributions to in-group members more on the basis of self- 
congruency than they do for out-group members (Gramzow, 
Gaertner, & Sedikides, 2001). 

People are also more generous and forgiving in their expla- 
nations for the behaviors of in-group relative to out-group 
members. Positive behaviors and successful outcomes are 
more likely to be attributed to internal, stable characteristics 
(the personality) of in-group than out-group members, 
whereas negative outcomes are more likely to be ascribed to 
the personalities of out-group members than of in-group mem- 
bers (Hewstone, 1990; Pettigrew, 1979). Observed behaviors 
of in-group and out-group members are encoded in memory at 
different levels of abstraction (Maass, Ceccarelli, & Rudin, 
1996). Undesirable actions of out-group members are en- 
coded at more abstract levels that presume intentionality and 
dispositional origin (e.g., she is hostile) than identical behav- 
iors of in-group members (e.g., she slapped the girl). Desirable 
actions of out-group members, however, are encoded at more 
concrete levels (e.g., she walked across the street holding 
the old man’s hand) relative to the same behaviors of in-group 
members (e.g., she is helpful). 

Language plays another role in intergroup bias through as- 
sociations with collective pronouns. Collective pronouns such 
as “we” or “they” that are used to define people’s in-group or 
out-group status are frequently paired with stimuli having 
strong affective connotations. As a consequence, these pro- 
nouns may acquire powerful evaluative properties of their 
own. These words (we, they) can potentially increase the 
availability of positive or negative associations and thereby 
influence beliefs about, evaluations of, and behaviors to- 
ward other people—often automatically and unconsciously 
(Perdue, Dovidio, Gurtman, & Tyler, 1990). 

The process of social categorization, however, is not com- 
pletely unalterable. Categories are hierarchically organized, 
and higher level categories (e.g., nations) are more inclusive 
of lower level ones (e.g., cities or towns). By modifying a per- 
ceiver’s goals, motives, past experiences, and expectations, as 
well as factors within the perceptual field and the situational 
context more broadly, there is opportunity to alter the level of 


category inclusiveness that will be primary in a given situa- 
tion. Although perceiving people in terms of a social category 
is easiest and most common in forming impressions, particu- 
larly during bitter intergroup conflict, appropriate goals, 
motivation, and effort can produce more individuated impres- 
sions of others (Brewer, 1988; Fiske et al., 1999). This mal- 
leability of the level at which impressions are formed—from 
broad to more specific categories to individuated responses— 
is important because of its implications for altering the way 
people think about members of other groups, and conse- 
quently about the nature of intergroup relations. 

Although functional and social categorization theories of 
intergroup conflict and social harmony suggest different 
psychological mechanisms, these approaches may offer 
complementary rather than necessarily competing explana- 
tions. For instance, realistic threats and symbolic threats 
reflect different hypothesized causes of discrimination, but 
they can operate jointly to motivate discriminatory behavior. 
W. Stephan and his colleagues (Stephan, Diaz-Loving, & 
Duran, 2000; Stephan & Stephan, 2000; Stephan, Ybarra, 
Martinez, Schwarzwald, & Tur-Kaspa, 1998) have found 
that personal negative stereotypes, realistic group threat, and 
symbolic group threat all predict discrimination against 
other groups (e.g., immigrants), and each accounts for a 
unique portion of the effect. In addition, personal-level 
biases and collective biases may also have separate and 
additive influences. Bobo and his colleagues (see Bobo, 
1999) have demonstrated that group threat and personal 
prejudice can contribute independently to discrimination 
against other groups. The independence of these effects 
points to the importance of considering each of these per- 
spectives for a comprehensive understanding of social con- 
flict and integration, while at the same time reinforcing the 
theoretical distinctions among the hypothesized underlying 
mechanisms. 

Given the centrality and spontaneity of the social catego- 
rization of people into in-group and out-group members, and 
given the important role of functional relations between 
groups in a world of limited resources that depend on differ- 
entiation between in-group and out-group members, how can 
bias be reduced? Because categorization is a basic process that 
is fundamental to prejudice and intergroup conflict, some con- 
temporary work has targeted this process as a place to begin to 
improve intergroup relations. This work also considers the 
functional relations among groups. In the next section we 
explore how the forces of categorization may be disarmed or 
redirected to promote more positive intergroup attitudes—and 
potentially begin to penetrate the barriers to reconciliation 
among groups with a history of antagonistic relations. One of 


Intergroup Contact and the Reduction of Bias 491 


the most influential strategies involves creating and structur- 
ing intergroup contact. 


INTERGROUP CONTACT AND THE REDUCTION 
OF BIAS 


For the past 50 years the contact hypothesis (Allport, 1954; 
Amir, 1969; Cook, 1985; Watson, 1947; Williams, 1947; see 
also Pettigrew, 1998a; Pettigrew & Tropp, 2000) has repre- 
sented a promising and popular strategy for reducing inter- 
group bias and conflict. This hypothesis proposes that simple 
contact between groups is not automatically sufficient to im- 
prove intergroup relations. Rather, for contact between groups 
to reduce bias successfully, certain prerequisite features must 
be present. These characteristics of contact include equal sta- 
tus between the groups, cooperative (rather than competitive) 
intergroup interaction, opportunities for personal acquain- 
tance between the members (especially with those whose per- 
sonal characteristics do not support stereotypic expectations), 
and supportive norms by authorities within and outside of the 
contact situation (Cook, 1985; Pettigrew, 1998a). Research in 
laboratory and field settings generally supports the efficacy of 
the list of prerequisite conditions for achieving improved 
intergroup relations (see Pettigrew & Tropp, 2000). 


Contact and Functional Relations 


Consistent with functional theories of intergroup relations, 
changing the nature of interdependence between members 
of different groups from perceived competition to coopera- 
tion significantly improves intergroup attitudes (Blanchard, 
Weigel, & Cook, 1975; Cook, 1985; Deutsch & Collins, 
1951; Green, Adams, & Turner, 1988; Stephan, 1987; 
Weigel, Wiser, & Cook, 1975). Cooperative learning (Slavin, 
1985), jigsaw classroom interventions in which students are 
interdependent on one another in problem-solving exercises 
(Aronson & Patnoe, 1997), and more comprehensive ap- 
proaches in schools that involve establishing a cooperative 
community, resolving conflicts, and internalizing civic values 
(e.g., Peacekeepers; Johnson & Johnson, 2000) also support 
the fundamental principles outside of the laboratory. Al- 
though it is difficult to establish all of these conditions in 
intergroup contact situations, the formula is effective when 
these conditions are met (Cook, 1984; Johnson, Johnson, & 
Maruyama, 1983; Pettigrew, 1998a). 

Structurally, however, the contact hypothesis has repre- 
sented a list of loosely connected, diverse conditions rather 
than a unifying conceptual framework that explains how these 
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prerequisite features achieve their effects. This is problematic 
because political and socioeconomic circumstances (e.g., real 
or perceived competitive, zero-sum outcomes) often preclude 
introducing these features (e.g., cooperative interdependence, 
equal status) into many contact settings. Despite substantial 
documentation that intergroup cooperative interaction re- 
duces bias (Allport, 1954; Aronson, Blaney, Stephan, Sikes, 
& Snapp, 1978; Cook, 1985; Deutsch, 1973; Johnson et al., 
1983; Sherif et al., 1961; Slavin, 1985; Worchel, 1979), it is 
not clear how cooperation achieves this effect. One basic 
issue involves the psychological processes that mediate this 
change. 

The classic functional relations perspective by Sherif et al. 
(1961) views cooperative interdependence as a direct media- 
tor of attitudinal and behavioral changes. However, recent 
approaches have extended research on the contact hypothesis 
by attempting to understand the potential common processes 
and mechanisms that these diverse factors engage to reduce 
bias. Several additional explanations have been proposed 
(see Brewer & Miller, 1984; Miller & Davidson-Podgorny, 
1987; Worchel, 1979, 1986). For example, cooperation may 
induce greater intergroup acceptance as a result of dissonance 
reduction serving to justify this type of interaction with the 
other group (Miller & Brewer, 1986). It is also possible that 
cooperation can have positive, reinforcing outcomes. When 
intergroup contact is favorable and has successful conse- 
quences, psychological processes that restore cognitive bal- 
ance or reduce dissonance produce more favorable attitudes 
toward members of the other group and toward the group as 
a whole to be consistent with the positive nature of the inter- 
action. In addition, the rewarding properties of achieving 
success may become associated with members of other 
groups (Lott & Lott, 1965), thereby increasing attraction 
(S. Gaertner et al., 1999). Also, cooperative experiences can 
reduce intergroup anxiety (Stephan & Stephan, 1984). 

Intergroup contact can also influence how interactants 
conceive of the groups and how the members are socially 
categorized. Cooperative learning and jigsaw classroom inter- 
ventions (Aronson & Patnoe, 1997), which are designed to in- 
crease interdependence between members of different groups 
and to enhance appreciation for the resources they bring to the 
task, may reduce bias in part by altering how interactants con- 
ceive of the group boundaries and memberships. In the next 
section we consider how the effects of intergroup contact can 
be mediated by changes in personal and collective identity. 


Contact, Categorization, and Identity 


From the social categorization perspective, the issue to be 
addressed is how intergroup contact can be structured to alter 


cognitive representations in ways that eliminate one or more 
of the basic features of the negative intergroup schema. 
Based on the premises of social identity theory, three alterna- 
tive models for contact effects have been developed and 
tested in experimental and field settings: decategorization, 
recategorization, and mutual differentiation. 

Each of these models can be described in terms of recom- 
mendations for how to structure cognitive representations of 
situations in which there is contact between the groups, the 
psychological processes that promote attitude change, and 
the mechanisms by which contact experiences are general- 
ized to change attitudes toward the out-group as a whole. 
Each of these strategies targets the social categorization 
process as the place to begin to understand and to combat 
intergroup biases. Decategorization encourages members to 
deemphasize the original group boundaries and to conceive 
of themselves as separate individuals rather than as members 
of different groups. Mutual differentiation maintains the orig- 
inal group boundaries, maintaining perceptions as different 
groups, but in the context of intergroup cooperation during 
which similarities and differences between the memberships 
are recognized and valued. Recategorization encourages the 
members of both groups to regard themselves as belonging to 
a common, superordinate group—one group that is inclusive 
of both memberships. 

Rather than viewing these as competing positions and argu- 
ing which one is correct, we suggest that these are comple- 
mentary approaches and propose that it is more productive to 
consider when each strategy is most effective. To the extent 
that it is possible for these strategies, either singly or in concert, 
to alter perceptions of the “Us versus Them” that are reflected 
in conflictive intergroup relations, reductions in bias and social 
harmony may be accomplished. Moreover, decategorization 
and recategorization strategies may increase the willingness of 
group representatives to view the meaning of the intergroup 
conflict from the other group’s perspective and to offer solu- 
tions that recognize both groups’ needs and concerns. 


Decategorization: The Personalization Model 


The first model is essentially a formalization and elaboration 
of the assumptions implicit in Allport’s contact hypothesis 
(Brewer & Miller, 1984). A primary consequence of salient 
in-group-out-group categorization is the deindividuation of 
members of the out-group. Social behavior in category-based 
interactions is characterized by a tendency to treat individual 
members of the out-group as undifferentiated representatives 
of a unified social category, ignoring individual differences 
within the group. The personalization perspective on the 
contact situation implies that intergroup interactions should 


be structured to reduce the salience of category distinctions 
and promote opportunities to get to know out-group mem- 
bers as individual persons, thereby disarming the forces of 
categorization. 

The conditional specifications of the contact hypothesis 
(e.g., Cooperative interaction) can be interpreted as features 
of the situation that reduce category salience and promote 
more differentiated and personalized representations of the 
participants in the contact setting. Interdependence typically 
motivates people to focus more on the individual characteris- 
tics of a person, with whom their outcomes are linked, than 
more general category representations (Fiske, 2000). Attend- 
ing to personal characteristics of group members not only 
provides the opportunity to disconfirm category stereotypes, 
but it also breaks down the monolithic perception of the out- 
group as a homogeneous unit (Wilder, 1978). In this scheme, 
the contact situation encourages attention to information at 
the individual level that replaces category identity as the most 
useful basis for classifying participants. 

With a more differentiated representation of out-group 
members, there is the recognition that there are different types 
of out-group members (e.g., sensitive as well as tough profes- 
sional hockey players), thereby weakening the effects of 
categorization and the tendency to minimize and ignore dif- 
ferences between category members. When personalized in- 
teractions occur, in-group and out-group members slide even 
further toward the individual side of the self as individual ver- 
sus group member continuum. Members “attend to informa- 
tion that replaces category identity as the most useful basis for 
classifying each other” (Brewer & Miller, 1984, p. 288) as 
they engage in personalized interactions. Repeated personal- 
ized contacts with a variety of out-group members should, 
over time, undermine the value and meaningfulness of the 
social category stereotype as a source of information about 
members of that group. This is the process by which contact 
experiences are expected to generalize—via reducing the 
salience and meaning of social categorization in the long run 
(Brewer & Miller, 1996). 

A number of studies provide evidence supporting this 
perspective on contact effects (Bettencourt, Brewer, Croak, & 
Miller, 1992; Marcus-Newhall, Miller, Holtz, & Brewer, 
1993). Miller, Brewer, and Edwards (1985), for instance, 
demonstrated that a cooperative task that required personal- 
ized interaction with members of the out-group resulted not 
only in more positive attitudes toward out-group members in 
the cooperative setting but also toward other out-group mem- 
bers shown on a videotape, compared to cooperative contact 
that was task focused rather than person focused. 

During personalization, members focus on information 
about an out-group member that is relevant to the self (as an 
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individual rather than as a group member). Repeated person- 
alized interactions with a variety of out-group members 
should over time undermine the value of the category stereo- 
type as a source of information about members of that group. 
Thus, the effects of personalization should generalize to new 
situations as well as to heretofore unfamiliar out-group mem- 
bers. For the benefits of personalization to generalize, how- 
ever, it is of course necessary for the identities of out-group 
members to be salient—at least somewhat—during the inter- 
action to allow the group stereotype to be weakened. 

Further evidence of the value of personalized interactions 
for reducing intergroup bias comes from data on the effects 
of intergroup friendships (Hamberger & Hewstone, 1997; 
Pettigrew, 1997; Pettigrew & Meertens, 1995). For example, 
across samples in France, Great Britain, the Netherlands, and 
Germany, Europeans with out-group friends were lower 
on measures of prejudice, particularly affective prejudice 
(Pettigrew, 1998a). This positive relation did not hold for 
other types of acquaintance relationships in work or residen- 
tial settings that did not involve formation of close interper- 
sonal relationships with members of the out-group. In terms 
of the direction of causality, although having more positive 
intergroup attitudes can increase the willingness to have 
cross-group friendships, path analyses indicate that the path 
from friendship to reduction in prejudice is stronger than the 
other way around (Pettigrew, 1998a). 

Other research reveals three valuable extensions of the per- 
sonalized contact effect. One is evidence that personal friend- 
ships with members of one out-group may lead to tolerance 
toward out-groups in general and reduced nationalistic pride, 
a process that Pettigrew (1997) refers to as deprovincializa- 
tion. Thus, decategorization based on developing cross-group 
friendships that decrease the relative attractiveness of a per- 
son’s in-group provides increased appreciation of the relative 
attractiveness of other out-groups more generally. 

A second extension is represented by evidence that contact 
effects may operate indirectly or vicariously. Although 
interpersonal friendship across group lines leads to reduced 
prejudice, even knowledge that an in-group member has be- 
friended an out-group member has the potential to reduce bias 
while the salience of group identities remains high for the ob- 
server (Wright, Aron, McLaughlin-Volpe, & Ropp, 1997). A 
third extension relates to interpersonal processes involving the 
arousal of empathic feelings for an out-group member, which 
can increase positive attitudes toward members of that group 
more widely (Batson et al., 1997). Thus, personalized interac- 
tion and interpersonal processes more generally can directly 
and indirectly increase positive feelings for out-group mem- 
bers through a variety of processes that can lead to more gen- 
eralized types of harmony and integration at the group level. 
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Recategorization: The Common In-Group 
Identity Model 


The second social categorization model of intergroup contact 
and conflict reduction is also based on the premise that 
reducing the salience of in-group—out-group category distinc- 
tions is key to positive effects. In contrast to the decategoriza- 
tion approaches described earlier, however, recategorization 
is not designed to reduce or eliminate categorization, but 
rather to structure a definition of group categorization at a 
higher level of category inclusiveness in ways that reduce 
intergroup bias and conflict (Allport, 1954, p. 43). 

Allport (1954, 1958) was aware of the benefits of a com- 
mon in-group identity, although he regarded it as a catalyst 
rather than as a product of the conditions of contact: 


To be maximally effective, contact and acquaintance programs 
should lead to a sense of equality in social status, should occur in 
ordinary purposeful pursuits, avoid artificiality, and if possible 
enjoy the sanction of the community in which they occur. While 
it may help somewhat to place members of different ethnic 
groups side by side on a job, the gain is greater if these members 
regard themselves as part of a team [italics added]. (Allport, 
1958, p. 489) 


In contrast, the common in-group identity model (S. 
Gaertner, Dovidio, Anastasio, Bachman, & Rust, 1993; S. 
Gaertner & Dovidio, 2000) proposes that group identity can 
be a critical mediating factor. According to this model, inter- 
group bias and conflict can be reduced by factors that trans- 
form participants’ representations of memberships from two 
groups to one, more inclusive group. With common in-group 
identity, the cognitive and motivational processes that ini- 
tially produced in-group favoritism are redirected to benefit 
the common in-group, including former out-group members. 

Allport’s (1954, 1958) description of widening circles of 
inclusion, hierarchically organized, depicts a person’s various 
in-group memberships from one’s family to one’s neighbor- 
hood, to one’s city, to one’s nation, to one’s race, to all of 
humankind. Recognizing that racial group identity had 
become the dominant allegiance among many White racists, 
Allport questioned the accuracy of the common belief that in- 
group loyalties always grow weaker the larger their circle of 
inclusion, which might prevent loyalty to a group more 
inclusive than race. Rather, Allport proposed the potential 
value of shifting the level of category inclusiveness from race 
to humankind. He recognized that the “clash between the idea 
of race and of One World . . . is shaping into an issue that may 
well be the most decisive in human history. The important 
question is, Can a loyalty to mankind be fashioned before in- 
terracial warfare breaks out?” (pp. 43-44). Butis it too difficult 


and unrealistic for people to identify with humankind? Allport 
proposed that this level of common in-group identification is 
difficult for most people primarily because there are few sym- 
bols that make this more ephemeral in-group real or concrete. 
That is, groups such as nations have symbols that include flags, 
buildings, and holidays, but at the international level there are 
few icons that help serve as anchors for unity and world loy- 
alty. Attempts to forge superordinate cooperative alliances, 
therefore, would more likely engage identification processes if 
symbols were adopted to affirm the joint venture. 

Among the antecedent factors proposed by the common 
in-group identity model are the features of contact situations 
that are necessary for intergroup contact to be successful 
(e.g., interdependence between groups, equal status, equali- 
tarian norms; Allport, 1954). From this perspective, inter- 
group cooperative interaction, for example, enhances 
positive evaluations of out-group members, at least in part, 
because cooperation transforms members’ representations of 
the memberships from “Us” versus “Them” to a more inclu- 
sive “We.” In a laboratory experiment, S. Gaertner, Mann, 
Dovidio, Murrell, and Pomare (1990) directly tested and 
found strong support for the hypotheses that the relation be- 
tween intergroup cooperation and enhanced favorable evalu- 
ations of out-group members was mediated by the extent to 
which members of both groups perceived themselves as one 
group. In addition, the generalizability of this effect was sup- 
ported by a series of survey studies conducted in natural 
settings across very different intergroup contexts: bankers 
experiencing corporate mergers, students in a multiethnic 
high school, and college students from blended families (see 
S. Gaertner, Dovidio, & Bachman, 1996). Moreover, appeals 
that emphasize the common group membership of nonimmi- 
grants and immigrants have been shown to improve attitudes 
toward immigrants and to increase support for immigration 
among people in Canada and the United States, and particu- 
larly among those high in social dominance orientation for 
whom group hierarchy is important (Esses et al., 2001). 

These effects of recategorization on behaviors, such as 
helping and self-disclosure (see Dovidio et al., 1997; Nier 
et al., 2001), as well as on attitudes, have some extended, 
practical implications. Recategorization can stimulate inter- 
actions among group members in the contact situation that 
can in turn activate other processes, which subsequently pro- 
mote more positive intergroup behaviors and attitudes. For 
example, both self-disclosure and helping typically produce 
reciprocity. More intimate self-disclosure by one person 
normally encourages more intimate disclosure by the other 
(Archer & Berg, 1978; Derlega, Metts, Petronio, & Margulis, 
1993). As we discussed earlier, the work of Miller, Brewer, 
and their colleagues (e.g., Brewer & Miller, 1984; Miller 


et al., 1985) has demonstrated that personalized and self- 
disclosing interaction can be a significant factor in reducing 
intergroup bias. 

Considerable cross-cultural evidence also indicates the 
powerful influence of the norm of reciprocity on helping 
(Schroeder, Penner, Dovidio & Piliavin, 1995). According to 
this norm, people should help those who have helped them, 
and they should not help those who have denied them help 
for no legitimate reason (Gouldner, 1960). Thus, the devel- 
opment of a common in-group identity can motivate inter- 
personal behaviors between members of initially different 
groups that can initiate reciprocal actions and concessions 
(see Deutsch, 1993; Osgood, 1962). These reciprocal actions 
and concessions not only will reduce immediate tensions but 
also can produce more harmonious intergroup relations 
beyond the contact situation. 

Although finely differentiated impressions of out-group 
members may not be an automatic consequence of forming a 
common in-group identity, these more elaborated, differen- 
tiated, and personalized impressions can quickly develop 
because the newly formed positivity bias is likely to encourage 
more open communication (S. Gaertner et al., 1993). The 
development of a common in-group identity creates a motiva- 
tional foundation for constructive intergroup relations that can 
act as a catalyst for positive reciprocal interpersonal actions. 
Thus, the recategorization strategy proposed in our model as 
well as decategorization strategies, such as individuating 
(Wilder, 1984) and personalizing (Brewer & Miller, 1984) 
interactions, can potentially operate complementarily and 
sequentially to improve intergroup relations in lasting and 
meaningful ways. 


Challenges to the Decategorization and 
Recategorization Models 


Although the structural representations of the contact situa- 
tion advocated by the decategorization (personalization) and 
recategorization (common in-group identity) models are dif- 
ferent, the two approaches share common assumptions about 
the need to reduce category differentiation and associated 
processes. Because both models rely on reducing or eliminat- 
ing the salience of intergroup differentiation, they involve 
structuring contact in a way that will challenge or threaten 
existing social identities. However, both cognitive and 
motivational factors conspire to create resistance to the dis- 
solution of category boundaries or to reestablish category 
distinctions over time. Although the salience of a common 
superordinate identity or personalized representations may 
be enhanced in the short run, then, these may be difficult to 
maintain across time and social situations. 
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Brewer’s (1991) optimal distinctiveness model of the 
motives underlying group identification provides one expla- 
nation for why category distinctions are difficult to change. 
The theory postulates that social identity is driven by two op- 
posing social motives: the need for inclusion and the need for 
differentiation. Human beings strive to belong to groups that 
transcend their own personal identity, but at the same time 
they need to feel special and distinct from others. In order to 
satisfy both of these motives simultaneously, individuals 
seek inclusion in distinctive social groups where the bound- 
aries between those who are members of the in-group cate- 
gory and those who are excluded can be drawn clearly. On 
the one hand, highly inclusive superordinate categories do 
not satisfy distinctiveness needs. Thus, inclusive identities, 
which may not be readily accepted, may be limited in their 
capacity to reduce bias (Hornsey & Hogg, 2000a, 2000b). On 
the other hand, high degrees of individuation fail to meet 
needs for belonging and for cognitive simplicity and uncer- 
tainty reduction (Hogg & Abrams, 1993). These motives are 
likely to make either personalization or common in-group 
identity temporally unstable solutions to intergroup discrimi- 
nation and prejudice. 

Preexisting social-structural relations between groups may 
also create strong forces of resistance to changes in category 
boundaries. Even in the absence of overt conflict, asymmetries 
between social groups in size, power, or status create addi- 
tional sources of resistance. When one group is substantially 
smaller than the other in the contact situation, the minority 
category is especially salient, and minority group members 
may be particularly reluctant to accept a superordinate cate- 
gory identity that is dominated by the other group. Another 
major challenge is created by preexisting status differences 
between groups, where members of both high- and low-status 
groups may be threatened by contact and assimilation 
(Mottola, 1996). 


The Mutual Differentiation Model 


These challenges to processes of decategorization/recategori- 
zation led Hewstone and Brown (1986) to recommend an 
alternative approach to intergroup contact in which coopera- 
tive interactions between groups are introduced without 
degrading the original in-group-out-group categorization. 
More specifically, this model favors encouraging groups 
working together to perceive complementarity by recogniz- 
ing and valuing mutual assets and weaknesses within the 
context of an interdependent cooperative task or common, 
superordinate goals. This strategy allows group members to 
maintain their social identities and positive distinctiveness 
while avoiding insidious intergroup comparisons. Thus, the 
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mutual intergroup differentiation model does not seek to 
change the basic category structure of the intergroup contact 
situation, but to change the intergroup affect from negative to 
positive interdependence and evaluation. 

In order to promote positive intergroup experience, 
Hewstone and Brown (1986) recommended that the contact 
situation be structured so that members of the respective 
groups have distinct but complementary roles to contribute to- 
ward common goals. In this way, both groups can maintain 
positive distinctiveness within a cooperative framework. 
Evidence in support of this approach comes from the results of 
an experiment by Brown and Wade (1987) in which work 
teams composed of students from two different faculties en- 
gaged in a cooperative effort to produce a two-page magazine 
article. When the representatives of the two groups were as- 
signed separate roles in the team task (one group working on 
figures and layout, the other working on text), the contact ex- 
perience had a more positive effect on intergroup attitudes than 
when the two groups were not provided with distinctive roles 
(see also Deschamps & Brown, 1983; Dovidio, Gaertner, & 
Validzic, 1998). 

Hewstone and Brown (1986) argued that generalization of 
positive contact experiences is more likely when the contact 
situation is defined as an intergroup situation rather than as an 
interpersonal interaction. Generalization in this case is direct 
rather than requiring additional cognitive links between posi- 
tive affect toward individuals and toward representations of the 
group as a whole. This position is supported by evidence, re- 
viewed earlier, that cooperative contact with a member of an 
out-group leads to more favorable generalized attitudes toward 
the group as a whole when category membership is made 
salient during contact (e.g., Brown, Vivian, & Hewstone, 1999; 
van Oudenhoven, Groenewoud, & Hewstone, 1996). 

Although in-group—out-group category salience is usually 
associated with in-group bias and the negative side of inter- 
group attitudes, cooperative interdependence is assumed to 
override the negative intergroup schema, particularly if the 
two groups have differentiated, complementary roles to play. 
Because it capitalizes on needs for distinctive social identi- 
ties, the mutual intergroup differentiation model provides 
a solution that is highly stable in terms of the cognitive- 
structural aspects of the intergroup situation. The affective 
component of the model, however, is likely to be less stable. 
Salient intergroup boundaries are associated with mutual dis- 
trust (Schopler & Insko, 1992), which undermines the poten- 
tial for cooperative interdependence and mutual liking over 
any length of time. By reinforcing perceptions of group dif- 
ferences, this differentiation model risks reinforcing negative 
beliefs about the out-group in the long run; intergroup anxi- 
ety (Greenland & Brown, 1999; Islam & Hewstone, 1993) 


and the potential for fission and conflict along group lines 
remain high. 

In addition, theoretical approaches and interventions are 
often guided by the perspective of the majority group. Indeed, 
because the majority group typically possesses the resources, 
focusing strategies for reducing conflict and enhancing social 
harmony on the majority group have considerable potential. 
However, it is not enough, and without considering all of the 
groups involved, these strategies can be counterproductive. 
Intergroup relations need to be understood from the perspec- 
tive of each of the groups involved. We consider the issue of 
multiple perspectives in the next section. 


HARMONY AND INTEGRATION: MAJORITY AND 
MINORITY PERSPECTIVES 


The perspectives that majority and minority group members 
take on particular interactions and on intergroup relations in 
general may differ in fundamental ways. The attributions and 
experiences of people in seemingly identical or comparable 
situations may be affected by ethnic or racial group member- 
ship (see Crocker & Quinn, 2001). In the United States, 
Blacks perceive less social and economic opportunity than do 
Whites (Schuman, Steeh, Bobo, & Krysan, 1997). Cross- 
culturally, the generally nonstigmatized ethnic and racial 
majorities perceive intergroup contact more positively than 
do minorities (S. Gaertner, Rust, Dovidio, Bachman, & 
Anastasio, 1996; Islam & Hewstone, 1993). Distinctiveness, 
associated with numerical minority status or the salience of 
physical or social characteristics, can exacerbate feelings of 
stigmatization among members of traditionally disadvan- 
taged groups (e.g., Kanter, 1977; Niemann & Dovidio, 1998). 

More generally, group status has profound implications 
for the experience of individuals, their motivations and 
aspirations, and their orientations to members of their own 
group and of other groups. As Ellemers and Barreto (2001) 
outlined, responses to the status of one’s group depend on 
whether one is a member of a low- or high-status group, the 
importance of the group to the individual (i.e., strength of 
identification), the perceived legitimacy of the status differ- 
ences, and the prospects for change at the individual or group 
level (see also Branscombe & Ellemers, 1998). Wright 
(2001; see also Tajfel & Turner, 1979) further proposed that 
people in low-status groups will be motivated to pursue col- 
lective action on behalf of their group, rather than seek per- 
sonal mobility, when they identify strongly with the group or 
when possibilities for individual mobility are limited, when 
intergroup comparisons produce perceptions of disadvantage 
and that disadvantage is viewed as illegitimate, and when 


people believe that the intergroup hierarchy can change and 
the in-group has the resources to change it. Although collec- 
tive action may have long-term benefits in achieving justice 
and equality, in the short-term the conditions that facilitate 
collective action may intensify social categorization of mem- 
bers of the in-group and out-groups, temporarily increase 
conflict, and reduce the likelihood of harmony or integration 
between groups. 

Racial and ethnic identities are unlikely to be readily 
abandoned because they are frequently fundamental aspects 
of individuals’ self-concepts and esteem and are often associ- 
ated with perceptions of collective injustice. Moreover, when 
such identities are threatened, for example by attempts to 
produce a single superordinate identity at the expense of 
one’s racial or ethnic group identity, members of these groups 
may respond in ways that reassert the value of the group (e.g., 
with disassociation from the norms and values of the larger 
society; see Branscombe, Schmitt, & Harvey, 1999; Steele, 
1997) and adversely affect social harmony. 

In addition, efforts to incorporate minority groups within 
the context of a superordinate identity may also produce 
negative responses from the majority group. Mummendey 
and Wenzel (1999) argued that because the standards of the 
superordinate group will primarily reflect those of the major- 
ity subgroup, the minority out-group will tend to be viewed 
as nonnormative and inferior by those standards, which can 
exacerbate intergroup bias among majority group members 
and increase group conflict. In contrast, S. Gaertner and 
Dovidio (2000) have proposed that the simultaneous exis- 
tence of superordinate and subordinate group representations 
(i.e., dual- or multiple-identities) may not only improve inter- 
group relations (see also Hornsey & Hogg, 2000a, 2000b) but 
also may contribute to the social adjustment, psychological 
adaptation, and overall well-being of minority group mem- 
bers (LaFromboise, Coleman, & Gerton, 1993). Therefore, 
identifying the conditions under which a dual identity serves 
to increase or diffuse intergroup conflict is an issue actively 
being pursued by contemporary researchers. 

There is evidence that intergroup benefits of a strong 
superordinate identity can be achieved for both majority and 
minority group members when the strength of the subordinate 
identity is high, regardless of the strength of subordinate group 
identities. For example, in a survey study of White adults, H. J. 
Smith and Tyler (1996, Study 1) measured the strengths of 
respondents’ superordinate identity as “American” and also 
the strengths of their subordinate identification as “White.” 
Respondents with a strong American identity, independent of 
the degree to which they identified with being White, were 
more likely to base their support for affirmative action policies 
that would benefit Blacks and other minorities on relational 
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concerns regarding the fairness of congressional representa- 
tives than on concerns about whether these policies would in- 
crease or decrease their own well-being. However, for those 
who had weak identification with being American and primar- 
ily identified themselves with being White, their position on 
affirmative action was determined more strongly by concerns 
regarding the instrumental value of these policies for them- 
selves. This pattern of findings suggests that a strong superor- 
dinate identity (such as being American) allows individuals to 
support policies that would benefit members of other racial 
subgroups without giving primary consideration to their own 
instrumental needs. 

Among minorities, even when racial or ethnic identity is 
strong, perceptions of a superordinate connection enhance in- 
terracial trust and acceptance of authority within an organiza- 
tion. Huo, Smith, Tyler, and Lind (1996) surveyed White, 
Black, Latino, and Asian employees of a public-sector orga- 
nization. Identification with the organization (superordinate 
identity) and racial-ethnic identity (subgroup identity) were 
independently assessed. Regardless of the strength of racial- 
ethnic identity, respondents who had a strong organizational 
identity perceived that they were treated fairly within the 
organization, and consequently they had favorable attitudes 
toward authority. Huo et al. (1996) concluded that having a 
strong identification with a superordinate group can redirect 
people from focusing on their personal outcomes to concerns 
about “achieving the greater good and maintaining social sta- 
bility” (pp. 44-45), while also maintaining important racial 
and ethnic identities. 

S. Gaertner et al. (1996) found converging evidence in a 
survey of students in a multiethnic high school. In particular, 
they compared students who identified themselves on the 
survey using a dual identity (e.g., indicating they were Korean 
and American) with those who used only a single subgroup 
identity (e.g., Korean). Supportive of the role of a dual iden- 
tity, students who described themselves both as Americans 
and as members of their racial or ethnic group had less bias to- 
ward other groups in the school than did those who described 
themselves only in terms of their subgroup identity. Also, the 
minority students who identified themselves using a dual iden- 
tity reported lower levels of intergroup bias in general relative 
to those who used only their ethnic or racial group identity. 

Not only do Whites and racial and ethnic minorities bring 
different values, identities, and experiences to intergroup con- 
tact situations, but also these different perspectives can shape 
perceptions of and reactions to the nature of the contact. 
Blumer (1958a) proposed that group status is a fundamental 
factor in the extent of and type of threat that different groups 
experience. Recent surveys reveal, for example, that Blacks 
show higher levels of distrust and greater pessimism about 
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intergroup relations than do Whites (Dovidio & Gaertner, 
1998; Hochschild, 1995). Majority group members tend to 
perceive intergroup interactions as more harmonious and pro- 
ductive than do minority group members (S. Gaertner et al., 
1996; Islam & Hewstone, 1993; see also the survey of college 
students discussed earlier), but they also tend to perceive sub- 
ordinate and minority groups as encroaching on their rights 
and prerogatives (Bobo, 1999). In addition, majority and 
minority group members have different preferences for the 
ultimate outcomes of intergroup contact. Whereas minority 
group members often tend to want to retain their cultural iden- 
tity, majority group members may favor the assimilation of 
minority groups into one single culture (a traditional melting 
pot orientation): the dominant culture (e.g., Horenczyk, 1996). 

Berry (1984; Berry, Poortinga, Segall, & Dasen, 1992) 
presented four forms of cultural relations in pluralistic soci- 
eties that represent the intersection of “yes—no” responses to 
two relevant questions: (a) Is cultural identity of value, and to 
be retained? (b) Are positive relations with the larger society 
of value, and to be sought? These combinations reflect four 
adaptation strategies for intergroup relations: (a) integration, 
when cultural identities are retained and positive relations 
with the larger society are sought; (b) separation, when cul- 
tural identities are retained but positive relations with the 
larger society are not sought; (c) assimilation, when cultural 
identities are abandoned and positive relations with the larger 
society are desired; and (d) marginalization, when cultural 
identities are abandoned and are not replaced by positive 
identification with the larger society. 

Research in the area of immigration suggests that immi- 
grant groups and majority groups have different preferences 
for these different types of group relations. Van Oudenhoven, 
Prins, and Buunk (1998) found in the Netherlands that Dutch 
majority group members preferred an assimilation of minority 
groups (in which minority group identity was abandoned and 
replaced by identification with the dominant Dutch culture), 
whereas Turkish and Moroccan immigrants most strongly en- 
dorsed integration (in which they would retain their own cul- 
tural identity while also valuing the dominant Dutch culture). 

These preferences also apply to the preferences of Whites 
and minorities about racial and ethnic group relations in the 
United States. Dovidio, Gaertner, and Kafati (2000) have 
found that Whites prefer assimilation most, whereas racial 
and ethnic minorities favor pluralistic integration. Moreover, 
these preferred types of intergroup relations for majority and 
minority groups—a one-group representation (assimilation) 
for Whites and dual representation (pluralistic integration) 
for racial and ethnic minorities—differentially mediated 
the consequences of intergroup contact for the different 
groups. Specifically, for Whites, more positive perceptions 


of intergroup contact related to stronger superordinate Oe, 
common group) representations, which in turn mediated 
more positive attitudes to their school and other groups at the 
school. In contrast, for minority students, a dual-identity 
(integration) representation—but not the one-group represen- 
tation—predicted more positive attitudes toward their school 
and to other groups. In general, these effects were stronger 
for people higher in racial-ethnic identification, both for 
Whites and minorities. 

These findings have practical as well as theoretical 
implications for reducing intergroup conflict and enhancing 
social harmony. Although correlational data should be inter- 
preted cautiously, it appears that for both Whites and racial 
and ethnic minorities, favorable intergroup contact may con- 
tribute to their commitment to their institution. However, 
strategies and interventions designed to enhance satisfaction 
need to recognize that Whites and minorities may have dif- 
ferent ideals and motivations. Because White values and cul- 
ture have been the traditionally dominant ones in the United 
States, American Whites may see an assimilation model—in 
which members of other cultural groups are absorbed into 
the mainstream—as the most comfortable and effective 
strategy. For racial and ethnic minorities, this model, which 
denies the value of their culture and traditions, may be per- 
ceived not only as less desirable but also as threatening to 
their personal and social identity—particularly for people 
who strongly identify with their group. Thus, efforts to 
create a single superordinate identity, though well inten- 
tioned, may threaten one’s social identity, which in turn can 
intensify intergroup bias and conflict. As Bourhis, Moise, 
Perrault, and Sebecal (1997) argued with respect to the 
nature of immigrant and host community relations, conflict 
is likely to be minimized and social harmony fostered 
when these groups have consonant acculuration ideals and 
objectives. 


CONCLUSIONS 


In this chapter we have examined the fundamental psycholog- 
ical processes related to intergroup relations, group conflict, 
social harmony, and intergroup integration. Intergroup bias 
and conflict are complex phenomena having historical, cul- 
tural, economic, and psychological roots. In addition, these 
are dynamic phenomena that can evolve to different forms 
and manifestations over time. A debate about whether a soci- 
etal, institutional, intergroup, or individual level of analysis is 
most appropriate, or a concern about which model of bias or 
bias reduction accounts for the most variance, not only may 
thus be futile but may also distract scholars from a more 


fundamental mission: developing a comprehensive model of 
social conflict, harmony, and integration. 

We propose that understanding how structural, social, and 
psychological mechanisms jointly shape intergroup relations 
can have both valuable theoretical and practical implications. 
Theoretically, individual difference (e.g., social dominance 
orientation; Sidanius & Pratto, 1999), functional (e.g., Sherif & 
Sherif, 1969), and collective identity (e.g., Tajfel & Turner, 
1979; Turner et al., 1987) approaches can be viewed as com- 
plementary rather than competing explanations for social con- 
flict and harmony (see Figure 20.1). Conceptually, intergroup 
relations are significantly influenced by structural factors as 
well as by individual orientations toward intergroup relations 
(e.g., social dominance orientation; Sidanius & Pratto, 1999) 
and toward group membership (e.g., strength of identifica- 
tion) and by the nature of collective identity. Functional rela- 
tions within and between groups and social identity can 
influence perceptions of intra- and intergroup support or threat 
as well as the nature of group representations (see Figure 20.1). 
For instance, greater dependence on in-group members can 
strengthen the perceived boundaries, fostering representations 
as members of different groups and increasing perceptions of 
threat (L. Gaertner & Insko, 2000). Empirically, self-interest, 
realistic group threat, and identity threat have been shown in- 
dependently to affect intergroup relations adversely (Bobo, 
1999; Esses et al., 1998; Stephan & Stephan, 2000). Percep- 
tions of intergroup threat or support and group representations 
can also mutually influence one another. Perceptions of com- 
petition or threat increase the salience of different group repre- 
sentations and decrease the salience of superordinate group 
connections, whereas stronger inclusive representations of the 


Resource 
Availability/Scarcity 


L 


Functional 
Relations 


Perceptions 
of Support/ 


(Cooperative- Threat 


Competitive) 


Individual ES 
Differences 


$ 


Group 
Representations: 
(Different Groups, 

Superordinate Group, 
Superordinate/Subgroup, 
Separate Individuals) 


Identity 


(Individual- 
Social) 


Figure 20.1 


Conclusions 499 


groups can decrease perceptions of intergroup competition 
(S. L. Gaertner et al., 1990). 

Similarly, within the social categorization approach, re- 
searchers have posited not only that decategorization, recate- 
gorization, and mutual intergroup differentiation processes 
can each play a role in the reduction of bias over time 
(Pettigrew, 1998a), but also that these processes can facilitate 
each other reciprocally (S. L. Gaertner et al., 2000; Hewstone, 
1996). Within an alternating sequence of categorization 
processes, mutual differentiation may emerge initially to neu- 
tralize threats to original group identities posed by the recate- 
gorization and decategorization processes. Once established, 
mutual differentiation can facilitate the subsequent recogni- 
tion and acceptance of a salient superordinate identity and 
recategorization, which would have previously stimulated 
threats to the distinctiveness of group identities (S. Gaertner & 
Dovidio, 2000). 

Reductions in perceived threat, increased perceptions of 
intergroup support, and more inclusive representations (either 
as a superordinate group or as a dual identity), in turn, can 
activate group- and individual-level processes that can reduce 
intergroup conflict (see Figure 20.1). These processes may 
also operate sequentially. For example, once people identify 
with a common group identity, they may be more trusting of 
former out-group members and consequently be willing to en- 
gage in the type of personalized, self-disclosing interaction 
that can further promote social harmony (Brewer & Miller, 
1984; Dovidio et al., 1997). Thus, factors related to structural 
and functional relations between groups and those associated 
with collective representations (e.g., involving mutual inter- 
group differentiation, recategorization, and decategorization 
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processes) can operate in a complementary and reciprocal 
fashion. 

Pragmatically, understanding the nature of bias and conflict 
can suggest ways in which these forces can be harnessed and 
redirected to promote social harmony. Given the different per- 
spectives, needs, and motivations of majority (high status) and 
minority (low status) groups, interventions based on these 
principles need to be considered carefully. Nevertheless, un- 
derstanding the multilevel nature of prejudice and discrimina- 
tion is an essential step for finding solutions—which may 
need to be similarly multifaceted. These principles may be 
applied to reduce social conflict and facilitate the integration 
of groups as disparate as corporations and stepfamilies 
(S. Gaertner, Bachman, Dovidio, & Banker, 2001), to im- 
prove race relations in the workplace (Dovidio, Gaertner, & 
Bachman, 2001) and more generally (Dovidio & Gaertner, 
1998), and to meet the challenge of managing immigration 
successfully—in ways that facilitate the achievement and well- 
being of immigrants and that produce the cooperation and sup- 
port of residents of the receiving country (Esses et al., 2001). 

In addition, these approaches may be applied integratively 
to reduce international tensions and improve national rela- 
tions (Pettigrew, 1998b). Rouhana and Kelman (1994), for 
example, described the activities and outcomes of a program 
of workshops designed to improve Palestinian-Israeli rela- 
tions and to contribute to peace in the Middle East. These 
workshops required Palestinian and Israeli participants to 
search for solutions that satisfy the needs of both parties. 
Rouhana and Kelman (1994) explained that this enterprise 
“can contribute to a creative redefinition of the conflict, to 
joint discovery of win-win solutions, and to transformation of 
the relationship between the parties” (p. 160). Conceptually, 
this orientation changes the structural relations between the 
groups from competition to cooperation, facilitates the devel- 
opment of mutually differentiated national identities within a 
common workshop identity, and permits the type of personal- 
ized interaction that can enhance social harmony. Pettigrew 
(1998b) proposed that these workshops serve as a setting for 
direct interaction that provides opportunities for developing 
coalitions of peace-minded participants across conflict lines. 
Thus, a strategic and reflective application of basic social- 
psychological principles can have significant practical bene- 
fits in situations of long-standing conflict. 

In conclusion, the issues related to social conflict, harmony, 
and integration are complex indeed. As a consequence, ap- 
proaches to understanding these processes need to address the 
issues at different levels of analysis and to consider structural 
as well as psychological factors. This diversity of perspectives 
produces a complicated and sometimes apparently inconsis- 
tent picture of the nature of intergroup relations. However, 


rather than viewing these approaches as competing positions, 
we suggest that they often reflect different perspectives on a 
very large issue. No single position is definitive, but jointly 
they present a relatively comprehensive picture of the multi- 
faceted nature of intergroup relations. 
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Prejudice (i.e., biased and usually negative attitudes toward 
social groups and their members), racism (a negatively ori- 
ented prejudice toward certain groups seen as biologically 
different and inferior to one’s own), and discrimination (un- 
fair behavior or unequal treatment accorded others on the 
basis of their group membership or possession of an arbitrary 
trait, such as skin color) have been favored topics of research 
and theorizing for many years by psychologists—especially 
social and personality psychologists—around the world. Of 
these three concepts, prejudice is perhaps the most central 
and important. Prejudice underlies racism and is also be- 
lieved to motivate acts of discrimination. Between 1887 and 
2000, nearly 4,000 papers were published on prejudice in 
journals covered by the American Psychological Associa- 
tion’s electronic database of published psychological litera- 
ture. Since the 1950s, in particular, the pace of psychological 
research on prejudice has steadily increased. 

Much like prejudice as a topic in international prose and 
poetry (Larson, 1971), the psychology of prejudice reflects 
two main themes: (a) the psychology of the bigot, which 
seeks to understand why some people are prejudiced toward 
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certain groups and their members, and (b) the psychology of 
the victim of prejudice and discrimination, which focuses on 
the psychological correlates and consequences of experienc- 
ing or perceiving oneself to be an object or target of preju- 
dice or discrimination. These two principal themes likewise 
provide the basic organization for this chapter. 

Research on the psychology of the bigot far exceeds that 
on the psychology of the victim of prejudice and discrimina- 
tion. One reason for this differential emphasis undoubtedly 
stems from the optimistic view that if the psychology of big- 
otry could be truly understood, scientifically based remedial 
efforts could then be devised and deployed to reduce, if not 
eliminate, prejudice at its source within the bigot. Yet, even if 
we suddenly possessed a magic bullet that instantly turned 
bigots into tolerant people, a strong case could be made for a 
psychology of the victim. Among other reasons, some of the 
prejudice and discrimination confronting members of op- 
pressed groups comes from structural and institutional forms 
of racism, sexism, and all other “isms” rather than being 
solely due to intolerant and bigoted individuals. The task of 
addressing the social structural bases of prejudice within so- 
ciety and its institutions is apt to be far more daunting and 
difficult than reducing prejudices in individuals with psycho- 
logical or other means—a formidable enough challenge in its 
own right (see the chapter on reducing prejudice by Dovidio 
in this volume). 
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The extant literature on prejudice is also so vast and di- 
verse that one chapter cannot realistically suffice to capture 
it all. Accordingly, this chapter’s goal is to survey major 
perspectives and research foci on the aforementioned two 
themes underlying the psychology of prejudice at the turn of 
the twenty-first century. The amount of psychological re- 
search on prejudice has, to some extent, waxed and waned 
over the last five decades of the twentieth century. The preju- 
dice literature has also been characterized by different em- 
phases or waves, such as whether prejudice is conceptualized 
as a form of psychopathology or is instead viewed as being 
the product of normal cognitive processes (Duckitt, 1994). 
The present chapter focuses on the historical continuity of 
key ideas and psychological explanations about prejudice 
over the past several decades and emphasizes links between 
classic and contemporary research on prejudice. 

We begin, then, with the psychology of bigotry. Under this 
principal theme, the classic perspectives of authoritarian per- 
sonality, just world, and belief congruence theories are con- 
sidered first. Though proposed in the 1950s and 1960s, these 
perspectives are still with us and remain important to our 
contemporary understanding of prejudice. For example, by 
focusing on beliefs and values, belief congruence theory 
presaged and anticipated more recent theories of racism (con- 
sidered later under the rubric of ambivalence approaches to 
prejudice) and also has links to more recent perspectives on 
prejudice and impression formation. After considering am- 
bivalence approaches, our focus shifts to automatic and con- 
trolled processing approaches to prejudice, especially the 
dissociation model and recent innovations in measuring prej- 
udice with automatic activation procedures. The final sec- 
tion under the psychology of bigotry highlights integrative 
approaches (viz., social dominance theory, integrated threat 
theory, and the multicomponent approach to intergroup atti- 
tudes), each of which incorporates insights from multiple 
perspectives in seeking to understand prejudice better. 

The psychology of the victim of prejudice and 
discrimination—the second principal theme of this chapter— 
begins with a consideration of attributional ambiguity per- 
spectives, focusing on the complex but important issue of 
whether and when attributing a rejection or failure to preju- 
dice can buffer one’s sense of well-being and self-esteem. 
Following that, the stressfulness of perceiving oneself to be a 
target of prejudice or discrimination and the consequences of 
stereotype threat for task performance, respectively, are con- 
sidered. Finally, the relationship of relative deprivation and 
perceived discrimination to protest and desires to take 
corrective action is considered. I begin, though, with the 
psychology of bigotry. 


THE PSYCHOLOGY OF BIGOTRY 


Authoritarian Personality Theories 
The Original Theory of the Authoritarian Personality 


The original theory of the authoritarian personality (OTAP), 
proposed by Adorno, Frenkel-Brunswik, Levinson, and 
Sanford (1950), was the first comprehensive and systematic 
attempt by psychologists to understand theoretically the roots 
of prejudice and to link ethnic, racial, religious, and ethno- 
centric prejudices to personality. Adopting the research 
methodologies of mid-twentieth-century social and clinical 
psychology along with a guiding psychoanalytic theoretical 
perspective, Adorno et al. (1950) postulated that the origins 
of the prejudice-prone authoritarian personality stemmed 
from a particular pattern of childhood influences and parental 
practices (see Brown, 1967, for an excellent in-depth analysis 
of the OTAP). Specifically, the authoritarian personality was 
the presumed result of an upbringing by parents who, among 
other things, (a) disciplined their child harshly, (b) empha- 
sized duties and obligations instead of affection in child- 
parent relations, (c) made their love dependent on the child’s 
unquestioning obedience, and (d) were status-oriented by 
being ingratiating toward those of higher social status but 
contemptuous toward those of lesser social status. 

According to the OTAP, the child in such a family devel- 
ops hostility but cannot express it toward the harsh, frustrat- 
ing, but feared parents. This submission leads the child to 
develop a sense of itself as dependent upon its parents and 
unable to defy their authority. Moreover, the child in an au- 
thoritarian family presumably deploys an array of defense 
mechanisms to deal with the repressed hostility felt toward its 
parents. By identifying with the aggressor and following a 
strategy of “if you can’t beat them, join them,” the child 
comes to idealize its parents and to identify with established 
authority in general. Repressed hostility and other impulses 
unacceptable to its parents, such as aggression and sex, are 
displaced and projected by the child onto minority and subor- 
dinate groups as safe, alternative outlets. As a result, the child 
in an authoritarian family presumably develops a rigid per- 
sonality organization characterized by a moralistic attitude 
toward unconventional people and practices, prejudice to- 
ward minority and other out-groups, and a tendency to ideal- 
ize power, status, strength, and toughness but to disdain 
tenderness, weakness, and self-introspection. 

The OTAP has several implications flowing from the cen- 
tral idea that prejudice toward ethnic and racial minorities 
and other target groups reflects an underlying, deep-seated 
personality structure in the bigot. First, prejudice should 


relate to attitudes toward a variety of issues and objects (e.g., 
attitudes toward sex, power, and political-economic issues) 
that would otherwise appear unrelated to prejudice and to one 
another because their interrelations reflect deeper, uncon- 
scious processes and connections. (OTAP’s tenet that preju- 
dice is rooted in unconscious processes is clearly echoed in 
contemporary theories of prejudice emphasizing automatic 
cognitive processing, described later, as an important feature 
of individuals’ prejudicial beliefs and their expression.) Sec- 
ond, the authoritarian personality would be prejudiced to- 
ward a wide variety of target groups. If an authoritarian 
person’s prejudice toward one group were somehow blocked, 
it would presumably be expressed, in a process of symptom 
substitution, toward other groups. Third, if prejudice is in- 
deed deeply rooted in a personality structure, it should be dif- 
ficult to change and would require depth-oriented techniques, 
such as psychotherapy and insight, that promote and produce 
profound personality change in the bigoted individual. 

Adorno et al. (1950) attempted to validate the OTAP, in 
good part, by developing a personality scale, the California 
F (for fascism) scale, whose items were constructed to tap 
the right-wing political ideology and belief syndrome that 
they theorized as comprising the authoritarian personality. 
U.S. respondents’ F scale scores correlated positively, as hy- 
pothesized, with their scores on other attitude scales de- 
signed to assess anti-Semitism, negative attitudes toward 
Blacks and other U.S. minority groups, and U.S. ethnocen- 
trism. The F scale was subsequently incorporated into nu- 
merous studies in the 1950s and 1960s. Though criticized at 
the time of its initial appearance and later for keying all its 
items in one direction and not correcting for acquiescence 
response set, the F scale was still sporadically used by psy- 
chological and survey researchers well up to the 1980s. It re- 
mained for Altemeyer (1981, 1988, 1996), in a trilogy of 
books reflecting often painstaking psychometric research, to 
demonstrate conclusively the California F scale’s serious in- 
adequacies as a measure of proneness to prejudice and to 
refocus the conceptualization of the authoritarian personality 
into a more rigorously defined construct and scale of right- 
wing authoritarianism. 


The Theory of Right-Wing Authoritarianism 


Altemeyer (1981) persuasively detailed the inadequacies of 
the California F scale, most notably its lack of scale homo- 
geneity and its saturation with response sets, especially ac- 
quiescence. Even more important, however, he created a 
psychometrically and conceptually appropriate scale of 
right-wing authoritarianism (RWA) that he has continued to 
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refine (see Altemeyer, 1996). Altemeyer defined RWA as the 
covariation of three attitudes: (a) authoritarian submission 
(i.e., ready submission to societally established authorities), 
(b) authoritarian aggression (i.e., aggression sanctioned 
by established authorities toward defined targets or social 
groups), and (c) conventionalism Oe, adherence to con- 
ventions endorsed by societally established authorities). 
Altemeyer (1981, 1988, 1996) has extensively documented 
RWA’s correlates, often with numerous replications. For ex- 
ample, RWA is concentrated more among politicians of the 
right, fundamentalist Protestants, and the poorly educated. 
Also, parents outscore their university-age offspring in RWA. 

Altemeyer’s approach to RWA differs from the OTAP in 
several important regards (Dion, 1990). By contrast to the 
OTAP’s psychoanalytic perspective, Altemeyer has favored 
social learning theory as an explanation for the development 
of RWA in individuals, especially Bandura’s versions with 
their emphases on vicarious learning and self-regulation 
by cognitive processes. Social learning theory has provided 
Altemeyer with a heuristic framework for explaining the con- 
tribution to RWA of personal experiences in one’s adoles- 
cence, of parents and peers, of university education and 
parenthood, and the paradoxical role of religion in fostering 
RWA by creating a sense of self-righteousness. Second, 
whereas the OTAP portrayed authoritarianism as a personal- 
ity dimension with its developmental roots in infancy and 
early childhood, Altemeyer has viewed RWA as an attitudinal 
orientation that emerges and crystallizes in early adoles- 
cence, suggesting that it may be more readily amenable to 
change within the individual. 

Finally, in addition to documenting its empirical links to 
prejudice, Altemeyer (1988, 1996) has particularly focused 
on the political correlates of RWA. He has shown repeatedly 
that individuals (usually university students) scoring high on 
the RWA scale are reportedly more than willing and ready to 
punish others and to infringe upon and curtail their civil 
rights, especially those who threaten the social order. RWA 
scale scores have also been found to discriminate well be- 
tween provincial and state legislators in Canada and the 
United States belonging to right- and left-wing political par- 
ties. Knowing politicians’ RWA scale scores appears to be a 
useful piece of information for predicting their attitudes and 
behaviors. 

Research by Altemeyer and others indicates that the RWA 
scale correlates between .30 and .50 with measures of pre- 
judice toward racial and ethnic minorities and ethnocentrism 
scales. RWA correlates negatively with internal motivation 
(e.g., personal standards) and positively with external moti- 
vation (e.g., social or peer pressure) by White people to 


510 Prejudice, Racism, and Discrimination 


respond without prejudice toward Black people (Plant & 
Devine, 1998). RWA consistently correlates more highly, 
between .5 and .6, with homophobia and negative attitudes 
toward homosexuals. Indeed, Altemeyer (1996) contended 
that RWA is the single individual difference variable most 
relevant for predicting attitudes toward homosexuals, espe- 
cially negative ones. 

Studies by other investigators have likewise documented 
a consistently negative relationship between RWA and 
attitudes toward homosexuals and homosexuality (e.g., 
Haddock, Zanna, & Esses, 1993; Lippa & Arad, 1999; 
Whitley, 1999), strongly reinforcing Altemeyer’s conclusion 
in this regard. The negative attitudes toward homosexuals 
by those scoring high in RWA are due to perceived impedi- 
ments of homosexuals and homosexuality to one’s values 
(Haddock et al., 1993) or to religiousness. Finally, a recent 
lexical approach to mapping the structure of social attitudes 
by Saucier (2000) showed that authoritarianism and RWA 
(along with conservatism and religiousness) defined the first 
and largest of three factors in the domain of social attitudes 
and beliefs. Clearly, the authoritarianism construct, espe- 
cially RWA, remains important in psychological research on 
prejudice and in linking prejudice to individuals’ personality 
and attitudes. 


Just World Theory 


An individual’s belief in a just world (BJW) is another psy- 
chological dimension relevant for understanding individuals’ 
reactions to ethnic and racial minorities and victims of ill for- 
tune. According to Lerner’s (1980) just world theory, we all 
believe, to a varying extent, in a just world where people get 
what they deserve and also deserve what they get. The BJW 
presumably enables us to view our world as a safe, pre- 
dictable place where we can expect to obtain desired rewards 
and to avoid unpleasant outcomes. Becoming aware of an 
innocent victim who does not deserve to suffer, however, 
threatens one’s BJW. Individuals go to considerable lengths 
to maintain and protect their BJW in the face of contrary 
information. For example, classic experiments by Lerner and 
his colleagues have demonstrated that when unable to pre- 
vent or compensate for an innocent victim’s suffering, ob- 
servers preserved their BJW by derogating the victim and 
seeing the suffering as deserved (see Lerner, 1980). 
Questionnaire measures of the BJW consistently correlate 
with the tendency to blame visible victims (e.g., ethnic and 
racial minorities, the unemployed, and immigrants and asy- 
lum seekers) with samples of university and community 
respondents in the United States, Canada, and Europe (see 
Montada & Lerner, 1998). However, the BJW construct is 


conceptually and empirically distinguishable from authori- 
tarianism. Using factor analyses of questionnaire measures 
from a sample of Canadian university students in Ontario, 
Lerner (1978) showed that authoritarianism (as measured by 
Rokeach’s 1960 F scale) and BJW loaded on separate, inde- 
pendent factors. Authoritarianism loaded on a xenophobia 
factor characterized by high loading for authoritarianism, ad- 
herence to the Protestant ethic (a belief in the virtues of hard 
work and effort), attitudes toward social changes, and nega- 
tive attitudes toward both minority groups and out-groups 
(e.g., Americans). By contrast, the BJW loaded on a win-lose 
view of the world, in which winners (e.g., Americans) were 
viewed positively, while losers (e.g., Native Indians and 
Métis) were negatively appraised. The BJW also correlates 
positively, but only modestly (i.e., between .1 and .3) with 
RWA (Lambert, Burroughs, & Chasteen, 1998). 

It is interesting that blaming victims for their ill fate 
strengthens the observer’s BJW (see Lerner & Montada, 1998). 
In turn, believing oneself to have been victimized as a target of 
prejudice or discrimination also appears to affect the BJW 
adversely. Birt and Dion (1987) found that in Toronto, the 
greater the perceived discrimination against homosexuals as a 
group, the weaker was the BJW among gay and lesbian re- 
spondents. Thus, just world theory and the BJW have relevance 
for the psychology of being a victim of prejudice and discrimi- 
nation as well as the psychology of bigotry. 


Belief Congruence Theory 


Rokeach (1960) criticized the OTAP for focusing on right- 
wing authoritarianism, contending that authoritarianism need 
not be tied inextricably to either right- or left-wing political 
views. As an alternative, he proposed the construct of closed- 
mindedness or dogmatism and developed several Dogmatism 
Scales in an attempt to measure authoritarianism and to 
assess general authoritarianism of the political left as well 
as the political right. Unfortunately, his Dogmatism Scales 
possess serious psychometric limitations and are relatively 
little used today. Moreover, if it exists, left-wing authoritari- 
anism would involve resisting and opposing conventional 
and established authorities (see Altemeyer, 1996, for an inter- 
esting discussion of dogmatism and left-wing authoritarian- 
ism and some new prospective scales for measuring these 
dimensions). 

In the same book on the open and closed mind, however, 
Rokeach, Smith, and Evans (1960) also proposed an im- 
portant perspective on prejudice: belief congruence theory 
(BCT). According to BCT, individuals cognitively organize 
their psychological world along the lines of belief congru- 
ence, liking those with similar beliefs and disliking those 


with dissimilar beliefs. Although the link between attitude 
similarity and interpersonal attraction had already been well 
demonstrated by that point, Rokeach et al.’s provocative 
contribution was to extent it to the domain of prejudice and to 
argue that all forms of prejudice were essentially different 
forms of belief prejudice. Thus, according to BCT, the racial 
conflict between Blacks and Whites in the United States is 
not due to race per se but rather to opposite or conflicting 
stands on key issues such as affirmative action in em- 
ployment and education. Likewise, the antipathies between 
English and French in Canada are not due to ethnicity per se, 
but rather to conflict over the issue of Quebec’s role, and the 
place of the French language, within Canada. In other words, 
racial and ethnic prejudice, as two examples, presumably 
reflect belief prejudice. 

BCT clearly suggests research in which belief is pitted 
against group membership characteristics such as race or eth- 
nicity. Rokeach et al. (1960), for example, had samples of 
White university students from northern and southern parts of 
the United States rate their desires to be friends with members 
of pairs of stimulus persons whose races and beliefs, both 
race-relevant and -irrelevant, were specified. For example, 
Type R pairs varied in race but kept belief constant (e.g., a 
White person who believes in God vs. a Black person who be- 
lieves in God). Type B pairs kept race constant but varied be- 
lief (a Black person who believes in God vs. a Black person 
who is an atheist). Type RB pairs varied both race and belief 
simultaneously. Differences in friendliness ratings for mem- 
bers of a stimulus person pair were taken as reflecting 
discrimination. A critical comparison suggested by BCT 
involved a choice between an in-group member with dissimi- 
lar beliefs versus an out-group member with beliefs similar to 
one’s own. For this pair comparison, individuals’ preference 
typically goes to the latter, consistent with BCT. Likewise, 
Rokeach and Mezei (1966) showed that belief similarity ex- 
cels race in predicting preferences for work partners among 
employment applicants following actual interpersonal inter- 
action and discussion between Black and White participants 
with similar and dissimilar beliefs on an issue. 

BCT remains as relevant a theory of prejudice in the 
twenty-first century as it was in the latter half of the twentieth 
century, largely due to the research over the past several 
decades of Insko and his colleagues (e.g., Cox, Smith, & 
Insko, 1996; Insko, Nacoste, & Moe, 1983) as well as recent 
contributions by Biernat and her colleagues (Biernat, Vescio, 
& Theno, 1996; Biernat, Vescio, Theno, & Crandall, 1996). 
For example, Insko et al. (1983) reviewed the literature and 
compared the strong version of BCT (when social pressure is 
absent, only belief determines racial-ethnic discrimination) 
to a weak version (when social pressure is absent, belief is 
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more important than race in determining discrimination or 
prejudice). They concluded that the weak version of BCT was 
clearly supported by the evidence, whereas the strong version 
was more problematic (e.g., race effects in the form of in- 
group favoritism occur even in the absence of social pressure). 

Cox et al. (1996) reported results of three cross-sectional 
surveys conducted over several decades of Black and White 
teenagers sampled from a North Carolina school system who 
had responded to stimulus persons varying in race and belief, 
using a belief discrepancy manipulation in which dissimilar 
beliefs were ones that respondents themselves had previously 
attributed to the other race. For White respondents, race ef- 
fects (i.e., preferring their own race to Blacks on social dis- 
tance and other attitude measures) steadily declined across 
three points in time from 1966 to 1993, as did perceived dis- 
approval of interracial contacts and relationships. The effects 
of belief similarity affected all of their dependent variables 
and were constant across decades for White respondents. For 
Black respondents, more complex findings were obtained: 
Specifically, race effects (i.e., in-group preference) did not 
decline between 1979 and 1993 (the only two time periods 
including Black respondents), and belief similarity primarily 
influenced same-race rather than interracial evaluations. 

BCT has clear links to contemporary perspectives on im- 
pression formation and prejudice. For example, Cox et al. 
(1996) noted that BCT is very similar to Fiske and Neuberg’s 
(1990) temporal-continuum model of impression formation. 
In the latter model, a perceiver begins with categorical infor- 
mation (viz., race, ethnicity, sex, age, etc.) about a person 
but proceeds, if time permits and circumstances require, to 
process individuating information (e.g., beliefs of the stimu- 
lus person). Like Fiske and Neuberg’s model, BCT deals with 
the issue of when individuating information (viz., beliefs and 
values) about a stimulus person overcomes competing cate- 
gorical information (viz., group membership) in the impres- 
sions we form of others. Likewise, the importance that BCT 
accords to perceived belief dissimilarity in eliciting prejudice 
is shared today by terror management theory, a perspective 
focusing on the psychological consequences of being aware 
of, or sensitized to, one’s mortality (Solomon, Greenberg, & 
Pyszczynski, 2000). 

BCT has also been extended to the value domain. Schwartz 
and Struch (1989) proposed that perceptions of value dissim- 
ilarities between groups underlie intergroup antagonisms and 
undercut feelings of shared humanity. Likewise, Biernat, 
Vescio, Theno, and Crandall (1996) reported studies in which 
group membership cues (race and sexual orientation, respec- 
tively, in separate studies) were crossed with value violation 
(e.g., a lazy vs. dependable worker in the race study or a good 
vs. bad parental example in the sexual orientation study). 
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Value similarity had a strong effect on stimulus person ratings 
in both studies and a stronger effect than group member- 
ship characteristics (i.e., whether the stimulus person being 
evaluated was an in-group or out-group member from the per- 
spective of the respondent). 

When only group membership cues are available, per- 
ceivers infer that an out-group member has dissimilar beliefs, 
triggering a discriminatory or prejudicial response toward her 
or him, whether the out-group is defined by race or sexual 
orientation (see Stein, Hardyck, & Smith, 1965; Pilkington & 
Lydon, 1997). When belief similarity or dissimilarity is 
crossed with group membership, belief effects (i.e., prefer- 
ring the individual with similar beliefs to one with dissimilar 
beliefs) are stronger. Race effects, however, usually remain 
evident in interpersonally intimate domains such as eating to- 
gether, dating, and marriage. Insko et al. (1983; Cox et al., 
1996) have suggested that race effects in these particular do- 
mains reflect perceived disapproval of interracial contact by 
reference persons such as parents and peers rather than inti- 
macy per se. 

In sum, as a perspective on prejudice, BCT anticipated the 
subsequent focus on the importance of values in prejudice, an 
idea pivotal to ambivalence approaches to prejudice that 
emerged in the 1970s and 1980s. I now turn to ambivalence 
approaches to prejudice. 


Ambivalence Approaches 


Myrdal (1994) was perhaps first to suggest that ambivalence 
underlies White Americans’ attitudes and behaviors toward 
Blacks. This idea lay fallow in U.S. psychology until the late 
1970s (see Crosby, Bromley, & Saxe, 1980; Pettigrew, 1979). 
By that point, though, it had become increasingly apparent that 
White Americans were less prone to strident racism asserting 
White superiority, Black inferiority, and racial segregation but 
instead inclined toward subtler expressions of racism. Al- 
though attitude surveys suggested growing racial tolerance 
among White Americans from the 1960s onward, the evidence 
was much less clear on indirect indicators (e.g., nonverbal be- 
havior and helping behavior) that feelings of White Americans 
toward Blacks had truly become more tolerant. 

In the last few decades, several groups of researchers 
concerned with prejudice, racism, and discrimination in the 
United States have characterized White Americans’ attitudes 
toward Black Americans in the latter twentieth and early 
twenty-first centuries as being ambivalent in nature, that is, 
consisting of both positive and negative elements (see Jones, 
1997). They differ, however, in the nature of the positive and 
negative elements comprising this ambivalence and other as- 
pects of their models. These ambivalence approaches include 


theories of aversive racism, symbolic and modern racism, re- 
sponse amplification, ambivalent sexism, and blatant versus 
subtle prejudice. 


Aversive Racism 


Dovidio and Gaertner (1986), for example, proposed a the- 
ory of aversive racism, in which they characterized the racial 
attitudes of most liberal, White Americans today as a subtler 
and less obviously bigoted view of Black Americans than the 
dominative racism (i.e., old-fashioned, “redneck” views of 
White superiority and Black inferiority) of previous genera- 
tions. According to the aversive racism perspective, preju- 
dice in the United States of the later twentieth century 
became a subtler, less direct, and perhaps more pernicious 
form than before, although dominative racism has not disap- 
peared altogether. 

Aversive racism theory suggests that on one hand, most 
White Americans subscribe strongly to an egalitarian value 
system, inclining them to sympathize with victims of injus- 
tice, such as Black Americans and other racial minorities, and 
to support policies promoting racial equality. This strong ad- 
herence to egalitarianism enables White Americans to regard 
themselves as being unprejudiced and nondiscriminatory. 
This positive component of the ambivalence comprising 
aversive racism is not assumed, however, to include gen- 
uinely pro-Black attitudes or sentiments of true friendship 
between Whites and Blacks in the United States. 

On the other hand, owing to a historically racist culture in 
the United States and certain feelings of negative affect (e.g., 
uneasiness, disgust, fear, and discomfort, though not neces- 
sarily hostility or hate) toward Black Americans, most White 
Americans are assumed to avoid Black-White interracial in- 
teractions and to be biased and discriminatory toward Black 
Americans in situations in which they can do so without 
appearing to be prejudiced or in which it may be justified 
under a rationale preserving their erstwhile egalitarian val- 
ues. Aversive racism is not assumed to be a psychopatho- 
logical phenomenon but rather to reflect normal cognitive 
processes and the influence of sociocultural and historical 
processes on White Americans. 

Several implications flow from aversive racism theory and 
the idea that aversive racists are strongly motivated and vigi- 
lant to avoid appearing racially bigoted. First, traditional prej- 
udice measures in the form of standard attitude scales would 
presumably be difficult and perhaps of limited use for assess- 
ing aversive racism, according to Dovidio and Gaertner 
(1986). Nevertheless, based on survey research up to the 
1990s, Dovidio and Gaertner (1991) estimated that perhaps a 
fifth of White U.S. citizens were overtly racist. The other 80% 


of White Americans would presumably be, to varying extent, 
ambivalent toward Black Americans. White Americans who 
espouse a political philosophy of liberalism should be espe- 
cially prone to aversive racism (Biernat, Vescio, Theno, & 
Crandall, 1996). 

As noted earlier, a second implication of aversive racism 
theory is that in situations where discrimination would be 
blatant and where the appropriate behavior is normative and 
well-defined, White Americans would be unlikely to discrim- 
inate against Black Americans because doing so would con- 
tradict their allegedly nonprejudiced, egalitarian ideals and 
self-images. However, in ambiguous situations where the dis- 
crimination is less blatant or obvious, White Americans 
should be more likely to be biased against Black Americans 
because in that case they can do so without necessarily threat- 
ening their self-images. This feature of aversive racism 
theory—emphasizing the normative structure of situations as 
a moderator variable for predicting when racially ambivalent 
White Americans will or will not discriminate against Black 
Americans—is perhaps its most unique and distinctive fea- 
ture among ambivalence approaches (Biernat, Vescio, Theno, 
& Crandall, 1996). These predictions have been amply sup- 
ported in studies of White Americans, mostly college stu- 
dents, by Dovidio, Gaertner, and their colleagues. 

This supportive research has included studies of helping, 
social cognition studies measuring reaction times linking the 
words “white” and “black” to positive and negative stereo- 
type characteristics, studies where pictures of Black and 
White individuals’ faces are presented as primes (Dovidio & 
Gaertner, 1986), research on juridic recommendations of the 
death penalty in a capital case (Dovidio, Smith, Gershenfeld 
Donnella, & Gaertner, 1997), and personnel selection recom- 
mendations in 1989 and 1999 (Dovidio & Gaertner, 2000), 
among others. In all of these studies, findings supported aver- 
sive racism theory and were unaffected by whether the par- 
ticipants had scored low or high on standard prejudice 
measures, though high scorers on such scales often showed 
greater bias toward Blacks than did low scorers. 

What remains to be demonstrated by aversive racism the- 
orists is that it is actually the conflict or tension between the 
positive element of egalitarianism, or one hand, and negative 
feelings toward Blacks, on the other, that constitutes the un- 
derlying basis of ambivalence for White Americans’ attitudes 
and behaviors toward Blacks and is the driving force behind 
their discrimination of Blacks in ambiguous situations. 
Indeed, egalitarianism is the value that perhaps most strongly 
promotes tolerance and mitigates negative feelings toward 
Blacks by White Americans. Presenting liberal-oriented U.S. 
university students with an egalitarian message has been 
shown by Biernat, Vescio, and Theno (1996) to elicit more 
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positive ratings of a Black stimulus person than a White one. 
Whether egalitarianism promotes tolerance among indi- 
viduals in countries other than the United States, however, 
remains to be seen. With White participants from Portugal 
and Brazil, Vala and Lima (2001) found that activating an 
egalitarian norm affected perceptions and evaluations of a 
White but not a Black stimulus person. 

Although aversive racism theory has an excellent track 
record in predicting a variety of cognitions and behaviors in 
the social psychological laboratory, documenting the precise 
nature of White Americans’ ambivalence toward Blacks 
remains a task to be completed. Using recently developed 
automatic processing techniques (described later) to assess 
nonconscious feelings of antipathy toward Blacks (or other 
oppressed group members) in conjunction with standard 
value measures to assess egalitarianism and other potentially 
race-relevant values may provide some useful leverage for 
assessing aversive racism in White participants and for test- 
ing the theory directly. 


Symbolic and Modern Racism 


Closely related to aversive racism theory are the constructs of 
symbolic and modern racism that have been suggested by 
several researchers, such as McConahay (1986) and Sears 
(1988; Sears & Funk, 1991). The symbolic and modern 
racism constructs originated because standard prejudice 
scales of the 1950s and 1960s became increasingly problem- 
atic for U.S. survey researchers in the 1960s and 1970s, 
owing to social desirability issues (i.e., the transparency of 
what they were measuring) and because they failed to predict 
racially relevant political behavior, such as voting intentions 
for capable Black candidates in elections where candidates of 
both races were running and racism likely played a role in the 
outcome (see Kinder & Sears, 1981). 

What did predict voting and support for progressive racial 
policies were attitude items reflecting an abstract, moral 
tone that Black Americans were violating cherished White 
American values such as individualism and the Protestant 
ethic extolling the virtues of individual effort and hard 
work—qualities White Americans often felt were lacking 
among Black Americans. Ambivalence, then, arises because 
many White Americans want to maintain a nonprejudiced 
image even though they privately resent and dislike Blacks 
and feel the racial discrimination toward Blacks in the United 
States no longer exists. In protecting themselves from the 
appearance of being prejudiced, symbolic or modern racists 
justify their negative attitudes and behaviors toward Blacks 
by invoking nonprejudiced explanations in the form of 
American values or ideals. A symbolic or modern racist, for 
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example, might justify opposition to affirmative action prog- 
rams benefiting Blacks by saying that they violate the value 
of equality by favoring one group over others. 

The constructs of symbolic and modern racism are similar 
to aversive racism. In both cases, the ambivalence arises from 
negative feelings toward Black people versus core American 
values. In both cases, White Americans dislike and avoid 
racial prejudice but seek indirect ways to manifest their neg- 
ative feelings toward Black Americans. All three racism 
constructs are interested in predicting interpersonal behavior, 
with symbolic and modern racism being used mainly to pre- 
dict political attitudes and behavior, typically in surveys. 
Symbolic and modern racism are assumed to emerge from 
early political socialization and not to be based on personal 
experience, personal competition, or direct, personal, eco- 
nomic threats to Whites from Blacks. Unlike aversive racism, 
however, items and scales to assess symbolic and modern 
racism have been constructed by their adherents and have 
proven very popular in survey and experimental research on 
prejudice by psychologists in the late twentieth century. 

McConahay (1986), for example, presented a Modern 
Racism Scale (MRS) and an Old-Fashioned Racism Scale 
(OFRS), with moderate, positive correlations between the 
two, and items loading on one or the other factor in ex- 
ploratory factor analyses. Whereas the OFRS was reactive 
(i.e., White U.S. respondents’ scores were lower when it was 
administered by a Black experimenter than by a White one), 
the MRS was nonreactive (at least in the 1980s). Items from 
symbolic or modern racism scales became the standard mea- 
sure of prejudice toward Blacks in the United States in the 
1980s and 1990s and are still frequently used in this regard. 
In the twenty-first century, newer scales such as the Blatant 
and Subtle Prejudice Scales (Pettigrew & Meertens, 1995) or 
the Social Dominance Orientation Scale (Pratto, Sidanius, 
Stallworth, & Malle, 1994), both of which are discussed later, 
are perhaps more apt to become the preferred, “paper-and- 
pencil” measures of prejudice. 

Sniderman and Tetlock (1986), themselves prominent po- 
litical psychologists, have strongly criticized the constructs 
and measurement of symbolic and modern racism. Among 
other things, they criticized symbolic and modern racism for 
being unclear as to the causal relation between anti-Black 
affect and core American values, for equating political policy 
preferences (e.g., opposition to busing school children or 
affirmative action) with racism itself, and for suggesting that 
old-fashioned racism no longer existed in the United States. 
Sniderman and Tetlock even contended that symbolic racism 
theory was unfalsifiable and therefore unscientific. The MRS, 
they also charged, was confounded with political conser- 
vatism. Sniderman and his colleagues showed that political 


conservatism related not to rejection and prejudice toward 
out-group members but rather to greater support for those, 
whether from the in-group or out-group, who behaved in a 
manner consistent with politically conservative principles 
(e.g., Sniderman, Piazza, Tetlock, & Kendrick, 1991). 

Although proponents of symbolic and modern racism have 
not thoroughly explored the presumed link to values, Biernat, 
Vescio, and Theno (1996) did so in a series of studies. For ex- 
ample, after completing Rokeach’s Value Survey, White U.S. 
undergraduates were asked to rate the extent to which four tar- 
get groups, including Black Americans, supported or violated 
their values. Whether considering their top value or their 
hierarchy of values, Black Americans were perceived as 
less supportive of their values than were White Americans; 
however, there was no difference in perceived violation of val- 
ues for these two target groups. Likewise, differences in rat- 
ings of White versus Black support and violation of values 
correlated with measures of modern racism as well as pro- and 
anti-Black attitudes, although these correlations were consis- 
tently modest in magnitude. Consistent with theories of sym- 
bolic and modern racism, Biernat et al. showed that White 
individuals who scored high on the Protestant work ethic and 
had their values made salient rated a Black employee less 
positively than a White employee when they violated the 
work ethic. 

Thus, Biernat, Vescio, and Theno’s (1996) research par- 
tially supported models of symbolic and modern racism. How- 
ever, if violating core American values is indeed one of the two 
key components of symbolic and modern racism, one would 
expect to find much stronger relationships than they did. 
Biernat et al. also questioned the assumption that modern- 
symbolic racism is a blending of negative affect toward Blacks 
and core American values, such as individualism. Their analy- 
ses suggested that egalitarianism is a stronger predictor than 
individualism of intergroup attitudes and that combining neg- 
ative affect with value measures added little beyond the sepa- 
rate components in predicting responses to an out-group 
member in their studies of race and sexual orientation. 

In the ambivalence approach presented next the focus 
shifts to conflict between pro- and anti-Black attitudes linked 
to values as the determinant of positive and negative reac- 
tions to Blacks by White Americans. 


Ambivalence Amplification 


Katz and Hass (1988) contended that most White Americans 
hold both positive and negative attitudes about Black 
Americans that are relatively independent of one another. A 
White American who endorses positive statements about 
Blacks on a “Pro-Black scale” is neither more nor less likely 


to agree with anti-Black statements from a separate “Anti- 
Black scale.” Moreover, for White American respondents, 
these racial beliefs relate to different and conflicting value 
systems. Pro-Black attitudes (e.g., beliefs that Blacks have a 
disadvantaged position in society) are linked to humanitar- 
ian-egalitarian values. By contrast, anti-Black attitudes (e.g., 
beliefs that Black people lack the drive or skills necessary to 
improve their socioeconomic position) related to White re- 
spondents’ beliefs in individualism and the Protestant ethic. 

Katz and his colleagues proposed that when these con- 
flicting beliefs are salient to a White person holding them and 
who also becomes aware of the ambivalence, he or she expe- 
riences negative arousal and is motivated to reduce this ten- 
sion. Indeed, Hass, Katz, Rizzo, Bailey, and Moore (1992) 
have demonstrated that White American participants experi- 
enced negative mood change when their racial ambivalence 
toward Black people was stimulated by reading a vivid 
description of an ugly racial incident in which gangs of 
young Whites in New York City viciously beat some Black 
Americans whose car had broken down in their neighbor- 
hood. This discomfort can be reduced, according to these 
theorists, by intensifying either the positive or negative com- 
ponent of the conflicted attitude toward Blacks—an idea 
defining response amplification theory. 

Response amplification theory suggests that for ambiva- 
lent White Americans attitudes and behavior will be more 
polarized or amplified toward Black Americans than toward 
fellow White Americans. Experimental evidence for re- 
sponse amplification theory, as applied to Black Americans 
and other socially stigmatized groups such as the handi- 
capped, was presented by Katz and Glass (1979). For exam- 
ple, White U.S. undergraduates who had been led to believe 
that they had delivered a series of strong shocks to a victim 
derogated a Black victim more than a White victim, and this 
derogation was a function of the extent of ambivalence as 
reflected by measures of prejudice and sympathy toward 
Blacks. Whether racial ambivalence potentiates positive or 
negative responses depends on the situational context and the 
ambivalent person’s behavioral options. 


Ambivalent Sexism 


Sexism, like racism, reflects ambivalence. Glick and Fiske 
(1996) viewed sexism as a multidimensional construct in- 
volving ambivalence. They proposed that ambivalent sexism 
comprises two positively correlated components: hostile sex- 
ism (HS) and benevolent sexism (BS). The former consists of 
hostility, negative attitudes, and negative stereotypes of 
women. By contrast, BS is a set of interrelated sexist attitudes 
that portray women stereotypically and in restricted roles but 
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that are subjectively positive in affective tone from the per- 
ceiver’s viewpoint and elicit prosocial behaviors (e.g., help- 
ing) or intimacy seeking (e.g., self-disclosure). Benevolent 
sexism reflects a positive attitude toward women and positive 
stereotypes about women, although Glick and Fiske do not 
view it as a good thing. Although both HS and BS were orig- 
inally postulated to include three underlying components, 
this conjecture was supported only for BS, while HS was 
found to be a unidimensional construct. 

Both HS and BS relate, as one would expect, to other mea- 
sures of modern sexism (Swim, Aikin, Hall, & Hunter, 1995) 
and neo-sexism (Tougas, Brown, Beaton, & Joly, 1995). 
Benevolent sexism, however, relates to subtler forms of sex- 
ism than HS, masked as it is in a veil of positive sentiment 
toward women. Glick and Fiske (1996) suggested that among 
women, BS reflects a tendency to adopt as one’s own the 
prevalent forms of sexist prejudice in U.S. society. They also 
suggested that while modern- and neo-sexism measures 
excel in predicting gender-related political attitudes, HS 
and BS scales together (comprising the Ambivalent Sexism 
Inventory, or ASI) would be better at predicting attitudes 
and behavior in the realm of interpersonal and romantic 
relationships between women and men. As well, sexist 
ambivalence—the combination of scoring high on both BS 
and HS—is believed to polarize attitudes and behaviors to- 
ward women, in a process like that proposed by Katz, Haas, 
and their colleagues for amplified responses toward Black 
Americans and the physically handicapped as induced by 
ambivalence. 


Blatant and Subtle Prejudice 


The preceding ambivalence approaches differ in whether 
they assume that old-fashioned prejudice still exists or 
whether it is seen to be as potent as its modern or symbolic 
variants. For example, advocates of symbolic and modern 
racism suggest that it is the more dominant form of prejudice 
today. Other ambivalence approaches assume that both forms 
are prevalent and require assessment by researchers inter- 
ested in prejudice. For example, Pettigrew and Meertens 
(1995) postulated the existence of both blatant and subtle 
prejudice toward out-groups today. They characterized bla- 
tant prejudice (the traditional form) as “hot, close, and direct” 
and suggested that it consisted of two components: (a) per- 
ceived threat and rejection of the out-group and (b) avoidance 
of intimacy (especially sex and marriage). By contrast, subtle 
prejudice (the modern variant) is “cool, distant, and indirect” 
and includes three components: (a) defense of traditional 
values, (b) exaggeration of cultural differences, and (c) denial 
of positive emotional responses toward out-groups. 
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Pettigrew and Meertens (1995) created separate multi-item 
scales for blatant and subtle prejudice toward immigrants 
and administered them to survey respondents from four 
European countries with regard to several different target 
groups. Across countries, confirmatory factor analyses sug- 
gested that two-factor models surpassed a one-factor model, 
but that a correlated two-factor model and a hierarchical model 
in which blatant and subtle prejudice were first-order factors 
subsumed under a general second-order factor were equally 
viable models to account for the pattern of scale scores. 

An advantage of using both subtle and blatant prejudice 
scales is that a threefold typology emerged that yielded dif- 
ferent patterns of responses to immigrants in Pettigrew and 
Meertens’s (1995) research. Respondents who scored low on 
both blatant and subtle prejudice scales were called “equali- 
tarians,” a group who were most in favor of maintaining and 
enhancing immigrants’ rights in their countries and who 
presumably have internalized most strongly contemporary 
norms of tolerance in their societies. Respondents scoring 
high on both scales comprised “bigots,” who were most in 
favor of returning immigrants to their home countries and 
restricting immigrants’ rights and were assumed to have 
rejected current norms against blatant prejudice. “Subtles” 
were respondents scoring low on blatant prejudice but high 
on subtle prejudice and were assumed to have only partially 
and incompletely internalized norms against blatant preju- 
dice. On immigration issues, “subtles” adopted a middling, 
nonprejudicial stance between bigots and equalitarians and 
required justification for restricting immigrants’ rights. The 
“subtles” category, of course, is the analogue to symbolic, 
modern, and aversive racism in that these people strive to 
appear nonprejudiced and are assumed to express their biases 
against immigrants in ways that do not violate current norms 
against blatant prejudice. 


Automatic and Controlled Processing 
The Dissociation Model 


As noted earlier in discussing ambivalence approaches to 
prejudice, some prejudice researchers (e.g., Crosby et al., 
1980; Dovidio & Gaertner, 1986) have suggested that most 
White Americans are prejudiced toward Black people and 
that subtle behaviors that individuals can less readily monitor 
and censor (e.g., helping, nonverbal behavior, reaction times 
to briefly presented stimuli) are better gauges of White 
Americans’ true racial attitudes. In an influential contribution 
to the prejudice literature, Devine (1989) strongly challenged 
and countered this view. She claimed that it implied that 


prejudiced beliefs and attitudes were unamenable to change, 
as well as that prejudice is an inevitable, unavoidable product 
of normal cognitive processes. 

As an alternative, Devine (1989) proposed a dissociation 
model that emphasizes the importance of distinguishing 
between automatic versus controlled cognitive processing 
and the differentiation of stereotype activation versus per- 
sonal beliefs. The automatic versus controlled processing dis- 
tinction emerged in cognitive psychology during the 1970s 
and subsequently has become an increasingly important con- 
struct in social and personality psychology (see Bargh, 1989). 
Automatic processing refers to unintentional, nonconscious 
cognitive processing that occurs without effort or intention 
and is unlimited by cognitive capacity. By contrast, con- 
trolled cognitive processing refers to intentional, effortful, 
and goal-directed processing of information that is assumed 
to be under the person’s awareness and control but subject to 
limitation by cognitive capacity (e.g., attentional limits). Ap- 
plying this distinction to the relationship between stereotyp- 
ing and prejudice, Devine (1989) suggested that stereotype 
activation was an automatic process that did not require in- 
tention, attention, or cognitive capacity on the part of a per- 
ceiver. Instead, whenever an appropriate cue is present, such 
as the appearance of a Black person or a symbolic represen- 
tation of one, a White U.S. perceiver’s stereotype of Black 
people should be activated automatically. 

Devine (1989) proposed that common socialization expe- 
riences in late-20th-century America have led White people 
in the U.S. to become equally knowledgeable about the 
prevalent and generally negative stereotype of Black people, 
regardless of their personal levels of prejudice. As a conse- 
quence of this common knowledge, her dissociation model 
predicted that automatic activation of the stereotype would 
be equally strong and unavoidable for White U.S. perceivers, 
regardless of the extent of their personal prejudice toward 
Blacks. 

Prejudiced and nonprejudiced White persons, however, 
were expected in the dissociation model to differ in their per- 
sonal beliefs concerning Black people, and this difference in 
personal beliefs regarding Blacks should be manifested on 
cognitive tasks involving deliberate, controlled cognitive 
processing. Specifically, on such a task, nonprejudiced White 
persons should inhibit and override their negative cultural 
stereotype of Blacks because it conflicts with their egalitarian 
values and their personal beliefs and to replace the pejorative, 
Black stereotype with more positive perceptions and attribu- 
tions of Black persons. On this latter point, Devine’s (1989) 
analysis of nonprejudiced perceivers agrees with aversive 
racism and modern-symbolic racism theories in positing a 


conflict between core American values, on one hand, and a 
desire to avoid appearing prejudiced, on the other. 

For prejudiced White persons, on the other hand, the cul- 
tural stereotype of Blacks and their personal beliefs about 
them are congruent with one another. Because they do not 
conflict, there would be little need for them to censor their 
negative personal beliefs concerning Black people. Thus, ac- 
cording to the dissociation model, White persons varying in 
prejudice toward Black people should differ on cognitive 
tasks involving controlled processing but not on tasks involv- 
ing automatic processing. 

Devine (1989) supported her dissociation model with 
three studies, in which the MRS served as the measure used 
to define high versus low levels of prejudice in White par- 
ticipants. One study demonstrated that on an open-ended 
measure, both high- and low-prejudice White participants 
listed very similar characteristics, and predominantly nega- 
tive ones, when asked to describe the cultural stereotype of 
Black people—an effect since replicated by other investiga- 
tors in the United States and the United Kingdom (e.g., 
Lepore & Brown, 1997). Another study deployed a con- 
trolled processing task by giving participants ample time to 
list alternative labels for “Black Americans” and then asking 
them to list all of their thoughts in response to this label. 
Thoughts on this listing task were categorized by judges as 
being positive beliefs, negative beliefs, or traits. Highly prej- 
udiced White participants listed negative traits most often, 
while less prejudiced ones were more likely to list thoughts 
reflecting positive beliefs—uncontroverisal and unsurprising 
results. 

In what has since become a more controversial study, 
however, Devine (1989) also compared reactions of White 
persons varying in prejudice on an automatic processing task 
in which participants were subliminally presented with word 
primes parafoveally (i.e., outside the central visual field) 
while performing a perceptual vigilance task. Word primes 
were related to the Black stereotype either 20% or 80% of the 
time and included reference both to the category Blacks and 
to stereotypic traits for Black Americans (e.g., lazy, poor, op- 
pressed, etc.). Following this automatic processing task in 
which participants had been primed to varying extent with 
racially relevant stimuli, they read an ambiguous story about 
a male person of unspecified race performing various as- 
sertive behaviors and then rendered their impressions of him. 
As predicted by the dissociation model, impressions of the 
stimulus person were affected by the automatic processing 
task in that attributions of hostility were more likely when 
primes from the preceding automatic processing task had 
been proportionally more stereotypically oriented (i.e., in the 


The Psychology of Bigotry 517 


80% condition instead of the 20% one), with no difference as 
a function of the participants’ level of prejudice. 

From the preceding research, Devine (1989) concluded 
that controlled processing rather than automatic processing 
differentiates the highly prejudiced from their less prejudiced 
White counterparts. Moreover, White people with egalitarian 
ideals employ controlled processing to try to behave and 
think in an unprejudiced manner toward Black people. Both 
high- and low-prejudiced White Americans have the same 
stereotypic knowledge of Black people and are presumably 
both susceptible to having this stereotypic knowledge that is 
presumably elicited automatically beneath their awareness. 
However, stereotypic and prejudicial responses can be over- 
ridden by intentional and flexible controlled processing. 

Deciding to be unprejudiced is, according to the dissocia- 
tion model, a conscious, intentional act of controlled process- 
ing. Inhibiting and overriding stereotypic and prejudicial 
responses elicited by automatic activation processes and re- 
placing them with more appropriate and positive beliefs to- 
ward Blacks and other minorities held by individuals seeking 
to be unprejudiced is akin, Devine has argued, to their “break- 
ing a bad habit.” That is, the White person trying to be un- 
prejudiced toward Black people must consciously and 
deliberately decide to forego prejudicial beliefs and actions 
(the bad, old habit) and to replace them with new attitudes and 
behaviors consistent with an egalitarian outlook (the new, 
good habit). In essence, Devine’s (1989) dissociation model 
suggests that for those seeking to be (or actually being) un- 
prejudiced, automatic and controlled processes must become 
dissociated from one another, with the good habit of tolerance 
strengthened at the expense of the bad habit of prejudice. 

Monteith (1993; Monteith, Devine, & Zuwerink, 1993; 
Monteith & Walters, 1998) and her colleagues (Devine & 
Monteith, 1999; Devine, Monteith, Zuwerink, & Elliot, 
1991) have explored in depth the self-regulatory processes by 
which low-prejudice White Americans (i.e., those who score 
low on prejudice measures, such as the MRS) inhibit preju- 
diced responses and maintain egalitarian standards. First, 
low-prejudice Whites do indeed have personal beliefs and 
standards against expressing prejudice toward oppressed 
groups, such as Black people and homosexuals, but many of 
the former also acknowledge responding from time to time in 
ways that are more prejudiced than their personal beliefs 
would warrant. Second, when they do find themselves 
exhibiting a biased response toward an oppressed group 
member (i.e., what Monteith and her colleagues term a 
prejudice-related discrepancy), low-prejudice White 
Americans experience emotional responses in the form of 
guilt and negative, self-directed affect as well as increased 
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self-focus and self-attention, and they subsequently monitor 
their behavior more carefully to ensure that it conforms more 
closely to their personal beliefs. 


Critique of the Dissociation Model 


The dissociation model’s contention that prejudiced and un- 
prejudiced perceivers would be equally responsive to priming 
by an automatic processing task has, however, been recently 
criticized and questioned by several investigators. Lepore and 
Brown (1997), for example, criticized Devine’s (1989) auto- 
matic processing study for including both categorical cues 
referring to Blacks as a social group and stereotypic traits of 
Black people among the subliminal primes. As an alternative 
to the dissociation model, Lepore and Brown argued that the 
link between the category and the stereotypic features relating 
to Blacks differentiates White perceivers varying in preju- 
dice, with the link being much stronger and more chronically 
accessible for highly prejudiced White persons than for less 
prejudiced ones. If only categorical cues referring to Blacks 
as a group comprise the subliminal primes on an automatic 
processing task, one should observe highly prejudiced White 
persons subsequently forming more negative impressions 
than less prejudiced ones—a result that Lepore and Brown 
(1997, Study 2), in fact, have obtained. 

By contrast, subliminal cues that include stereotypic at- 
tributes along with the categorical label also prime the stereo- 
typic knowledge of both high- and low-prejudice White 
perceivers, which has been shown to be highly similar. Thus, 
subliminal cues containing both category references and 
stereotypic attributes on an automatic processing task would 
not be expected to reveal differences between White persons 
varying in prejudice, a prediction that Lepore and Brown 
(1997, Study 3) also supported in a conceptual replication of 
Devine’s (1989) automatic processing study. Null hypothesis 
predictions have been rife on the issue of automatic process- 
ing effects on impression formation as a function of the 
White participants’ prejudice toward Blacks. Predicting 
the null hypothesis, however, is problematic because tests of 
such hypotheses often lack sufficient statistical power (see 
Cohen, 1992). 

Kawakami, Dion, and Dovidio (1998) further reinforced 
Lepore and Brown’s conceptual analysis in two ways. They 
found that high-prejudice White persons were more respon- 
sive to primes on a single task where automatic and con- 
trolled processing could both be experimentally manipulated 
by varying stimulus onset asynchrony (i.e., the difference in 
time between presentation of the prime and a subsequent, to- 
be-responded-to stimulus). Second, individual differences in 
stereotype attribution as assessed by a separate measure 


correlated with stereotypic activation on the experimental 
task when it allowed automatic processing. 

With regard to Devine’s automatic processing findings, 
Fazio, Jackson, Dunton, and Williams (1995) have suggested 
that the MRS has become a reactive and insensitive measure 
of racial prejudice. Consistent with this point, they showed 
that the levels of modern racism in White American partici- 
pants failed to moderate priming effects on a procedure 
(described later) that was designed to elicit automatic activa- 
tion of racial attitudes. 

Taken together, the preceding critiques of the dissociation 
model have important implications for prejudice and its re- 
duction. According to Lepore and Brown’s (1997) alternative 
perspective, low-prejudice White persons have never estab- 
lished the bad habit of prejudice toward Black people in the 
first place or established it much less firmly than their highly 
prejudiced White counterparts. For low-prejudice White per- 
sons, the link between the social category, Blacks, and the 
culturally stereotypic information about them is already weak 
and tenuous. Rather than unlearning a bad habit, those inter- 
ested in reducing prejudice in White people presumably need 
to focus on the highly prejudiced Whites and on weakening 
the associative strength of the links between the category of 
Blacks as a social group and negative stereotypic information 
and content about them. 


Automatic Activation as Prejudice Measures 


Automatic activation techniques are a means of unobtrusively 
measuring racial and other intergroup attitudes and an alterna- 
tive to traditional attitude scales, which are often compromised 
by social desirability and transparency regarding the goal of 
assessing prejudice. Even the MRS has recently been shown to 
be sensitive to social desirability, yielding lower scores from 
White participants when administered by a Black experi- 
menter than by a White one (Fazio et al., 1995, Study 3). From 
their findings in several studies, Fazio et al. (1995) have styled 
the MRS as a measure of White Americans’ “willingness to 
express” negative feelings or opinions about Blacks, one that 
also confounds racism with political conservatism. Other re- 
searchers have noted that correlations between old-fashioned 
and modern and symbolic racism are higher than would be 
expected if these were truly two separate constructs rather than 
different aspects of a single construct (see Dovidio et al., 1997; 
Swim et al., 1995). 

As an alternative, Fazio et al. (1995) proposed a priming 
paradigm using automatic activation of attitudes from mem- 
ory as an unobtrusive measure of racial attitudes that is 
demonstrably superior to the MRS. The priming procedure 
consists of multiple trials on a computer in which the prime 


consists of a symbolic representation of the attitude object, 
such as digitized photos of stimulus persons from one or 
more racial groups. Immediately following the prime, a target 
in the form of a positive or negative evaluative adjective is 
displayed, and the participant is required to indicate its con- 
notation as either good or bad by pressing different computer 
keys. When the prime and target are evaluatively congruent 
for the participant, responding should be facilitated as mani- 
fested in a faster, more efficient reaction time. By contrast, 
when prime and target are evaluatively incongruent with one 
another from the viewpoint of the participant, responding 
should be slowed, as reflected by a longer reaction time. 

Using this priming procedure, Fazio et al. (1995) showed 
in several studies that White U.S. university students showed 
greater facilitation when negative adjectives were preceded by 
photos of Black people. By contrast, a small sample of Black 
participants showed response facilitation on the priming task 
when photos of Blacks preceded positive adjectives and when 
White photos were preceded by negative adjectives. More- 
over, scores on this unobtrusive measure of racial attitudes had 
predictive validity for a Black experimenter’s ratings of the 
participant’s friendliness and interest when interacting with 
her, to which MRS scores were unrelated. 

Along similar lines, Greenwald, McGhee, and Schwartz 
(1998) suggested the Implicit Association Test (IAT) as a 
related procedure for assessing implicit attitudes, defined as 
behaviors, feelings, or thoughts elicited outside the partici- 
pant’s awareness by automatically activated evaluation 
procedures (see Greenwald & Banaji, 1995). The IAT con- 
sists of a series of five discrimination tasks, conducted on 
computer, in which the participants differentiate between two 
categories of stimuli by responding as quickly as possible on 
different computer keys. 

If one were assessing White attitudes toward Black people 
with the IAT, the first task would be an initial target-concept 
discrimination in which they might be asked to differentiate 
between White and Black American first names by pressing 
different keys on the computer. The second task is an associ- 
ated attribute discrimination in which the participant differ- 
entiates pleasant from unpleasant words. The third step is the 
initial combined task in which the two prior tasks are now su- 
perimposed or mapped onto one another, such as using one 
key for individual stimuli that are either White or pleasant 
and another key for stimuli that are either Black or unpleas- 
ant. In the fourth step, the response keys from the first 
task are reversed. The fifth and final step, the reverse com- 
bined task, reverses the response key contingencies from the 
third step (e.g., one key for stimuli that are either White or 
unpleasant or either Black or pleasant. The difference in 
speed of responding to the two combined tasks on the IAT 
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provides the measure of implicit attitudes. Following the ear- 
lier example, a latency shorter for the first combined task than 
for the reverse combined task would suggest a less positive or 
more negative implicit attitude toward Blacks by a White 
participant. 

Using the IAT, Greenwald and Banaji (1998) found evi- 
dence that it may reveal the existence of prejudice that is not 
evident on paper-and-pencil attitude measures such as the 
semantic differential scale. Whereas a majority of a sample of 
White American participants in one study indicated no 
Black-White difference or even a pro-Black preference on 
paper-and-pencil ratings, all but one had IAT scores indicat- 
ing a White preference, presumably a nonconscious one. 
Greenwald and his colleagues have also found modest posi- 
tive correlations between IAT scores and some “explicit” at- 
titude measures such as the feeling thermometer (in which 
social groups are rated on a 100-point thermometer scale) and 
a diversity index but not others, especially semantic differen- 
tial scales. IAT scores, they suggested, do not merely reflect 
greater familiarity with one’s in-group (e.g., naming prac- 
tices, facial stimuli) compared to an out-group. The IAT pro- 
cedure, they also proposed, yields stronger effect sizes and is 
therefore more sensitive than the priming procedure devised 
by Fazio et al. (1995) and by other investigators. 

One would not necessarily expect implicit and explicit 
measures of racial attitudes to correlate highly with one 
another. Demonstrating this point, Dovidio, Kawakami, 
Johnson, Johnson, and Howard (1997) showed that the predic- 
tive validity of implicit (i.e., elicited by automatic processing 
techniques, such as priming or the IAT) and explicit measures 
of racial attitudes (i.e., elicited by self-report measures such as 
scales of modern and old-fashioned racism) of White partici- 
pants toward Black people diverges in a predictable manner. 
Specifically, implicit prejudice measures predicted sponta- 
neous cognitions and behaviors that are not easily monitored 
but reflect automatic processing, such as performance on a 
word-completion task in which answers may be racially tinged 
or nonverbal behavior such as eye blinking or direct gaze 
when interacting with a Black person. By contrast, explicit 
prejudice measures possessed predictive validity for delibera- 
tive thoughts and actions that reflect controlled processing, 
such as judgments of a Black defendant’s guilt in a juridic 
decision-making task and evaluations. 

Fazio et al. (1995) had previously obtained a similar pat- 
tern of findings. Their unobtrusive priming measure of preju- 
dice in Whites had predictive validity for rated quality of 
interaction with a Black experimenter, whereas explicit mea- 
sures predicted deliberative acts such as attractiveness ratings 
of photos and evaluations of the fairness of the Rodney King 
verdict (in which White police officers in Los Angeles were 
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exonerated from charges of using excessive force with a 
Black defendant). Both explicit and implicit measures pre- 
dicted attributions of responsibility for the causes of rioting 
following the Rodney King verdict. Thus, implicit attitude 
measures add an important, new, and separate dimension to 
the conceptual and methodological toolbox that psycholo- 
gists have to assess prejudice. 

To summarize, both automatic and controlled cognitive 
processing play an important role in the social psychology of 
bigotry. Racial stimuli presented below or just above the 
threshold of awareness operate as primes that influence 
thinking and behavior by White persons toward members of 
a stereotyped group such as Blacks. If the racial prime 
includes only reference to the social category, automatic acti- 
vation will activate stronger stereotypes among the more 
highly prejudiced Whites than among the less prejudiced. If 
the racial prime includes both categorical reference as well as 
stereotypic trait information, differences on dependent mea- 
sures (e.g., impression formation) between participants dif- 
fering in levels of prejudice by Whites will usually no longer 
be apparent. 

An important development for automatic processing tech- 
niques has been their utilization for assessing prejudice, 
avoiding problems with standard attitude measures of preju- 
dice such as social desirability, and deliberately masking 
one’s negative feelings toward specific groups. These tech- 
niques, such as the priming methodology as well as the IAT, 
will undoubtedly be increasingly utilized to assess individu- 
als’ nonconscious prejudices, with the resulting measures 
being especially helpful in predicting behaviors and cogni- 
tions toward out-group members that an individual cannot 
easily monitor and censor. 


Integrative Approaches 


The rubric of integrative approaches includes perspectives on 
prejudice that include the insights of multiple theoretical 
viewpoints concerning the psychology of bigotry that their 
advocates have organized into a single, coherent, explanatory 
framework. By incorporating multiple perspectives, each in- 
tegrative approach becomes a broad, comprehensive expla- 
nation of prejudice. Social dominance theory, integrated 
threat theory, and a multicomponent approach to intergroup 
attitudes exemplify integrative approaches to prejudice. 


Social Dominance Theory 


Social dominance theory (SDT) assumes that societies are 
structured as group-based social hierarchies, with one or a 
small number of dominant or hegemonic groups at the top 


of the social structure and at least one subordinate group 
below them (Sidanius, Levin, Rabinowitz, & Federico, 1999; 
Sidanius & Pratto, 1999). In general, dominant group mem- 
bers disproportionately enjoy society’s goods and benefits 
De, wealth, status, and power), whereas subordinate group 
members suffer a disproportionate share of society’s mis- 
eries and inequities (i.e., poverty, low prestige, and relative 
powerlessness). 

In group-based social hierarchies, individual’s stations in 
life are determined largely by their membership in socially 
constructed groups defined by race, gender, age, religion, so- 
cial class, and so on. Group-based hierarchies are assumed 
to be highly stable, often reflecting consensus as to which 
groups are dominant and subordinate, respectively. For ex- 
ample, perceived social standing of U.S. ethnic groups in 
1964 and later in 1989 correlated almost perfectly across 
the quarter century (Sidanius & Pratto, 1999). SDT defines 
three types of social stratification systems: an age system 
where adults and older individuals command more resources 
and power than the younger, a gender system in which men 
possess greater status and power than women, and an arbi- 
trary set system in which socially constructed, arbitrarily de- 
fined categories (e.g., races, occupations, social classes, 
nationalities) enjoy disproportionately more status and power 
over other socially constructed categories. SDT concentrates 
especially on gender and arbitrary set systems of group-based 
hierarchy. 

Whereas age and gender systems of group hierarchy are 
assumed by SDT to be universal across human societies, 
arbitrary set hierarchies differ in several regards. First, they 
display more definitional fluidity across time period and 
countries. Sidanius and Pratto (1999), the principal architects 
of SDT, claimed that arbitrary set hierarchies emerge only in 
societies that produce an economic surplus. Arbitrary set 
hierarchies tend to be dynastic with social status passing on 
to one’s children. Finally, arbitrary set hierarchies are pre- 
sumably maintained more by terror, violence, and brutality 
than by age- and gender-based hierarchies. 

Three basic assumptions of SDT are as follows: (a) Most 
intergroup conflict and oppression reflect a predisposition 
toward forming group-based social hierarchy; (b) social 
systems are prone to hierarchy-enhancing (HE) forces 
pushing toward greater inequality, and opposing effects of 
hierarchy-attenuating (HA) forces toward greater equality; 
and (c) conflict between HE and HA forces produces rela- 
tively stable social systems. 

From these assumptions SDT concerns itself with the 
mechanisms that contribute to group-based social hierarchy 
and with how hierarchies affect these mechanisms. Behav- 
ioral asymmetry is one mechanism. The notion of behavioral 


asymmetry is that the behavioral repertoires of dominant and 
subordinate group members differ and that these differences 
contribute to the hierarchical relationships among these 
groups. Four types of behavioral asymmetries are asymmetri- 
cal in-group bias, systematic out-group favoritism or defer- 
ence, self-debilitating behavior, and ideological asymmetry. 

Regarding in-group bias (i.e., favoring one’s own group 
over other groups), dominants show more than do subordi- 
nates. This asymmetrical in-group bias reinforces the hege- 
monic group’s dominance over the subordinate group. By 
contrast, deference, or out-group favoritism, is more apt to be 
shown by members of the subordinate group, again reinforc- 
ing the dominant group’s hegemony. Self-debilitation occurs 
when subordinate group members engage in more self- 
defeating and self-destructive behavior, such as criminal ac- 
tivity or drugs, than do dominant group members. Ideological 
asymmetry refers to the idea the antiegalitarian values lead 
one to endorse policies and ideologies promoting group- 
based inequality, such as support for the death penalty in the 
United States, which dominant group members endorse more 
strongly than do subordinate group members. 

The degree of group-based social inequality is also influ- 
enced by support for various legitimizing myths (LMs). 
These are ideologies that provide moral or intellectual justifi- 
cations for group-based social hierarchies within all three hi- 
erarchical systems (age, gender, or arbitrary set). SDT defines 
two types of LMs based on whether they facilitate social in- 
equality and are HE or facilitate social equality and are HA. 
Racism, sexism, and ageism exemplify HE-LMs, while fem- 
inism, socialism, and universalism are HA-LM examples. 

The psychological aspect of SDT is the construct of social 
dominance orientation (SDO) as assessed by an eponymous 
scale. SDO is a personality dimension defined as an attitude 
toward intergroup relations reflecting antiegalitarianism and 
intolerance, at one end, to support for group-based hierarchy 
and the domination of inferior groups by superior groups, at 
the opposite end. A high score on the SDO scale reflects a 
willingness to accept inequalities between and among groups 
in society. Items in the SDO scale refer to groups in the ab- 
stract and thus tap the respondent’s acceptance of intergroup 
inequalities for whatever group distinctions are salient to the 
respondent in a given sociopolitical or national context. 

SDO scale scores have been shown to relate to many po- 
litical attitudes (e.g., political conservatism, nationalism, 
patriotism), legitimizing ideologies (e.g., racism, sexism, be- 
lief in fate), social attributions (e.g., internal vs. external at- 
tributions for the fate of the poor), HE/HA career choices 
(e.g., police officers vs. teachers), and group evaluations (see 
Sidanius et al., 1999; Sidanius & Pratto, 1999). In Saucier’s 
(2000, p. 378) study of the structure of social attitudes, SDO 
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loaded with Machiavellianism on a dimension defined as 
“favoring whatever is immediately beneficial to me and 
mine, disregarding wider concerns of fairness or morality,” 
which was separate from the factor on which authoritarian- 
ism loaded on. 

Focusing on prejudice specifically, Whitley (1999) has 
shown that (a) SDO predicted most forms of prejudice toward 
Black Americans and homosexuals in a sample of White, het- 
erosexual U.S. university students and (b) SDO also mediated 
gender differences in those prejudices in that sample. Accord- 
ing to Sidanius, Pratto, and their colleagues, SDO also shows 
discriminant validity in being relatively independent of other 
constructs such as conservatism, interpersonal dominance, 
and right-wing authoritarianism, although Altemeyer (1996) 
reported a moderate, positive correlation between RWA and 
SDO. Consistent with the notion that attitudes toward group 
hierarchy reflected in the SDO scale are culturally universal, 
Pratto et al. (2000) showed that with proper translation and 
back translation, SDO can be reliably measured cross- 
culturally, and its scores related in theoretically predicted 
ways to sexism, prejudice toward oppressed groups by major- 
ity group members, and related attitudes (e.g., support for the 
military) for samples of respondents in several countries out- 
side North America, including Israel, Taiwan, and the People’s 
Republic of China (Shanghai), as well as Canada. 

Advocates of SDT have suggested that the SDO construct 
can account for the relationships between conservatism and 
racism and between conservatism and antimiscegenation (e, 
a disdain for interracial marriages) in terms of their mutual de- 
pendence on SDO (see Sidanius & Pratto, 1999). Advocates 
of SDT also believe that individual differences in SDO are de- 
termined by four factors: group status, gender, socialization, 
and temperament. First, the greater the social status of one’s 
in-group in a given society, the higher is one’s level of SDO. 
In the United States, for example, White Americans outscore 
Black Americans in SDO. In Israel, Ashkenazi (European an- 
cestry) Jews have higher SDOs than Sephardic (North- 
African or Middle Eastern ancestry) Jews. Second, the single 
most reliable finding of SDT research is that with a few ex- 
ceptions in cultures outside North America (see Pratto et al., 
2000), males outscore females on SDO. Socialization experi- 
ences, such as education, are also assumed to affect SDO, with 
higher educational levels relating inversely to SDO. Finally, 
higher SDO scores correlate with lower empathy levels and 
greater aggressivity—temperamental features that are pre- 
sumably heritable and that promote out-group prejudice. 

Advocates of SDT view it as a theoretical perspective 
linking the individual and the social structure together in the 
explanation of prejudice, and one that provides a comprehen- 
sive explanation for the oppression of subordinate groups by 
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dominant ones in human societies around the world. As such, 
its advocates claim that SDT complements and integrates the- 
ories of prejudice focusing on the individual, such as the 
right-wing authoritarianism theory (see Altemeyer, 1996; 
Whitley, 1999), and those focusing on the role of social struc- 
ture and elites, such as Marxism, as well as providing a 
theoretical bridge between these micro and macro levels of 
analysis. 

Proponents of SDT have also noted some differences be- 
tween their perspective and other theories of prejudice and 
racism. Sidanius et al. (1999) suggested that symbolic racism 
is limited to focusing on racism toward Black Americans in 
one historical and cultural context (viz., the United States 
in the late 20th century), whereas SDT claims a much wider 
historical and cross-cultural focus as well as a broader sweep 
regarding oppressed groups around the world to which it pre- 
sumably applies. In fact, however, some evidence suggests 
that U.S.-derived measures of prejudice, especially blatant 
and subtle prejudice, work as well in Europe as they do in the 
United States (Pettigrew et al., 1998). Similarly, while sym- 
bolic and modern racism theories focus on values such as in- 
dividualism and the Protestant ethic, SDT instead emphasizes 
antiegalitarianism as crucial to prejudice. 

Its proponents also suggest that SDT complements inter- 
group theories, such as social identity theory (SIT), by taking 
into account the attitudes and behaviors of subordinate group 
members as well as those from the dominant group, focusing 
on out-group derogation as well as in-group favoritism, and 
differentiating status and power in intergroup relations. In- 
deed, researchers have profitably used both SDT and SIT (e.g., 
Levin & Sidanius, 1999; Levin, Sidanius, Rabinowitz, & 
Federico, 1998) to yield insights into intergroup processes, 
such as the relationship between in-group identification and 
SDO in high-status versus low-status groups in a society. 
Clearly, SDT is presently one of the most prominent and 
promising contemporary theories of prejudice, and the SDO 
measure is apt to become a scale of choice among those who 
wish to use an explicit prejudice measure instead of, or along 
with, implicit prejudice measures. 


Integrated Threat Theory 


Without claiming to incorporate all possible causes of preju- 
dice, Walter Stephan, Cookie Stephan, and their colleagues 
have nevertheless proposed that threat is certainly one major 
class of its causes and arguably its principal one. Their inte- 
grated threat theory (ITT) identifies and combines four major 
types of threat that they and other investigators have previ- 
ously documented as relevant to understanding and pre- 
dicting prejudice: (a) realistic threats, (b) symbolic threats, 


(c) intergroup anxiety, and (d) stereotyping (e.g., Stephan & 
Stephan, 1996; Stephan, Ybarra, & Bachman, 1999). Al- 
though other theories and investigators have emphasized one 
or another of these threats, the Stephans and their associates 
provide a distinctive twist or interpretation of each threat in 
the overall context of ITT. 

Realistic threats include any perceived threats from an- 
other group to the welfare, well-being, or survival of one’s in- 
group and its members. Symbolic threats are perceived group 
differences in beliefs, values, or norms that may threaten the 
in-group’s way of life. Unlike symbolic racism, ITT’s sym- 
bolic threats apply to a wider array of groups, both dominant 
and subordinate, and to value differences in general, rather 
than those typifying only U.S. society, such as the Protestant 
ethic. The intergroup anxiety construct derives from prior 
research by Stephan and Stephan (1985), referring to the neg- 
ative emotions occurring when one interacts with members of 
another group, especially an antagonistic or competitive out- 
group. Beliefs about the characteristics of groups and the traits 
of group members (i.e., stereotypes) constitute yet another 
threat by creating expectancies about the type of interactions 
that can be anticipated with out-group members, with negative 
expectancies reflecting prejudice. Finally, in addition to the 
four types of threat, ITT also assumes that the history and na- 
ture of prior contact between groups (e.g., negative, positive, 
or mixed) and the status of groups relative to one another also 
needs to be taken into account for predicting prejudice. 

Immigrants are assumed by ITT to elicit all four types of 
threat in members of immigrant-receiving societies, such 
as the United States, Spain, and Israel. For that reason, atti- 
tudes toward immigrant groups have figured prominently as a 
criterion of particular interest in ITT research. Using samples 
of university students at several locales throughout the United 
States, Stephan, Ybarra, and Bachman (1999) showed that all 
four threats were relevant for predicting prejudice toward 
Cubans (in Miami), Mexicans (in New Mexico), and Asians 
(in Hawaii), accounting for 50% or more of the variance in at- 
titudes toward each of these different immigrant groups. 
Stephan, Ybarra, Martinez, Schwarzwald, and Tur-Kaspa 
(1998) likewise showed that each of the four threats was a 
reliable predictor of attitudes held by Spanish university stu- 
dents toward Moroccan immigrants and by Israeli students 
toward Ethiopian and Russian immigrants to Israel. 

Structural equation modeling (SEM) analyses by Stephan 
et al. (1998) indicated that the four threats comprised a single, 
unitary dimension of threat. Schwarzwald and Tur-Kaspa 
(1997) showed that realistic, symbolic, and interpersonal 
threats were significant predictors of Israeli university stu- 
dents’ attitudes toward Ethiopian and Russian immigrants, 
whereas individual differences in SDO predicted prejudice 


toward Ethiopian immigrants only. By exploring women’s 
attitudes toward men, Stephan, Stephan, Demitrakis, Yamada, 
and Clason (2000) showed that ITT is useful for target groups 
other than immigrants and for attitudes of members of subor- 
dinate or oppressed groups as well as dominant ones. Stephan 
et al. found that for women, symbolic threat, intergroup anxi- 
ety, and negative contact were predictors of negative attitudes 
toward men; however, contrary to prediction, realistic threats 
failed to emerge as a reliable predictor. 

Because the preceding research on ITT is correlational in 
nature, it does not and cannot conclusively document that the 
causal sequence goes only from perceived threat to prejudice 
and not the other way or in both directions. However, Maio, 
Esses, and Bell (1994) experimentally manipulated perceived 
realistic and symbolic threats and found increased prejudice 
toward immigrants, thus validating at least the proposed 
causal sequence of threats heightening prejudice that lies at 
the core of ITT, at least for that target group. 

Assessment of different types of threat has potential utility 
for those interested in improving intergroup relations. In stud- 
ies in which attitudes toward more than one target group are 
assessed from an ITT perspective, one may explore which tar- 
get group may deserve more attention in ameliorative efforts 
(for an example, see Schwarzwald & Tur-Kaspa, 1997). Simi- 
larly, in the aforementioned studies of ITT, some but usually 
not all types of threat emerged as significant predictors, sug- 
gesting where change attempts might profitably focus. For ex- 
ample, in attitudes of U.S. university students toward Mexican 
immigrants, intergroup anxiety has emerged as the most reli- 
able predictor (Stephan & Stephan, 1996; Ybarra & Stephan, 
1994). ITT is, therefore, especially useful for those interested 
in reducing as well as understanding prejudice (see the chap- 
ter on reducing prejudice by Dovidio in this volume). 


The Multicomponent Approach to Intergroup Attitudes 


The multicomponent approach to intergroup attitudes 
(MAIA), proposed by Esses, Haddock, and Zanna (1993; see 
also Haddock et al., 1993; Zanna, 1994), is the final example 
of an integrative theoretical approach to be considered. Al- 
though MAIA was derived independently from ITT, the two 
perspectives clearly resemble one another in their mutual em- 
phases on symbolic beliefs and emotional reactions to out- 
groups as important predictors of prejudice and also in a 
shared interest in determining if and when stereotypes of 
out-groups will relate to prejudice toward them. 

MAIA presumes that an intergroup attitude, like the atti- 
tude concept in general, has several components (viz., evalu- 
ations, cognitions, and affect). An attitude toward a social 
group is an overall evaluation, either positive or negative. 
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Esses and her colleagues use the feeling thermometer as 
their preferred measure of an intergroup attitude as a global 
evaluation. The goal of MAIA is to predict prejudice and 
intergroup attitudes, relying mainly on cognitive and affec- 
tive factors as the key predictors. Stereotypes and symbolic 
beliefs constitute MAIA’s cognitive factors. Stereotypes are 
beliefs about the characteristics of groups, both those shared 
with other perceivers (i.e., a consensual stereotype) and those 
unique to a given perceiver (a personal stereotype), with per- 
sonal stereotypes assumed by MAIA researchers to be more 
useful to predict prejudice than consensual stereotypes. Sym- 
bolic beliefs are a person’s ideas as to how a social group hin- 
ders or facilitates her or his core values and norms. In the 
MAIA the affective component consists of the specific feel- 
ings and emotions evoked by a social group (see also Esses, 
Haddock, & Zanna, 1994). To assess personal stereotypes, 
symbolic beliefs, and emotions toward one or more groups, 
MAIA researchers typically employ open-ended measures in 
which respondents first list their thoughts and feelings toward 
a specified group and then go over their lists in order to rate 
the valence of each entry and the percentage of the social 
group believed to be characterized by it. 

In their initial studies MAIA researchers explored attitudes 
among English-Canadian university respondents in Ontario 
toward several social groups: English-Canadians, French- 
Canadians, Native Indians, Pakistanis, and homosexuals (see 
Esses et al., 1993). The MAIA model successfully predicted 
attitudes toward the out-groups. Attitudes toward Pakistanis 
and homosexuals were best predicted by symbolic beliefs, a 
component of intergroup attitudes believed to be impor- 
tant for assessing prejudice toward disliked or unfavorable 
groups. By contrast, out-groups more favorably regarded by 
the English-Canadian respondents (viz., French-Canadians 
and Native Indians) were best predicted by emotions. 

Esses et al. (1993) also showed that RWA is an important 
moderator of out-group attitudes and their subcomponents. 
English-Canadian respondents scoring high on RWA had 
consistently more negative attitudes toward all four out- 
groups, especially the disfavored groups, and symbolic be- 
liefs were their single best predictor of attitudes toward 
different groups, including French-Canadians. By contrast, 
emotions best predicted the more favorable out-group atti- 
tudes of those scoring low on RWA. 

These conclusions, particularly regarding homosexuals as 
a target group, were further reinforced in two studies by 
Haddock et al. (1993). Their first study confirmed the more 
negative attitude of high RWA scorers toward homosexuals 
and the importance of symbolic beliefs in predicting preju- 
dice toward homosexuals. Their second study replicated and 
extended these findings by showing that for those scoring 
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high on RWA, past experience and perceived value dissimi- 
larity were additional factors along with symbolic beliefs that 
were useful in predicting their prejudicial attitude toward 
homosexuals. 

Because intergroup attitudes can be ambivalent rather 
than uniformly positive or negative, Esses and her colleagues 
extended their open-ended techniques to assess attitudinal 
ambivalence toward various social groups. In one study, 
Bell, Esses, and Maio (1996) assessed evaluations, stereo- 
types, symbolic beliefs, and emotions that a sample of 
English-Canadian university students in Ontario felt toward 
Native People, French-Canadians, Canadians, and Oriental 
immigrants. Respondents were more ambivalent toward 
Native People than Canadians or Oriental immigrants, with 
French-Canadians in between. Correlations between average 
ambivalence scores and an overall summary evaluation of 
each group showed that ambivalence was unrelated to atti- 
tude toward Native Peoples but negatively related to attitudes 
for the other groups, especially French-Canadians. Because 
MAIA takes into account ambivalence in intergroup atti- 
tudes, it could also qualify as an ambivalence approach to 
prejudice. 


Conclusion 


As perhaps the ultimate form of an integrative approach to 
the psychology of bigotry, one could ask what a general the- 
ory of prejudice would look like. In reviewing the literature 
on theories of racism and their own research on values and 
prejudice, Biernat, Vescio, Theno, and Crandall (1996) out- 
lined just such a general theory of prejudice. A general the- 
ory, they suggested, should seek to predict or explain 
prejudice by oppressors toward an array of potential target 
groups, such as Blacks, homosexuals, ethnic groups, women, 
and so on in the United States and in other countries. They 
also generated a list of factors that promote prejudice. From 
racism and belief congruence theories (as well as SDT, ITT, 
and MAIA, it might be added), these prejudice-promoting 
factors include negative affect toward Blacks (and other 
groups), prototypic values such as antiegalitarianism, indi- 
vidualism, and the Protestant ethic, the perception that mem- 
bers of groups who are the target of prejudice violate 
cherished beliefs and values, as well as normative and con- 
textual cues that condone or permit prejudice and discrimina- 
tion. Other contributing factors, they noted, would include 
known correlates of prejudice, such as an authoritarian per- 
sonality (especially RWA) and attributional styles in per- 
ceivers that lead them to attribute negative outcomes 
confronting oppressed people to internal, controllable causes 
rather than external ones. 


To their list of factors promoting prejudice should also be 
added individual differences in aggressiveness and social 
dominance orientation, realistic threats, and situational cues 
that prime and stimulate negative out-group attitudes, both 
subliminally and supraliminally. In addition, unconscious 
processes of the types specified by the OTAP and automatic 
processing approaches to prejudice would also need to be 
taken into account. On the other hand, humanitarian and egal- 
itarian values, internal motivation to avoid prejudice, and 
empathy and sympathetic identification with the underdog 
would help to counteract prejudice and its expression. 

This outline for a general theory of prejudice summarizes 
well the insights of psychology’s best theories for under- 
standing prejudice at the dawn of the twenty-first century. It 
highlights ambiguities that future research might try to re- 
solve, such as whether egalitarian values promote or counter 
prejudice or both, depending on yet other factors. Finally, it is 
perhaps useful as a heuristic device for designing and execut- 
ing studies of prejudice, with an eye to evaluating the relative 
power of promotive and counteractive factors and assessing 
their unique predictive power and interactions. Illustrating 
just such an approach is the research by Biernat, Vescio, 
Theno, and Crandall (1996), which (among other things) in- 
cluded measures of core American values, prejudice scales, 
supraliminal priming of values, and experimental variations 
in value violation by attitude targets representing (in different 
studies) variations in race, sexual orientation, and weight 
status. 

Having completed a review of prejudice from the perspec- 
tive of the bigot, I now consider the psychology of prejudice 
from the viewpoint of the victim or target. 


THE PSYCHOLOGY OF THE VICTIM OF 
PREJUDICE AND DISCRIMINATION 


Psychologists have long been interested in the effects of dis- 
crimination on members of oppressed groups. One early ap- 
proach to exploring this question was to assess samples of 
oppressed individuals on psychological measures as a means 
of exploring the impact of oppression. Kardiner and Ovesey 
(1951), for example, used psychoanalytic interviews and re- 
sponses to projective tests such as the thematic apperception 
test (TAT) and the Rorschach to assess the “mark of oppres- 
sion” among Black Americans. Similarly, Karon (1975) com- 
pared samples of White respondents and northern versus 
southern Black respondents in the United States on a modi- 
fied version of the Tomkins-Horn Picture Arrangement Test 
(PAT), a projective test for assessing personality. Although 
both studies showed evidence of the stigma of being Black in 


the United States, they did not link it clearly to experiences of 
discrimination encountered by their respondents. 


Attributional Ambiguity Perspectives 


Beginning in the 1970s, research on the psychology of being 
a victim of prejudice and discrimination changed in several 
important ways (see Dion, Earn, & Yee, 1978). First, it shifted 
toward an experimental approach in which discrimination 
experiences were manipulated by investigators in the psycho- 
logical laboratory by creating conditions in which partici- 
pants from stigmatized groups either could or could not 
attribute a negative outcome to prejudice on the part of others 
(an attributional ambiguity paradigm) or were explicitly 
given the odds that their failure was due to discrimination by 
allegedly biased judges of their performance (the base rate 
paradigm). Second, these experimentally oriented researchers 
often adopted a viewpoint stressing the attributional ambigu- 
ity of being a target of prejudice (see Crocker & Major, 1989; 
Crocker, Major, & Steele, 1998; Dion, 1975, 1986; Dion & 
Earn, 1975; Dion et al., 1978). 

According to an attributional ambiguity perspective, in- 
stances of encountering prejudice or discrimination are often 
ambiguous. For example, Black Americans who encounter a 
rejection from a White American confront an attributional 
dilemma to explain the situation and must decide whether the 
rejection is due to something about themselves (i.e., a personal 
characteristic) or to something about the person rejecting them 
(e.g., a prejudicial bias or a discriminatory reaction against 
Blacks). Attributional ambiguity perspectives emphasize that 
the type of attributions that a victim of prejudice or discrimi- 
nation makes in such a situation (i.e., an internal attribution to 
the self, an external attribution of perceived prejudice or dis- 
crimination, or perhaps both) has a psychological impact on 
the victim’s self-evaluations and affective reactions. 


Attributions of Prejudice and Self-Esteem 


Dion (1975) provided the first suggestive evidence for a link 
between attributions of prejudice and self-esteem in an ex- 
periment where university women competed against several 
opponents in a laboratory setting, who they were led to be- 
lieve were either all male or all female; and the women them- 
selves were made to fail either mildly or severely. Following 
experimentally induced failure, the women rated themselves 
on positive and negative traits comprising the female stereo- 
type and self-esteem traits and indicated to what extent their 
opponents were biased and prejudiced against them. From 
this latter measure, women were further categorized into 
high- versus low-perceived prejudice groups, with perceived 
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prejudice taken as an additional independent variable along 
with the experimental variables of alleged sex of the oppo- 
nents and severity of failure (i.e., an internal analysis). 

Unsurprisingly, the greater the failure, the lower was the 
women’s subsequent self-esteem. However, perceived preju- 
dice moderated this effect and apparently mitigated the im- 
pact of severe failure in decreasing women’s self-esteem. 
Specifically, women who experienced severe failure with 
male opponents and perceived it as reflecting sexist prejudice 
showed higher self-regard than did those who did not see 
their putative male opponents as prejudiced. Dion (1975) in- 
terpreted this finding as suggesting that perceived prejudice 
or discrimination may not inevitably lower self-esteem in the 
victim. Rather, under some circumstances the attribution of 
prejudice may sustain self-esteem by enabling the minority 
or subordinate group member to attribute a negative experi- 
ence to prejudice by others toward an arbitrary trait (i.e., their 
group membership) rather than to their own personal quali- 
ties as an individual. 

In an important theoretical statement and elaboration of 
the attributional ambiguity perspective, Crocker and Major 
(1989) reviewed the then-existing literature and outlined sev- 
eral ways that members of stigmatized groups could protect 
their self-concepts in the face of a negative experience. For 
example, a stigmatized group member could interpret the 
negative encounter as due to prejudice or discrimination to- 
ward their group. Alternatively, they could protect them- 
selves from invidious comparisons with privileged majority 
group members by comparing their outcomes to their own in- 
group rather than to the out-group and by focusing on those 
dimensions on which their group exceeds the dominant out- 
group. Major and Schmader (1998) have added psychological 
disengagement to the list of ways in which stigmatized group 
members may psychologically insulate and protect them- 
selves from prejudice and discrimination. Miller and Kaiser 
(2001a, 2001b) recently outlined the wide variety of re- 
sponses that those who are discriminated against may employ 
to protect themselves, drawing from the literature on coping 
and stress as well as attachment theory for insights. 

Crocker and Major (1993) qualified the conditions under 
which attributing negative outcomes to prejudice could 
buttress one’s self-esteem: namely, when the stigma was 
perceived as legitimate, justifiable, or controllable and legit- 
imizing beliefs supported the stigmatized group’s lower 
status, or when other important beliefs were threatened by at- 
tributions of prejudice. Crocker, Cornwell, and Major (1993) 
supported this reasoning in a subsequent experiment in which 
obese women were rejected by an attractive male confederate 
as a potential date. Although the obese women attributed the 
negative outcome to their weight, they did not attribute it to 
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the male rater or to his prejudice. Crocker et al. interpreted 
the lower self-esteem by obese women to the fact that obesity 
is widely seen as a controllable stigma, which legitimizes 
and justifies prejudice and bias toward the overweight. The 
stigma of obesity, however, applies more to White than to 
Black American women (Hebl & Heatherton, 1998). 

Crocker, Major, and their colleagues have also conducted 
experimental tests of the attributional ambiguity perspective 
with groups that regard prejudice and discrimination to- 
ward them as illegitimate. Crocker, Voelkl, Testa, and Major 
(1991) focused on sex and race in separate experiments in 
order to explore the potential buffering effects of perceived 
prejudice on self-esteem. Their study with White U.S. uni- 
versity women as participants succeeded in experimentally 
varying their attributions to prejudice on the part of a sexist 
man evaluating an essay of theirs negatively; however, the 
trait measure of global self-esteem failed to yield reliable dif- 
ferences as a function of perceived prejudice, though the 
mood measure followed the prediction of a self-protective 
function for attributions of prejudice. 

Crocker et al. (1991) reported finding evidence for the 
buffering effects of perceived prejudice on self-esteem with 
Black American participants who had received either positive 
or negative interpersonal feedback from a White evaluator. 
These participants believed that the White evaluator either 
could see them from another room and was thus aware of their 
race or could not see them because of a drawn blind and hence 
was unaware of their race. Black participants who thought 
they could be seen by a White evaluator and had attributed the 
evaluator’s feedback to prejudice showed less of a pretest- 
posttest difference in self-esteem than when they thought that 
the White evaluator could not see them. In other words, in the 
condition where prejudice was attributed, Black participants 
appeared to discount the negative feedback from a White 
evaluator, with the consequence that their self-esteem was left 
unchanged. They also discounted positive feedback when the 
White evaluator could allegedly see them and showed de- 
creased self-esteem in that condition. 

The classic book Black Like Me, in which White author 
James Griffin (1961) described his experiences posing as a 
Black man in the U.S. South of the 1950s, had suggested a 
similar process among Black Americans. Recalling an in- 
stance of racial discrimination he had experience, he noted, 
“The Negro’s only salvation... lies in his belief, the old 
belief of his fore fathers, that these things are not directed 
against him personally, but against his race, his pigmentation. 
His mother or aunt or teacher long ago carefully prepared him, 
explaining that ‘... they don’t do it to you because you’re 
Johnny—they don’t even know you. They do it against your 
Negro-ness’” (p. 48). In the United States, Black Americans 


are considerably more likely to be targets of prejudice and 
discrimination than are members of other minority or subordi- 
nate groups. Perhaps as a consequence of this greater victim- 
ization now and in the past, Black Americans have developed 
through ethnic group socialization the strategy of discounting 
negative (and perhaps positive) feedback from White majority 
group members and attributing negative feedback to prejudice 
as a means of coping and sustaining their self-esteem. 

Some investigators (e.g., Branscombe & Ellemers, 1998; 
Kobrynowicz & Branscombe, 1997), however, have ques- 
tioned whether Crocker et al. (1991) actually succeeded in 
demonstrating the buffering effects of attributing prejudice 
on self-esteem with their Black participants. Branscombe 
and Ellemers (1998) have instead suggested that in-group 
identification is a necessary mediator between the attribu- 
tion of prejudice for experiences of oppression and self- 
esteem for Black American men and women as well as 
other minority groups in the United States, such as Native 
Americans and Hispanic-Americans. The greater the in- 
group identification, the more likely that attributions of 
prejudice for experiences of discrimination or oppression 
will be associated with the maintenance and retention of 
high self-esteem. 


Protective Benefits for Majority Group Members 


Of course, even members of dominant, hegemonic groups 
can and sometimes do avail themselves of the self-protective 
benefits of perceiving themselves and their group as being 
discriminated against, but apparently without the same psy- 
chological dilemma and tradeoff confronting members of 
oppressed groups. Kobrynowicz and Branscombe (1997) ar- 
gued that certain members of structurally privileged groups, 
such as White American men whose self-esteem may be low 
or otherwise vulnerable, may exaggerate estimates of per- 
ceived discrimination against their group as a means of bol- 
stering their self-esteem. Consistent with this perspective, a 
sample of White men scoring low in self-esteem were espe- 
cially prone to perceive themselves and their group as having 
been discriminated against on the basis of gender. Likewise, 
Branscombe (1998) showed that asking men to contemplate 
their group’s disadvantage on the basis of gender led to 
higher self-esteem, whereas thinking about their group’s ad- 
vantages produced decreases on group-related well-being. 
By contrast, women contemplating their group’s disadvan- 
tages scored lower in reported self-esteem. Thus, the self- 
protective effect of attributing one’s failure to discrimination 
is apparently even more evident among dominant majority 
group members and has positive benefits for both their self- 
esteem and their sense of control. 


The Personal-Group Discrimination Discrepancy 


Research originally conducted in the tradition of relative de- 
privation theory has suggested that individuals in subordi- 
nate and oppressed groups typically perceive more group 
discrimination than personal discrimination. Specifically, in 
testing models of egoistic relative deprivation (defined later), 
Crosby (1982) observed that members of a sample of work- 
ing women in Massachusetts believed that they, as individual 
women, were personally less deprived and discriminated 
against in terms of income and employment opportunities 
than were women as a group. Crosby (1984) subsequently 
attributed the tendency for women to perceive less personal 
than group discrimination to a process of denial of their per- 
sonal disadvantage. 

This phenomenon has since been observed among ethnic 
and racial groups in the United States, Canada, and elsewhere 
and has been labeled the personal-group discrimination dis- 
crepancy (PGDD; Taylor, Wright, Moghaddam, & Lalonde, 
1990; Taylor, Wright, & Porter, 1994). Much like Crosby 
(1982), Taylor et al. (1990) found that Haitian and East 
Indian women in Montreal reported more group than per- 
sonal discrimination across four sources of potential discrim- 
ination (viz., race, culture, status as newcomers to Canada, 
and sex). Dion and Kawakami (1996) likewise found a 
PGDD across a variety of domains for six ethnic groups in 
Toronto, three of them visible minorities and the other three 
White or nonvisible minorities, although the PGDD was con- 
sistently stronger among the visible minorities. 


Explanations for the Personal-Group 
Discrimination Discrepancy 


One reason that people from oppressed groups may be reluc- 
tant to claim that they have personally experienced prejudice 
or discrimination is that there are social costs to attributing a 
setback to discrimination. In two studies, Kaiser and Miller 
(2001) showed that a Black person who attributed a failing 
grade on a test to discrimination was perceived by Whites as 
being a complainer and was evaluated less positively than 
was a Black person attributing the failure to the low quality 
of his answers on the test. 

Perhaps the most comprehensive explanation of the 
PGDD, at present, has been suggested by Postmes, 
Branscombe, Spears, and Young (1999). Postmes and his 
colleagues argued that the PGDD is not an intentional com- 
parison between oneself and one’s group as regards experi- 
enced discrimination. If the latter were the case, the 
difference between separate ratings of perceived discrimina- 
tion for self and for group (i.e., the standard way of assessing 
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the PGDD) should relate highly to a single direct comparison 
for self (compared to others of one’s group, e.g., a gender 
group) or in-group (compared to a comparison out-group, 
e.g., the other gender group). In fact, standard PGDD scores 
correlated only modestly with direct comparisons for self and 
for group. 

Instead, Postmes et al. (1999) proposed and showed that 
ratings of personal discrimination and of group discrimina- 
tion are based on two separate judgments: an interpersonal 
judgment comparing self and other in-group members for 
ratings of personal discrimination and an intergroup judg- 
ment comparing one’s in-group to an out-group for ratings 
of group discrimination. Consistent with this emphasis on 
different types of judgment and comparison referents, they 
also demonstrated that ratings of personal discrimination or 
advantage reflect personal, self-serving motives; whereas 
ratings of group discrimination or advantage are influenced 
by social identity motives and in-group identification. Other 
researchers’ analyses of the PGDD converge with Postmes 
et al.’s conclusions (Dion & Kawakami, 1996; Kessler, 
Mummendey, & Leisse, 2000; Quinn, Roese, Pennington, & 
Olson, 1999). 


Perceived Prejudice and Discrimination as Stressors 
A Stress Model 


A number of investigators have independently proposed that 
perceiving oneself to be a target of prejudice or discrimina- 
tion is a psychosocial stressor. For example, Dion, Dion, and 
Pak (1992) contended that perceived prejudice or discrimina- 
tion is a social stressor because it elicits cognitive appraisals 
of threat such that its victims see themselves as being delib- 
erate targets of negative behavior by one or more out-group 
antagonists and impute stable, malevolent motives and inten- 
tions to them. Moreover, prejudice and discrimination are 
often unpredictable stressors, entailing greater adaptational 
costs for the victim than a predictable or controllable stressor 
(see Allison, 1998, for an excellent discussion of other stress 
models). 

If perceived prejudice and discrimination are indeed stres- 
sors, they should produce in individuals various social- 
psychological consequences known to result from stress, 
such as negative affect, reported stress, psychological or psy- 
chiatric symptoms, and lowered sense of well-being, as well 
as heightened in-group identification (a frequent response to 
external threat to one’s group). Dion et al.’s stress model of 
perceived discrimination has now been amply supported by 
both experimental and correlational studies. In an experiment 
varying perceived prejudice in an attributional ambiguity 
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paradigm, Dion and Earn (1975) found that when they made 
attributions to prejudice for a severe failure, Jewish men 
showed evidence of heightened in-group identification as 
well as a stress response on mood measures: namely, feeling 
more aggression, greater sadness, higher anxiety, and height- 
ened self-consciousness. Similar, Crocker et al. (1993) found 
that women, especially obese ones, reported more negative 
moods when they received negative feedback from an attrac- 
tive man as opposed to positive feedback. 

Correlational studies concur strongly with experimental 
studies in documenting a link between perceived discrimina- 
tion and stress. Perceptions of discrimination in Black 
Americans correlate with psychiatric symptoms. Landrine 
and Klonoff (1996; Klonoff & Landrine, 1999) developed a 
reliable 18-item measure of perceived racial discrimination 
called the Schedule of Racist Events (SRE) and validated it in 
two separate studies with samples of Black American com- 
munity respondents. In the most recent study with more than 
500 respondents sampled from middle- and lower-class sec- 
tions of San Bernardino, California, they found that 96% 
reported discrimination in the past year and 98% at some 
time during their lives. For 95% of the respondents, these 
discrimination experiences were labeled as stressful. Black 
American men reported more experiences of discrimination 
than did their female counterparts. In both studies, frequency 
of discrimination experiences correlated positively with psy- 
chiatric symptoms, accounting for about 10% of the variance. 
In the 1996 study, the frequency of discrimination experi- 
ences was also linked to cigarette smoking. 

Other researchers have highlighted the cumulative and 
chronic stressfulness of perceived discrimination among 
Black Americans. Feagin (1991) emphasized that for Black 
Americans, even those well ensconced in the middle class, the 
cumulative effect of racist encounters over a lifetime becomes 
potentially more potent than a simple sum of frequency count 
of such experiences might suggest. Branscombe, Schmitt, and 
Harvey (1999) showed the negative effects upon well-being 
of chronic perceptions of discrimination in Black American 
respondents. Branscombe and her colleagues emphasized that 
chronic perceptions of discrimination and stable attributions 
of pervasive prejudice have quite different effects on self- 
esteem and well-being than do attributions to prejudice for a 
single event, such as is typically explored in laboratory stud- 
ies of perceived prejudice or discrimination. 


A Biopsychosocial Model 


Clark, Anderson, Clark, and Williams (1999) proposed a 
biopsychosocial model of racism as a stressor for Black 
Americans. Its underlying assumption is that perceived 


racism leads to heightened psychological and physiological 
stress responses from Black Americans. In this model, con- 
stitutional, sociodemographic, psychological, and behavioral 
factors are proposed to moderate the relationship between an 
environmental stimulus and its perception as being racist. 
Perceptions of racism are then linked to coping responses, 
psychological and physiological stress responses, and health 
outcomes. 

The links between perceived racism and health outcomes 
among Black Americans are perhaps the most intriguing and 
important aspect of Clark et al.’s (1999) model. The authors 
suggested that racism and its perception (or denial) relate 
to cardiovascular, neuroendocrine, and immune system re- 
sponses by Black Americans. Hypertension among Black 
Americans may well be associated, albeit in complex ways, 
with experiences of racism and methods of coping with them. 
For example, Krieger (1990) found that Black American 
women who indicated that they passively accepted racist ex- 
periences were over four times more likely to report hyper- 
tension than were those indicating a more active response to 
unfair treatment. Moreover, those Black American women 
reporting no instances of unfair treatment were more than 2'/, 
times more likely to report hypertension than were those re- 
porting one or more experiences of racism. If one assumes 
that Black women reporting no instances were denying or in- 
ternalizing racist experiences, this finding and other studies 
(Krieger & Sidney, 1996) suggest that as a coping mecha- 
nism, denial may have unfortunate health correlates or conse- 
quences for Black Americans. The specific links between 
perceptions and experiences of racism and hypertension in 
Black Americans of both sexes, however, remain to be firmly 
established and better understood. 

Like racism, sexism also has pernicious consequences 
for individuals experiencing and perceiving it. Landrine, 
Klonoff, Gibbs, Manning, and Lund (1995) correlated life- 
time and recent experiences of sexist events from their 
Schedule of Sexist Events (SSE) with scores from anxiety 
and depression scales, the Hopkins Symptom Checklist 
(HSC), and a measure of premenstrual tension syndrome 
(PMTS). Hierarchical regression analyses were performed in 
which generic stress measures for life events and hassles 
were entered at the first step, followed by lifetime and recent 
SSE scores in the second step. SSE scores accounted for ad- 
ditional variance beyond the generic stress indexes. Sexist 
discrimination emerged as an especially important and better 
predictor than generic stress for symptoms from the PMTS 
and HSC measures including premenstrual, somatization, 
obsessive-compulsive, depressive, and total psychiatric 
symptoms. Moreover, the ability of SSE scores to predict 
symptoms varied as a function of the U.S. women’s ages and 


ethnicities. Lifetime SSE scores enhanced prediction (over 
and above generic stress measures) of total HSC symptoms 
for older women but not for younger ones. 


Buffers for Discrimination-Related Stress 


Not all members of oppressed groups will suffer the stress of 
discrimination in the same way or to the same extent. The 
personality construct of hardiness—a composite of self- 
esteem and sense of control—may be one factor that buffers 
the stress of experiencing or perceiving discrimination to- 
ward oneself and one’s group. Dion et al. (1992) explored the 
role of personality-based hardiness in a study of Toronto’s 
Chinese community. As they predicted, the relationship of 
discrimination to psychological symptoms was markedly 
higher among Chinese community respondents who were 
low in hardiness than among those high in hardiness. Indeed, 
for those scoring high in hardiness, discrimination and re- 
ported psychological symptoms were effectively unrelated, 
whereas they related reasonably strongly for those low in har- 
diness. In addition, alternative interpretations in terms of 
differential life stresses or differential exposure to discrimi- 
nation in the two hardiness groups were ruled out as rival ex- 
planations (see Dion et al., 1992). 

Foster and Dion (2001) explored whether the beneficial 
relationship of personality-based hardiness to discrimination- 
related stress is due to buffering or denial in an experiment in 
which women confronted gender discrimination on an exam- 
ination. The findings favored a buffering interpretation and 
suggested that the buffering was due to the types of attribu- 
tions that hardy women made relative to their less hardy 
counterparts. Specifically, hardy women made specific, un- 
stable attributions rather than global, stable ones; that is, they 
tended to see the gender discrimination as a unique and un- 
usual occurrence, even though there were no differences 
between the hardy and nonhardy women in perceived un- 
fairness of the discrimination. 

Whereas hardiness may provide a personality-based 
buffer and coping dimension, in-group identification has 
been hypothesized to be important in predicting reliance on 
group-based responses to coping with discrimination and 
buffering self-esteem. Branscombe and Ellemers (1998) pro- 
posed a rejection-identification model suggesting that greater 
willingness to make attributions to prejudice among Black 
Americans heightens their minority-group identification as 
well as hostility toward the dominant White group but has a 
negative effect on personal and collective sense of well- 
being. Minority-group identification, however, has a buffer- 
ing effect in sustaining well-being. Branscombe and her 
colleagues tested and supported this model with SEM 
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procedures. Some alternative theoretical models failed to re- 
ceive support. 


Stereotype Threat 


Not only do women and minority members confront preju- 
dice and discrimination, but they also must deal with broadly 
shared, negative stereotypes about their groups by majority 
group members, which can have pernicious and deleterious 
effects upon their academic and athletic performance. Black 
Americans, for example, confront low expectations in the 
realm of academic ability, whereas women in the United 
States, Canada, and some other societies are presumed by 
consensually shared stereotypes to be inferior in mathematics 
compared to men. 

Steele (1997; Steele & Aronson, 1995) and his colleagues 
contended that negative stereotypes impugning the abilities 
of stigmatized group members constitute a powerful situa- 
tional threat with two notable consequences. First, in a testing 
situation involving an ability where one’s group is negatively 
stereotyped, the performance of those members who care 
about the ability and doing well on the test can be adversely 
affected. Second, chronic experiences of stereotype threat 
can lead members of stigmatized groups to disidentify by 
denying the importance of the ability for themselves. At the 
college level, this disidentification can lead to academic 
dropouts among Black Americans and proportionally fewer 
women enrolling in math, science, and engineering programs 
where mathematical ability is prerequisite. 


Initial Studies 


Steele and Aronson (1995) reported the first set of four ex- 
periments documenting the impact of stereotype threat on the 
performance of Black American university students, relative 
to their White American counterparts, at Stanford University, 
an elite U.S. university. These investigators told participants 
that difficult and challenging items from the Graduate Record 
Examination (GRE) either were diagnostic of their intellec- 
tual ability (the diagnostic or stereotype threat condition) or 
were a test of problem-solving with no implications for diag- 
nosing their intellectual ability (the nondiagnostic or no- 
stereotype-threat condition). In all four studies, participants’ 
previous Scholastic Achievement Test (SAT) scores in high 
school were statistically controlled in the analyses by means 
of analysis of covariance (ANCOVA) procedures. 

The first two studies by Steele and Aronson (1995) demon- 
strated that Black American participants in the diagnostic 
or stereotype threat condition completed fewer items and 
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attained lower accuracy (i.e., number of items correct relative 
to the number attempted) than did either Black or White par- 
ticipants in the other conditions. Steele and Aronson’s third 
experiment demonstrated that the diagnostic ability manipu- 
lation elicited among Black participants who were expecting 
to take a difficult test (but did not do so) the racial stereotype 
of Blacks held by Whites as well as an avoidance of self- 
characterization in terms of this stereotype, and even an 
avoidance of indicating one’s racial status on a demographic 
postquestionnaire, relative to nondiagnostic and control con- 
ditions. In Study 4, priming race by merely having partici- 
pants indicate their race on a demographic questionnaire 
before attempting a challenging intellectual test served to in- 
hibit performance by Black participants and presumably to 
elicit stereotype threat. Steele and Aronson proposed that the 
mechanism underlying the impact of stereotype threat on the 
test performance of their Black American participants was 
probably an inefficiency of cognitive processing, not unlike 
that produced by other evaluative pressures. 

Croizet and Claire (1998) extended the applicability of the 
stereotype threat concept to those of low socioeconomic sta- 
tus (SES) outside the United States. Using a predominantly 
female sample of French university students, these investiga- 
tors likewise found that under stereotype threat, students of 
low SES obtained fewer correct answers, attempted fewer 
items, and had lower overall accuracy on verbal GRE items. 
By contrast, much like Steele and Aronson (1995) had previ- 
ously found in comparing Black and White American partic- 
ipants, there was no difference in test performance between 
participants of low and high SES when the same test was 
described as nondiagnostic of one’s intellectual ability. Vary- 
ing the salience of SES before the test by having participants 
indicate their parents’ occupation and educational level, how- 
ever, had no effect in this study. 


Recent Studies 


Spencer, Steele, and Quinn (1999) themselves applied stereo- 
type threat theory to U.S. women’s math performance in three 
studies including math-oriented students who had taken cal- 
culus and had performed highly on the high-school mathemat- 
ics section of the SAT. Their first experiment demonstrated 
that a gender difference, with women underperforming men, 
occurred only when the math GRE items used to assess math 
performance were difficult rather than easy. Spencer and his 
colleagues varied stereotype threat in the next two studies by 
informing participants either that there was a gender differ- 
ence previously obtained with the math GRE items they were 
to solve (threat condition) or not (no stereotype threat). In 
the no threat condition, women’s performance on the math 


GRE test equalled that of men. By contrast, in the threat 
condition, women underperformed men. Finally, their third 
experiment demonstrated that the stereotype threat effect 
was obtainable at a state university in the U.S. whose 
academic standards were less rigorous and selective than 
the elite university samples in prior studies and further ex- 
plored possible mediating processes. The mediational tests 
excluded evaluation apprehension and self-efficacy as a basis 
for the impact of stereotype threat on women’s math perfor- 
mance. Anxiety emerged as a weak mediator of stereotype 
threat. 

Finally, recent studies by different sets of investigators 
show that stereotype threat can affect the performance of 
White majority group members and does not require that one 
be a member of a historically stigmatized group. Aronson 
et al. (1999) conducted two experiments in which White 
students of high math ability at an elite U.S. university were 
presented information that Asian Americans outperform 
Whites in math (stereotype threat condition) or not (no threat 
condition). Additionally, in the second study they selected 
math-oriented students who scored on the bottom and top 
tertiles of rated importance of mathematics ability to their 
self-concept as a means of assessing low versus high identifi- 
cation with this domain. Their first study showed that White 
students performed less well on a challenging math test when 
threatened with a racial stereotype indicating their inferiority 
relative to Asians. Their second study showed that this 
stereotype effect occurred only when the White students were 
math-identified and that evaluation apprehension was a 
weak, potential mediator. 

Stone, Lynch, Sjomeling, and Darley (1999) took advan- 
tage of a golf test that they presented to Black and White 
Princeton University participants as indicating their “natural 
athletic ability” or their “sports intelligence.” Their first study 
showed that performance by Black students on the golf test 
suffered more when it constituted a stereotype threat (an in- 
dication of sports intelligence—a negative stereotype for 
Whites) than when it did not (an indication of natural athletic 
ability—a positive stereotype for Blacks). By contrast, for 
White participants for whom the opposite was true De, 
sports intelligence is a positive stereotype, and natural ath- 
letic ability is a negative or less positive stereotype than for 
Blacks), the reverse pattern was obtained, as predicted from 
stereotype threat theory. Their second study, focusing on 
White participants only, showed that the detrimental effects 
of stereotype threat on performance on the golf test occurred 
only for “engaged” participants for whom performance in the 
athletic domain was important to their self-worth and not for 
those who were “disengaged.” In addition to showing the im- 
portance of engagement for the stereotype threat effect, their 


explorations of mediators for White participants implicated 
performance anxiety and lowered expectations when the 
task’s difficulty became apparent. 

In sum, accumulating evidence suggests that the stereotype 
threat effect is real and that its effects can be demonstrated 
among historically stigmatized groups such as Black 
Americans and White women as well as nonstigmatized 
groups. Also, apart from the obvious importance of a person 
being engaged and identified with the domain (e.g., math, 
athletics, etc.), the precise mechanisms responsible for 
the stereotype threat effect remain somewhat ambiguous. 
The preceding studies have assessed an array of potential 
mediators—such as self-handicapping and situational and 
trait anxiety, as well as test anxiety, evaluation apprehension; 
self-concept, and so on—with self-report measures. Weak 
evidence of potential mediators has emerged, though not the 
same ones across studies and groups. Perhaps different 
mechanisms will ultimately be shown to be important for dif- 
ferent target groups. What seems clear at present is that the 
stereotype threat effect is not due to a lowering of effort, as 
stereotype-threatened individuals typically work or try harder 
than their nonthreatened counterparts. On the other hand, 
stereotype threat seems to act as a distractor and an additional 
pressure that reduces one’s effectiveness for successfully 
completing challenging tasks at the limit of one’s ability in a 
given domain. 

Advocates of stereotype threat theory suggest that their 
perspective is optimistic in that it points to a situational stres- 
sor as a key factor in underperformance by negatively stereo- 
typed and stigmatized groups, in contrast to dispositional 
interpretations of innate inferiority in ability, genetic factors, 
and so on. Stereotype threat theory also provides a viable ex- 
planation for why academic achievement tests have lower 
criterion validity for stigmatized groups in the U.S. and else- 
where than for nonstigmatized ones. Once the deleterious ef- 
fects of stereotype threat are identified and understood, steps 
to counteract them in standardized testing and in academic 
learning environments can be developed—a process that 
Steele and his colleagues have already begun with some no- 
table success (see Aronson, Quinn, & Spencer, 1998; Steele, 
1997; see also the chapter on reducing prejudice by Dovidio 
in this volume). 


Relative Deprivation, Perceived Discrimination, 
and Desire for Corrective Action 


Paradoxically, members of oppressed groups do not always, 
or even often, respond to stereotypes, disadvantage, de- 
privation, and discrimination by seeking redress or social 
change. Relative deprivation theory (RDT) is one conceptual 
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framework that tries to predict when and why members of an 
oppressed group will respond to their disadvantage with 
attempts to instigate social change, such as political protest. 
As its name implies, RDT assumes that one’s feelings of de- 
privation are not absolute but instead depend on the individ- 
ual or group with whom one compares. 

RDT proposes different types of deprivation as defined by 
two dimensions. One dimension concerns the focus of com- 
parison and defines the distinction between egoistic and fra- 
ternalistic relative deprivation (RD). Egoistic RD occurs 
when an individual feels deprived relative to others in their 
membership group. Fraternalistic RD (also called collectivis- 
tic RD by those of us preferring a gender-neutral label) oc- 
curs when one’s group is perceived to be at a disadvantage to 
one or more out-groups. The second dimension concerns the 
cognitive-affective distinction. Cognitive RD concerns the 
perception of inequality, whereas affective RD refers to re- 
sentment over inequalities. Taken together, these two dimen- 
sions define four types of RD. Reviews of RDT (e.g., Dion, 
1986) indicate that of these four types, it is primarily affec- 
tive, collectivistic RD Oe, resentment over poorer treatment 
of one’s group compared to other groups) that best predicts 
desires and attempts at social change. 

In a series of studies, I and my colleagues have pitted per- 
ceived discrimination against measures of RD types to assess 
their relative efficacy at predicting attitudinal measures of de- 
sires to take corrective action (Dion, 2002). With groups in 
Canada such as lesbians and gays, Chinese university stu- 
dents, and women, we have consistently found that perceived 
discrimination is a more powerful and consistent predictor of 
reported desires to corrective action than are the different RD 
types, with the notable exception of affective, collectivistic 
RD (e.g., Birt & Dion, 1987; Dion & Kawakami, 2000). To- 
gether, perceived discrimination and affective, collectivistic 
RD predict desires to take corrective action in response to 
group disadvantage quite well. Relatedly, Foster (2000) has 
shown that global attributions of gender discrimination (i.e., 
seeing gender discrimination as affecting many situations in 
one’s life) was also associated with greater proneness to sup- 
port collective action in U.S. college women. Thus, the vic- 
tim’s perceptions of discrimination—whether it is seen as 
being global in its effects, whether it affects one’s group, and 
whether it evokes a negative affective response—make a dif- 
ference in stimulating desires to take corrective action and to 
mobilize one’s efforts with others to create social change. 


Conclusion 


Perceived prejudice and discrimination are pivotal in the psy- 
chology of ethnic and intergroup relations. The literature on 
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the psychology of being a victim of prejudice and discrimina- 
tion that was reviewed earlier suggests several conclusions. 
First, for some groups and for some individuals within op- 
pressed groups, perceptions of prejudice and attributions of 
setbacks to prejudice may buffer self-esteem and maintain 
well-being. However, the buffering effect of attributed preju- 
dice is probably a weak one, may occur for only some groups, 
involves a tradeoff between types of self-esteem and per- 
ceived control, and is mediated or moderated by in-group 
identification. Somewhat perversely, the buffering effects of 
perceived discrimination on self-esteem seem to be more 
straightforward and clearer for members of dominant than of 
subordinate groups. Second, the experience or perception of 
prejudice and discrimination toward oneself and one’s group 
is unquestionably stressful, although personality-based hardi- 
ness and in-group identification may moderate discrimination- 
related stress to some extent. Discrimination-related stress 
has been linked to mental and physical health outcomes for 
both American women and Black Americans. Stereotype 
threat—the perception of being negatively stereotyped by 
others in academic and other domains—is also a stressor 
whose deleterious effects on achievement task performance 
are now established, although the mediators are unclear. Fi- 
nally, Some evidence suggests that perceived prejudice and 
discrimination, along with feelings of resentment about in- 
group disadvantage relative to other groups, instigate desires 
to take corrective social action. These conclusions demon- 
strate that our knowledge of the psychology of victimization 
has advanced appreciably in the last several decades of the 
twentieth century. 


A FINAL THOUGHT 


Having considered the psychology of bigotry as well as the 
psychology of being a victim of prejudice and discrimination, 
a next step for future psychological research on prejudice 
may be to explore the reciprocal interaction between bigot 
and victim. To date, the psychology of bigotry and the psy- 
chology of being a victim of prejudice and discrimination 
have been investigated separately from one another and have 
focused heavily on intrapersonal dynamics (e.g., the effects 
of automatic processing on a person’s cognitions and behav- 
iors). Yet, some previous theorists (e.g., Dion et al., 1978) 
have suggested that the bigot and the victim of prejudice form 
a complementary role relationship with one another. Under- 
standing the interpersonal dynamics of prejudice may require 
investigating situations in the laboratory and the community 
where victims of prejudice confront the bigotry, whether from 
one or more persons or an institution, directly. As always, 
psychological researchers interested in prejudice will rise to 


the methodological and theoretical challenges of exploring 
the reciprocal interactions between bigot and victim. 
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THE CONCERN FOR JUSTICE—THE 
JUSTICE MOTIVE 


Claims for justice and protests against injustice are ubiquitous 
in social life. Political movements, revolutions, and wars 
are initiated under the banner of justice. Justice is a prominent 
issue in all fields of politics. The courts are swamped with law 
suits, and their judgments are accepted without protest or 
bitterness only when they are considered to be just. Perceived 
injustices are at the core of everyday conflicts in private life. 
Close relationships are put at risk by experienced injustice. 
Victims of misfortune have to cope with the perceived injus- 
tice of their fate and—not seldom—with being derogated and 
blamed by others who try to preserve their belief in a just world 
by reconstructing the observed misfortune as deserved. 
Humans are averse to injustice—they have a justice mo- 
tive. Lerner (1977, 1980) was the first to outline the essential 
psychological features of the justice motive. He has empha- 
sized the human need for justice, including the need to 
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believe in a just world in which everybody gets what he or 
she deserves. Lerner has assigned the belief in a just world 
(BJW) a key role in his theory of the justice motive: When- 
ever people are confronted with injustices, their BJW is 
challenged and they try to preserve what Lerner calls the 
fundamental delusion either by efforts to restore the violated 
justice in the real world or to restore it mentally by reinter- 
preting the reality to minimize injustice. According to Lerner, 
people build up a personal contract whereby they are oblig- 
ated to observe rules of justice, expecting that others do the 
same. 

Can this human concern for justice be explained? And 
what would such an explanation mean? It is indisputable that 
views of what is just and what is unjust in general and in par- 
ticular vary among individuals, subcultures, and cultures. 
Some individuals and cultures are more concerned about jus- 
tice than others are. Moreover, their views are subject to 
intra-individual and to historical changes. It is worth explor- 
ing the sources of these variations and changes: cultural 


538 Justice, Equity, and Fairness in Human Relations 


traditions, acculturation, information about the implications 
of applying specific rules of justice, personal and socially 
shared values, the impact of social roles and positions (for 
reviews, see Tyler, Boeckman, Smith, & Huo, 1997), person- 
ality traits, and basic belief systems (cf. Schmitt, 1994, for a 
review). It is not the aim of this chapter to review and discuss 
this kind of research. Rather, the influential theory that the 
concern for justice has the instrumental function of maximiz- 
ing self-interest is disputed. 


Justice: A Means to Serve Self-Interest? 


The view that people care about justice purely as a means to 
pursue their own self-interest is prominent in social psychol- 
ogy (a review is provided by Tyler et al., 1997). It forms 
the core assumption in the equity theory of social ex- 
change (Adams, 1965; Homans, 1961; Walster, Berscheid, & 
Walster, 1978), which states that people prefer equity as a 
strategic choice to maximize their individual gains within so- 
cial exchanges on either the short or the long term. 

Thibault and Walker (1975) took a similar approach to 
explain people’s preference rankings for different ways of 
settling conflicts with others—by negotiation, mediation, 
arbitration, or court decisions. They supposed that people’s 
preferences were guided by their personal self-interests. The 
best strategy is to keep control over the outcome. In situations 
in which people have only limited control over the outcome 
because authorities have decisional power, people seek pro- 
cedural control and attempt to influence the outcome by hav- 
ing a voice and by presenting arguments, their views, and 
evidence. Therefore, people view those procedures in which 
they have voice and influence as fairer than others. That 
means that the fairness ratings given to procedures are 
dependent on the indirect control over the outcome that these 
procedures allow. 

The reasons given for this reduction of the concern for jus- 
tice to a concern for self-interest mirror social contract theo- 
ries in political philosophy. Philosophers—from Hobbes 
(1648/1970) to Rawls (1971)—have tackled the question of 
why individuals living in a fictional “original” prestate situa- 
tion consented to build a state with powerful institutions, 
laws, and rules of justice. The basic answer is that it was to 
their best mutual advantage to restrict their egoistic fight for 
their own interests by establishing a system of social norms 
which would and could regulate their rights and obligations 
in competition as well as in cooperation. If this idea is gener- 
alized slightly, informal social norms (like most justice 
norms) can also be regarded as serving the mutual advantage 
of all (Hardin, 1996). Both the establishment of a state with a 
system of rules and of powerful institutions to ensure their 


observation can be considered to be rational choices in the 
well-understood self-interest of individuals, especially when 
the social system is structured in a way that even the weak, 
less fortunate, and less able individuals participate in the 
common wealth (Rawls, 1971). 

For several reasons, this rational choice modeling of the 
justice motive is disputable. 


e What is missing in these theoretical accounts is the norma- 
tive, prescriptive core of justice. Justice is an ought, a 
moral imperative for social life. It is not a means to achieve 
personal aims, but rather an end in itself (Montada, 1998a). 
People are obligated to observe the norms of justice re- 
gardless of whether this is in their self-interest. Moreover, 
they are entitled to claim justice from other actors, organi- 
zations, state institutions, and so on, not only for them- 
selves but also for others. 


e The model does not fit to empirical data showing that 
justice concerns are not reducible to self-interests and may 
in fact conflict with self-interests. 


e There is no empirical proof of the reductionistic view of 
the model; it is merely an anthropological assumption. As 
such, it is part of a belief system and is not a testable sci- 
entific hypothesis. These three arguments are elaborated 
in more detail in this section. 


Justice as an End in Itself 


In every rational-choice explanation of the justice motive, it 
is not the concern for justice that is the primary motivational 
factor—rather, it is concerns for one’s self-interests. If justice 
matters at all, then it matters only insofar as it serves these 
other concerns. This implies that justice is not acknowledged 
as a prescriptive normative standard. 

When the equity principle is applied, inequity is unjust 
and needs to be rectified. The disadvantaged are entitled to 
claim equity, and anybody observing inequity is entitled or 
even obligated to claim equity for the disadvantaged. Those 
who are overbenefited within an exchange relation are 
morally obligated to reestablish equity. 

The prescriptive nature of justice norms is not dependent 
on the actors’ self-interest or on observers’ sympathies with 
the advantaged or disadvantaged party in a social exchange 
relationship. Of course, self-interest may motivate actors to 
interpret and to balance the investments and benefits to their 
own advantage, and observers’ sympathies with advantaged 
actors, for instance, may motivate them to discount the in- 
vestments or overestimate the benefits of the disadvantaged 
in order to avoid or cope with feelings of injustice. Such 
biased distortions of so-called objective justice, however, do 


not mean that justice and its standards are disregarded. They 
can perhaps be interpreted as trade-offs between self-interest 
and justice motives, but not as a reduction of the justice 
motive to self-interest; this is not what Walster et al. (1978) 
and others wanted to explain by referring to self-interest. 
What they wanted to explain was why people care about 
justice in the first place. 

Concern for justice may mean that people claim justice for 
themselves—for example, equitable shares in their social ex- 
changes, voice in disputes, and so forth; it is incontestable 
that this serves self-interest. But concern for justice means 
also that people claim or (at least) that they concede justice 
for all parties involved, equitable shares for all actors, or 
equal voice for all. Equal justice for all involved does not and 
must not result in maximum benefits for the subject who 
raises these claims: Equitable shares are usually less than the 
maximum share, and voice may benefit the other party if he 
or she has the better arguments. There is consensus among 
philosophers of justice that the crucial test of whether actions 
are based on a concern for justice is whether the actor not 
only claims justice for him- or herself, but at least concedes 
justice for others, if not claiming it for them. 

Of course, the rules of justice may be applied for reasons 
other than establishing justice. Self-interest and further con- 
cerns may provide the motivation. 


e Politicians may fight for more justice merely in order to 
win the votes of those whose claims they are voicing. 


e In business exchanges, equity rules may be observed be- 
cause the actors expect that this will pay off in the long run. 


e Companies may expect just wages to increase efficiency 
by stabilizing the motivation and performance of their 
employees. 


e Retaliation may be used for the rational reason that it will 
stop continued defection (egoistical behavior) by another 
party. 

e Applying the rule of parity in allocations within a group 
may be a rational choice aimed at furthering the social 
cohesion of the group. 


e Responsiveness to needs of one’s spouse may express 
one’s sympathy and love (and may not be an application 
of the need principle of justice). 


In these cases, justice is not the primary concern—if it is a 
concern at all—but rather comes into play as a means of pur- 
suing the actors’ other concerns. This is a tactical use of jus- 
tice that does not require that the actors have internalized the 
relevant justice principles, that they believe in their validity, 
or that they are convinced that these principles should be 
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applied by everyone in similar cases. It can be assumed that 
concerns for justice—if they exist at all—will be trumped by 
the actors’ primary concerns if there ever were a conflict 
between the two motives. 

This is not the case when justice is the actors’ primary 
concern or at least one of his or her main concerns. In these 
cases, actors are committed to establishing, reestablishing, or 
maintaining justice—whatever they may hold to be just. In 
such cases, concerns for justice are not easily trumped by 
other concerns. 

It is well known in psychology that actors usually have 
more than one concern in a given situation and that different 
concerns may come into conflict. We need to use valid diag- 
nostic measures to show that justice was one of the concerns 
taken into consideration in a given situation, even if it was ul- 
timately trumped by other concerns. If justice was only con- 
sidered for tactical reasons, actors who neglected justice in 
order to achieve the desired outcomes should not be expected 
to show signs of moral disquiet about their behavior. 

In cases in which justice is a concern in itself, its neglect 
causes feelings of guilt and possibly also efforts to correct 
one’s actions, compensate for the resulting injustice, beg for 
pardon, excuse or justify one’s actions, minimize the unjust 
consequences mentally, deny the injustice by blaming the 
victims or the disadvantaged, and so on. These are possible 
indicators for the neglect of justice concerns by acting 
subjects. 

In social interactions, the rules of justice may be neglected 
or violated by others. In this case, subjects’ concern for 
justice may be indicated by explicit claims for justice, resent- 
ment of others’ behavior, criticism of this behavior, punish- 
ment, or retaliation; but this concern may also be indicated 
by mental reconstructions aiming at minimizing or denying 
the perceived injustices, as shown in the literature on belief in 
a just world and observed injustices (cf. Lerner, 1977, 1980; 
Montada & Lerner, 1998, for overviews), as well as by cop- 
ing with suffered injustice (cf. Montada, 1994). Both victims 
of injustices and observers who are not directly affected may 
have concerns for justice and have to cope with experienced 
or perceived injustice in one way or the other. 


Empirical Evidence for the Justice Motive 
as a Primary Motive 


Looking at empirical research, we find much evidence that 
does not fit the rational-choice modeling of the justice con- 
cern. Instances of resentment in which the resenting subjects 
do not have any vested interests of their own but nonetheless 
commit themselves to costly and potentially risky attempts to 
restore justice are especially significant in this respect. It is not 
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unusual for social movements to be initiated and supported by 
people without any vested interests of their own. Think of 
Keniston’s (1970) study of young radicals involved in the 
anti-Vietnam-war movement, and of the studies Haan, Smith, 
and Block (1968) and Fishkin, Keniston, and McKinnon 
(1973) have conducted about activists in the 1960s civil rights 
movement. Moral orientations and social responsibility were 
the motivational bases identified here. Phenomena such as the 
survivor guilt described for Holocaust survivors (L. Baron, 
1987), Hiroshima survivors (Lifton, 1967), and released pris- 
oners of war (Lifton, 1954) demonstrate that not all people 
who have been favorably advantaged are able to enjoy their 
good fortune. 

They perceive the disadvantaged victims as belonging to 
their own community of solidarity (Deutsch, 1985), whereby 
equality and need are postulated to be the preferred justice 
principles (Deutsch, 1975) and communal orientations are 
prevalent (Clark & Chrisman, 1994; Lerner & Whitehead, 
1980). Managers’ feelings of guilt after layoff decisions 
(Lerner, 1996) indicate that members of the management feel 
more obligations toward the staff than expected. Guilt felt by 
survivors of layoffs (Brockner, 1994) is another example. As 
Cohen (1986) has pointed out, the application of standards of 
justice depends on the psychological boundaries of the com- 
munity one has in mind. Susan Opotow (1996) uses the term 
scope of justice to depict the fact that some people draw their 
personal boundaries much wider than do others. Those who 
are concerned about global inequalities (Olson, 1997) have a 
wide scope of justice. If self-interest is the dominant concern, 
the scope of justice will not be extended to include those who 
are disadvantaged relative to oneself. 

We have studied the ways in which relatively privileged 
people respond to the misery, the problems, and the poor life 
conditions of less fortunate others: poor people in developing 
countries, unemployed individuals in their own countries, 
physically handicapped people, and foreign guest workers 
with unfavorable working and living conditions (Montada, 
Schmitt, & Dalbert, 1986; Montada & Schneider, 1989, 1991; 
Montada, Schneider, & Reichle, 1988). How do people in rel- 
atively privileged life situations respond emotionally when 
confronted with the hardships and the misery of the disadvan- 
taged? Do they respond with sympathy or angry reproaches, 
pride in their own achievements, satisfaction about their 
higher standards of living, feelings of guilt about their rela- 
tively privileged situations (which they may not consider to 
be entirely deserved), or resentment about the unjust treat- 
ment of the disadvantaged? We found large interindividual 
differences in these emotional responses. Guilt feelings— 
which in this case we called existential guilt and resentment— 
were with respect to their intensity normally distributed 


emotions in large heterogeneous samples, rather than being 
rare or exotic abnormalities. Guilt and resentment have mean- 
ingful correlates; some examples include perception of the 
existing inequalities as unjust, cognitions that one’s own 
higher standard of living and the lower standard of living of 
others are causally related, or cognitions that the inequalities 
could be reduced by redistribution—and preference for 
the need-based principle of distributive justice (over the 
contribution-proportional principle). It has been shown that 
both guilt and resentment dispose people to perform prosocial 
activities in favor of the disadvantaged (guilt is more closely 
associated with personal sacrifices, resentment with political 
protest). It could also be proved that guilt and resentment 
were not reducible to self-interests of the privileged which 
was also assessed in these studies—namely, in terms of fear of 
losses through forced redistribution and anger at the disad- 
vantaged because of their lack of self-help. Guilt and resent- 
ment proved to be not reducible to fear of loss or to anger at 
the disadvantaged. 

In these studies, we tried to disentangle justice and self- 
interest by looking more closely at people who are better off 
than others are, consider their views and standards of justice 
violated to their own advantage, and feel morally uneasy 
about this situation. They feel responsible for helping to cor- 
rect the injustice. Other researchers (De Rivera, Gerstman, & 
Maisels, 1994; Edelstein & Krettenauer, 1996) have come to 
similar conclusions. Such findings recall those of equity re- 
search, in which distress was observed in people who were 
overbenefited. 

Whereas justice claims arising from a position of relative 
deprivation can easily be interpreted to be self-interested, this 
is not the case when justice is claimed for the disadvantaged 
by those in a more privileged position. From the perspective 
of rational choice theory, one could of course ask Isn't it a 
rational choice, serving self-interest in the long run, to 
correct the gross inequalities existing all over the world, for 
instance, to prevent violent rebellion by the disadvantaged? 
The counterquestion to this would be Why guilt and re- 
sentment instead of fear of their violent efforts to restore 
equality—or instead of cool, strategic deliberation how to 
prevent their violent attacks at the status quo? 

Whenever self-interest has been assessed and factor ana- 
lyzed together with justice scales, the independence of these 
variables was demonstrated (e.g., Montada & Schneider, 
1990; Moschner, 1998). 


Traps of Reductionism 


Reducing the number of human motives seems to correspond 
to the ideal of parsimony in theory construction. Of two 


theories, the one with fewer postulates is the more parsimo- 
nious. Such a comparison presupposes that both theories 
explain the same empirical phenomena and allow the same 
predictions. A motivation theory that posits only a small 
number of motives—or even a single one—would seem to 
be more parsimonious than would one that offers a larger 
number of motives. 

The parsimony argument may have added to the reduc- 
tionistic stance that the justice motive (as well as other mo- 
tives such as altruism, social responsibility, love, etc.) can in 
fact be reduced to or unmasked, so to speak, as self-interest. 
The economic analysis of behavior (Ramb & Tietzel, 1993) 
suggests that a great variety of behavior (if not all) can be ex- 
plained by assuming some degree of self-interest as the basic 
motivation. This idea is illustrated in the following examples 
(cf. Montada, 1998a): 


e Hypothetically, caring for disabled parents can easily be 
traced back to selfish motivations such as the desire to cul- 
tivate a favorable public image or to ensure that the par- 
ents do not withdraw their love or financial support. 


e Hypothetically, improvements in community or state care 
for the poor can be interpreted as enhancing political 
leaders’ chances of being elected by these less privileged 
voters. 


e As mentioned previously, the avoidance of opportunistic 
and selfish behavior can reasonably be interpreted as self- 
serving in the context of continued social exchanges. 


The economic theory of behavior allows elegant so-called 
explanations of every action by tracing a path to some basic 
hypothetical self-interest (e.g., Baurmann & Kliemt, 1995). 
With some ingenuity, it is possible to generate hypotheses re- 
ducing every surface motive to an underlying self-interest, or 
to unmask it as ultimately serving self-interest. This kind of 
hypothesizing may be creative, but it clearly does not consti- 
tute valid scientific proof of the hypotheses proposed. Instead 
of asking the scientific question What explains X?, rational- 
choice theorists ask How might a rational-choice theory 
explain X? (Green & Shapiro, 1994, p. 203). Bunge has 
therefore criticized rational-choice modeling, arguing that it 
has “inhibited the search for alternatives” (1989, p. 210). 
Approaching the scientific task of explaining the interindi- 
vidual and intra-individual variance of human behavior with a 
single-motive model is counterproductive because this single 
motive (maximizing one’s self-interest) does not contribute to 
the explanation of the behavioral variance. The statement that 
a person’s behavior is motivated by self-interest has no infor- 
mational value and no scientific validity as long as alternative 
motives such as altruism, social responsibility, the justice 
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motive, and moral obligation are not tested and excluded 
by empirical data. Furthermore, the seeming parsimony of a 
single-motive model is offset by the necessary increase in the 
number of corollary hypotheses needed to predict and explain 
the behavioral variance and the diversity of behaviors ob- 
served. Single-motive conceptions may best be understood as 
anthropological predecisions without scientific validity or 
utility. They are part of a belief system, not of a scientific 
knowledge system. 


Trade-Offs Between the Justice Motive 
and Other Motives 


Lerner (1977, 1980, 1998) has stressed the categorical nor- 
mative quality of the justice motive as a primary motive and 
as an aim in itself. After it is internalized as a normative stan- 
dard, justice imposes itself as an ought—valid for oneself and 
for others, not as an option that can be rationally deliberated 
and chosen when it seems functional for a particular goal or 
disregarded without moral disquiet if other options arise. 
Defining justice as an ought implies that unjustified viola- 
tions evoke moral emotions—guilt when the subject him- or 
herself has failed to meet the requirements of the ought by 
action or omission, and resentment when others have done so 
(Montada, 1993). 

We agree that the justice motive may come into conflict 
with other motives such as self-interest, but can it be trumped 
by egoism without remorse (cf. Lerner, 1996, on managers’ 
guilt after layoff decisions)? Lerner doubts whether humans 
can give up the fundamental delusive belief that the world is 
a place where everybody gets what he or she deserves— 
ultimately, at least. This motivated belief in a just world 
(BJW) is supported by what Lerner has called the personal 
contract to observe the rule of justice. 

Lerner has contributed and instigated a wealth of empiri- 
cal studies showing that the justice motive does not always 
appear as a straightforward application of standards of jus- 
tice. Trade-offs between what one deserves and what others 
deserve are elaborated in specific situations, as are trade-offs 
between justice concerns and other concerns such as self- 
interest (Montada, 1998b). The three psychologically fasci- 
nating phenomena described in the following passage can be 
interpreted as examples of trade-offs between justice con- 
cerns and self-interests. 

Blaming innocent victims is a phenomenon observed in 
many experiments conducted by Lerner and his students 
(cf. Lerner, 1980), as well as in studies carried out elsewhere 
(cf. Furnham, 1998; Maes, 1998; Montada, 1998b). Blaming 
victims is plausibly interpreted as subjects’ doing an injustice 
in an attempt to preserve their belief in a just world, which is 
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or would be threatened by the victimization of innocent 
people. Belief in a just world can be assumed to be a psycho- 
logical resource that may be defended by attributing respon- 
sibility and blame to the victims themselves, thus reframing 
the injustice of their victimization. 

Another exciting phenomenon is the exchange fiction, 
discussed in more detail in works by Holmes, Miller, and 
Lerner (cited in Lerner, 1977) and Lerner (1980). It has been 
observed that many people tend to prefer buying an over- 
priced article when they know that the profits will go to 
needy people, rather than directly donating an amount of 
money corresponding to the price difference. The explana- 
tion for this phenomenon is that helping a needy person es- 
tablishes a commitment and personal responsibility for this 
and other needy persons. Thus, any act of helping is prob- 
lematic in several respects: It implies an acknowledgment of 
undeserved neediness which is threatening the belief in a just 
world; it creates a continuing and generalized responsibility 
for needy people, a responsibility that may interfere with 
personal concerns; and finally, it creates further injustices 
with respect to all other similarly needy people who have not 
been helped. Yet provided that they are not to blame for hav- 
ing inflicted their hardships upon themselves, needy people 
deserve to be helped. This conflict is best solved by an ex- 
change fiction that allows the donation to be masked as a 
purchase. 

The third phenomenon is called the free riding dilemma. 
Everyone will agree that free riding—profiting without 
investing—serves self-interests. Those who do invest resent 
free riders for their selfishness. A relevant observation here 
is that opinion polls conducted in Europe during the 1990s 
revealed that two thirds of employees were in principle will- 
ing to reduce their working hours and income (on average 
by 10% to 20%), if this would result in the creation of new 
jobs for unemployed persons. However, this willingness 
was very rarely translated into action. At least two hypothe- 
ses explain this discrepancy: (a) Self-interest (preserving 
one’s level of income) finally outweighs the justice motive, 
and (b) moving the targets of social comparison changes the 
objects and contents of the justice motive. When considering 
mass unemployment and its undeserved sequelae, it would 
be a just decision for the relatively privileged to share their 
working time and income. When comparing themselves with 
free riders (other full-time employees who are not willing to 
share their privileges), however, subjects who do choose 
to share would feel relatively deprived. Thus, not sharing 
can be justified as long as free riding is not prevented at the 
societal level. What at first glance appears to be selfish be- 
havior may well be motivated by justice concerns (Montada, 
1998b). 


JUSTICE: A UNIVERSAL CONCERN 
WITH DIVERGING VIEWS 


The concern for justice seems to be an anthropological uni- 
versal. However, there does not appear to be a universal con- 
sensus on what is considered to be just or unjust. We speak of 
justice in the singular as if there were only one single, just 
solution for every social system and for every problem or 
conflict; yet frequently, there are diverging views about what 
would be just, which criteria should be applied, and how they 
should be applied in order to establish justice. This is true for 
all domains of social life in which justice is critical: distribu- 
tions, social exchanges, and the retribution and acknowledge- 
ment of deeds. The application of different standards of 
Justice results in diverging and conflicting outcomes. A com- 
mon view about what is just and what is unjust would be 
helpful to avoid and to settle social conflicts in private and 
business contexts, as well as in the political arena within and 
between societies. 

The normative nature of justice is obvious. The aim of nor- 
mative disciplines is to analyze and account for normative 
standards, to elaborate reasonable and just solutions for 
specific cases, and to conceive criteria and procedures for just 
decision-making. The ultimate challenge for normative ap- 
proaches may be to find universally valid solutions. In view of 
the difficulty of that task, the focus may be displaced from the 
concrete solutions to the procedures of finding a solution— 
precisely as in discourse ethics (Ackerman, 1980) in which 
ideal rules of discourse are considered a guarantee for the eth- 
ical truth of the result. 

It is not the aim of empirical approaches to propose the 
best standards of justice and the best solutions for justice 
problems. Instead, the following questions are investigated: 
What do people consider to be just and unjust? How diver- 
gent or convergent are the views about justice? Which are the 
belief systems and dispositions that influence people’s per- 
ceptions of justice and injustice? What is the motivational 
impact of experienced or observed injustice? How do people 
cope with experienced or observed injustices? How can 
justice conflicts be settled? How can one-sided views of jus- 
tice be qualified? What is the impact of procedures on the 
appraisal of decisions? 

There are interfaces between normative and empirical 
approaches: Philosophical theories of justice imply anthro- 
pological assumptions that need to be tested empirically 
(e.g., Frohlich & Oppenheimer, 1990, on Rawls’ theory of 
justice), and the ethical validity of empirically assessed 
views about justice and injustice as well as the claims for 
justice has to be examined normatively and cannot be taken 
for granted. 


THE JUSTICE OF DISTRIBUTIONS 


Standards of Distributive Justice 


In the domain of distributions—that is, the allocation or the 
existing distribution of material resources or symbolic goods, 
rights, duties, positions, power, opportunities, taxes, and so 
on, within as well as between groups and populations— 
equality certainly constitutes the basic idea of justice. How- 
ever, equality can be specified in many different ways. It may 
mean equal shares for all human beings or—if that is impos- 
sible or dysfunctional—equal opportunities. Alternatively, 
according to Aristotle, it may mean equal shares or equal op- 
portunities for all equal human beings. This second view im- 
plies that human beings differ and that specific individual 
differences justify unequal allocations and given inequalities 
in the distribution of resources, rights, duties, and so forth 
(e.g., according to citizenship, social status, specific merits, 
professional qualifications, productivity, age, conventional 
rights, gender, neediness). In this sense, equality means equal 
shares for all those with the same status, for all those with 
the same kinds of merits, needs, conventional rights, and so 
forth. That implies unequal shares for people with different 
status, merits, and so forth. A particular variant of the equal 
opportunity rule is the equal chance rule, in which a lottery 
procedure is used when it is impossible for goods to be split 
up (e.g., in the allocation of organs for transplantation), or the 
rotation schedule, which may be appropriate when it would 
be dysfunctional for a position to be split. Equal opportunity 
may also mean providing similar material, physical, and so- 
cial conditions for development, for a good life, for health, 
and so on. 

Walzer (1983) somewhat neutralized the justice problems 
with inequalities in allocations and existing distributions by 
his concept of complex equality, which postulates that distri- 
butions in different spheres of justice (material wealth, social 
recognition in various contexts, political power, education, 
kinship and love, recreation time, etc.) are not perfectly cor- 
related. Thus, a lower rank in one sphere may be compen- 
sated by a higher ranking in another. Moreover, the subjective 
importance of the different spheres of justice varies within a 
population, so that the perceived overall inequalities may be 
reduced further. 

However, the question of which differences between peo- 
ple justify which inequalities in allocations and distributions 
remains open. In the ongoing discussions, arguments are in- 
spired by cultural traditions and social philosophies such as 
egalitarianism, liberalism, social welfarism, utilitarianism, 
and the human and civil rights movements. We are far 
from having reached a general consensus on this question; 
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the preferred standards of distributive justice vary be- 
tween and within cultures and they are subject to historical 
changes. 

This fact applies across all fields of distributive justice. 
For instance, the populations of postcommunist states are less 
tolerant of inequalities, and they claim more responsibility 
for individual welfare from the governments than do the pop- 
ulations of states with a liberal tradition (Kluegel, Mason, & 
Wegener, 1995). The rules of justice for the allocation and 
withdrawal of scarce goods such as scholarships, state- 
subsidized housing, jobs, and so on vary largely within and 
across states (Elster, 1992). In the dismissal of employees, 
for instance, the criteria applied include the employees’ 
seniority, acquired skills, current productivity, neediness, 
responsibility for a family, age, and gender. Although it 
may be possible to justify the application of each of these cri- 
teria, each could result in different decisions—not a single 
just solution. Therefore, the application of different justice 
principles can result in grossly diverging outcomes. Given 
this multitude of justice criteria, conflicts about which ones 
would be appropriate in which cases are not surprising. 

Although a long list of justice principles has been identi- 
fied empirically (Reis, 1984), psychological research has 
largely been limited to three of them: equality (equal shares 
for all those within specified social boundaries), allocation 
according to merit or to contributions (achievements, in- 
vestments, etc.), and allocation according to needs. More- 
over, most empirical research has focused on the application 
of these principles in single concrete allocations of material 
goods. Fewer psychological studies are available on the al- 
location of symbolic goods, rights, and positions, on the 
withdrawal of positions, or on the allocation of loads (e.g., 
tasks, taxes; for a comprehensive review, see Törnblom, 
1992). Data have been gathered on individual preferences 
for a particular justice principle in specific cases and con- 
texts (e.g., Schmitt & Montada, 1981), and for culturally 
shared preferences (e.g., Bierbrauer, 2000; Schwinger, 
1980). 


The Choice of a Principle 


Viewing the application of a justice principle as a choice raises 
a question as to the goals of a specific choice: Is the objective 
merely to act or evaluate justly, or to avoid disharmony, to 
strengthen solidarity, to demonstrate solidarity, to motivate 
performances or effort, to punish laziness, to enhance produc- 
tivity within a social system by stimulating competitiveness, 
or perhaps to further the health and growth of the recipients? 
Deutsch (1975), for instance, has argued that people who 
pursue economic productivity should use proportionality to 
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contributions as the allocation principle (because this kind of 
allocation can be expected to motivate recipients to give their 
best), whereas people trying to foster enjoyable and harmo- 
nious social relationships should use the equality principle, 
and those aiming to foster personal growth and welfare may 
well consider the need principle to be appropriate. 

However, as is discussed later in greater detail, these 
goals per se have little to do with justice. Justice as a set of 
social norms creates entitlements and duties (cf. Lerner, 
1987). Which goals allocators pursue has nothing to do with 
justice unless they are actively trying to discharge their own 
duty to observe rules of justice, to observe the entitlements 
of recipients, or—in the case that they have to impose tasks, 
loads, and risks—the legal and moral obligations of the ad- 
dressees. The relevant question, which was also posed by 
Deutsch (1975), is not what choice is functional for which 
goal, but what ought to be chosen for the reason of justice. 
Although equality, equity, and need are principles of dis- 
tributive justice, not every allocation according to one 
of these criteria is for doing justice. In many empirical stud- 
ies the aim of applying a specific principle of distribution— 
doing justice or pursuing another goal—is not adequately 
assessed. 

For instance, Barrett-Howard and Tyler (1986) found that 
proportionality to contributions is more likely to be used as 
an allocation principle when productivity is the goal, 
whereas the equality and the need principles are preferred 
when harmony and welfare are the goals. In experimental 
studies, Mikula and Schwinger observed that most partici- 
pants are polite when asked to propose a way of distributing 
joint earnings. Those who (were made believe to) have con- 
tributed more to the joint undertaking tended to propose 
equal shares, and those who (were made believe to) have 
contributed less propose that the earnings should be allocated 
according to the respective contributions (Schwinger, 1980). 
It is open to question, however, whether the participants in 
these studies considered their allocation proposals to be a just 
solution or to be functional for some other goal like social 
harmony. 

It may well be that justice counts less in some social con- 
texts (e.g., in intimate relationships) and situations (e.g., in 
emergencies) than in others. But it should be assessed 
whether justice is at stake or some other goal. When respon- 
dents are explicitly asked to rate several alternative criteria 
for allocations with respect to their justice, it emerges that 
the equality and the need principles are more frequently ap- 
plied within close relationships (e.g., between friends or 
within stable cooperative working groups), the need princi- 
ple is more frequently applied in health care and welfare 


contexts, and the contribution principle is more often used in 
economic contexts (e.g., Schmitt & Montada, 1981). 


Social Comparisons as a Basis for Justice Appraisals 
Equity Theory 


Much attention has been paid to the equity theory of distribu- 
tive justice—equity as a criterion in the valuation of social 
exchange relationships is tackled later in this chapter—which 
was originally developed in the context of work organiza- 
tions to explore employees’ reactions to their wages and 
promotions (Adams, 1965). The basic components are 
(a) proportionality of contributions (performance, effort, in- 
vested time, expertise, etc.) and outcomes (benefits, grades, 
acknowledgements etc.), and (b) equal ratios of contributions 
and outcomes for similar actors. 

Later on, the concept of equity was used in an inflationary 
manner and was taken as a synonym for what people sub- 
jectively considered to be just or fair, regardless of the crite- 
rion they may apply to judge whether their outcomes were 
equitable: contributions, status, membership, need, or others. 
The assumption was that equity is in the eye of the beholder 
(Walster & Walster, 1975). Used this broadly, equity is no 
longer conceived as one justice criterion or principle among 
others, but rather is synonymous with the justice of out- 
comes. The criteria used by the beholders were not assessed, 
probably because the authors were only interested in the pre- 
diction and explanation of the emotional and behavioral con- 
sequences of the experience of injustice, and not in the 
prediction of experienced injustice on the basis of specific 
criteria (Mikula, 1980). 

Equity theory predicts that people will be satisfied when 
they consider their outcomes (e.g., their wages) to be equi- 
table. They resent receiving too little and feel uneasy about 
receiving too much. Many studies have supported the basic 
assumptions of equity theory. People feel satisfied with 
equitable outcomes and those who feel underbenefited are 
angry—but also those who feel overbenefited feel uneasy 
(for a review, see Tyler et al., 1997). This has been demon- 
strated by subjects’ self reports, by physiological measures of 
the emotional arousal (Markowski, 1988), and by observa- 
tion of behaviors aimed at restoring justice (e.g., adjusting 
one’s job performance, cf. Greenberg, 1988). The first finding 
that people are dissatisfied when receiving less than would be 
equitable does not allow any discrimination between a justice 
motive and self-interest. The second finding, that people feel 
distressed when overbenefited, is stronger evidence for the 
justice motive, which is discussed in the next section. 


The conclusion that equity and inequity are in the eye 
of the beholder does not only mean that various criteria of 
justice may be applied. Rather, subjects may view the values 
of their own contributions and benefits in an entirely different 
way from the way they see the contributions and benefits of 
others. A self-serving bias in appraisals of contributions and 
benefits has been identified in a few studies (Lerner, Somers, 
Reid, Chiriboga, & Tierney, 1991; Schlenker & Miller, 
1977). Therefore, justice conflicts may also arise in cases in 
which all parties apply the equity principle (Montada, 2000). 


The Theory of Relative Deprivation 


Research on distributive justice was instigated by the concept 
of relative deprivation developed by Stouffer, Suchman, 
DeVinney, Star, and Williams (1949). These authors ob- 
served that soldiers’ satisfaction with the promotion system 
within their section of the army was not determined by their 
current position nor by the objective probability of promo- 
tion. (In fact, dissatisfaction was more prevalent in the air 
force than in military police although the air force had a 
higher promotion rate.) Rather, comparisons with similar oth- 
ers had a considerable impact on their level of satisfaction. 
They were dissatisfied when they felt that they were disad- 
vantaged (deprived) in relation to similar others. Depending 
on the availability and the choice of comparison referents, 
people in the same objective situation may be either satisfied 
or dissatisfied. In most studies, the objective social situation 
correlates only weakly with feelings of personal deprivation. 
What are the circumstances leading to feelings of relative 
deprivation? 

Crosby (1976) proposed five necessary and sufficient pre- 
conditions that can be illustrated using the example of wages. 
A person must (a) see that someone else has a higher wage, 
(b) want to have this higher wage as well, (c) feel entitled to 
this higher wage, (d) think it is feasible to be paid a higher 
wage, and (e) lack a sense of personal responsibility for not 
receiving this higher wage. The denial of any personal re- 
sponsibility for one’s relatively disadvantaged situation is a 
necessary condition for feeling entitled to claim the wanted 
good. Feasibility can be defined by using one of the postu- 
lates in Folger’s referent cognition theory (1986): Resent- 
ment will occur when persons can easily imagine obtaining 
the wanted good, implying that they do not perceive any seri- 
ous objective restrictions or barriers. If they do not, some 
actor or agency must be responsible for withholding the 
wanted good. 

Runciman (1966) has distinguished between egoistical 
(personal) and fraternal (group) deprivation. The latter 
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implies that a person views his or her social group or the so- 
cial category to which he or she belongs as disadvantaged 
compared with another social group or category. It is remark- 
able that in Western societies with a liberal tradition, even 
large inequalities in material wealth between social groups 
or categories are not viewed as being unjust by the majority 
of the population, and consequently do not cause feelings of 
group deprivation (Shepelak & Alwin, 1986). This can be ex- 
plained by the dominant liberal ideology that everybody is 
personally responsible for his or her success and welfare. 
When discrimination is made salient and is clearly perceived, 
however, feelings of group deprivation may become more 
prevalent. 

Conceptually, group deprivation does not imply personal 
deprivation: The two have different comparison targets. 
Personal deprivation occurs when individuals perceive that 
they are disadvantaged compared with others of similar so- 
cial status. Group deprivation is based on comparisons with 
groups of dissimilar status. However, high levels of group 
deprivation are less frequent among individuals ranking at 
the lower end of their group’s objective deprivation range— 
that is, among those who are (objectively speaking) the most 
deprived. In fact, the more advantaged members of disadvan- 
taged groups are, the more likely it is that they will resent the 
difference between their group and more advantaged groups 
and engage in protest actions (e.g., Pettigrew, 1964). An ex- 
planatory hypothesis is that they compare themselves with 
members of the more advantaged group and feel personal de- 
privation in relation to them (D. M. Taylor & Moghaddam, 
1994). For instance, women in higher-status positions who 
earn significantly more money than does the average woman 
resent the gender-bound inequalities in earnings more than 
do women in low-wage groups (Crosby, 1982). Thus, rela- 
tive group deprivation may be mediated by a perceived per- 
sonal deprivation because the choice of comparison target 
may cross the borderline between social groups (Zanna, 
Crosby, & Lowenstein, 1987). 

The 1960s civil rights movement in the United States 
emerged during a period in which the disadvantaged were 
making economic social gains. The observation that protest 
against discrimination becomes more probable within up- 
ward economic and social development can be explained by 
the hypotheses (a) that comparisons with advantaged groups 
become more likely (Pettigrew, 1972), and (b) that reality 
cannot keep pace with raised expectations and feelings of en- 
titlement to further improvement (Gurr, 1970). 

Fraternal (group, collective) deprivation has different con- 
sequences from those of personal deprivation. People are 
more likely to admit the existence of an unjust discrimination 
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and disadvantaging of their group than of themselves. This 
corresponds to the well-known better-than-average phenom- 
enon: J am personally better off than the average member of 
my group (e.g., Crosby, 1982, for women’s appraisal of jus- 
tice in wages). People are more likely to engage in protest 
when they perceive their group as relatively deprived (e.g., 
Dion, 1986; Dubé & Guimond, 1986). One explanatory 
hypothesis is that personal deprivation is more likely to be as- 
sociated with symptoms of depression than with outrage 
against an unjust system (Hafer & Olson, 1993). Another hy- 
pothesis is that protest against personal relative deprivations 
can be attributed to envy, an emotion and motive that has 
negative connotations. Protest against fraternal relative de- 
privation means both solidarity with one’s group and a fight 
for more social justice: Both motivations are respectable. If 
oneself is better off than the average of one’s group, one’s 
protest even has a prosocial—not an egotistical—touch 
(Montada, 2001a). 

Relative deprivation theory emphasizes the role of per- 
ceived injustice in comparison to referents in the emergence 
of resentment and assertive actions both in the personal con- 
text or in the political arena. But the theory does not specify 
which standards of justice are applied, nor which referents 
are chosen for comparison by whom—nor does it specify any 
other antecedents. Therefore, relative deprivation theory 
works well as a post hoc framework for interpretation. It is 
less suited to predict resentment and protest. For instance, the 
spectrum of options for choosing comparison referents and 
standards of justice is large, and these choices are motivated 
whether they are deliberated or spontaneous. Those who are 
motivated to avoid or reduce feelings of unjust discrimina- 
tion have the option of downward comparisons. Findings re- 
vealing that the majority of subjects state that they are 
personally better off than the average member of their social 
group demonstrates the motivated nature of this choice 
(Crosby, 1982; S. E. Taylor & Brown, 1988). The majority of 
those belonging to disadvantaged groups tend to avoid com- 
parisons with advantaged groups (Major & Testa, 1988), or to 
underestimate the size of the inequality (Wegener, 1987). 
These and further coping strategies may help them to keep an 
emotional balance by controlling feelings of injustice. 

The question of how an active movement against injustice 
arises has led to many explanatory hypotheses (Major, 1994; 
J. Martin & Murray, 1986). Latent feelings of relative depri- 
vation may be made conscious as a result of public condemna- 
tions of existing discrimination and injustice. Participation in 
public protests may be dependent on a rational calculation of 
the expected personal costs and benefits, on the strength of 
one’s feelings of solidarity with one’s group, and of moral 
obligations to support it. Participation may be triggered 


by outrage. Outrage against group relative deprivation may be 
inflamed by unexpected and noticeably unjust losses and loads 
decreed by those in power (Moore, 1978), especially when 
losses were preceded by upward economic and social devel- 
opment that has set higher standards for the appraisal of the 
present unsatisfactory state (Davies, 1962). The justice princi- 
ple violated is the right to preserve the status quo and to pre- 
serve the present conditions of life and the acquired rights, an 
issue that is referred to very frequently in political disputes. 
Latent feelings of group relative deprivation may flare up as a 
reaction to events of enraging victimization of members of the 
own minority group; such reactions may explode collectively 
in riots, especially when the state authorities violate their 
duties by contributing to the unjust action or by failing to in- 
tervene in ongoing victimization (Lieberson & Silverman, 
1965): The withholding of basic civil rights by representatives 
of the state is especially enraging. 

The cases referred to by Moore (1978) and by Lieberson 
and Silverman (1965) are characterized by an unequal distri- 
bution of power. If the disadvantaged groups do not see the 
possibility to push through their claims by taking legal ac- 
tion, outrage may bring the empowerment to take collective 
action to correct disadvantageous decisions, to change the 
power structure, or to retaliate the victimization. 


JUSTICE IN SOCIAL RELATIONS 


Justice in social relations means justice with respect to the 
exchanges between the members of social systems, the ex- 
changes between social groups, corporations, and organiza- 
tions, and between individuals and institutions—but it also 
concerns the exchanges of casual encounters. 


Forms and Contents of Social Exchanges 


Exchanges are ubiquitous in social life—between individu- 
als, groups, organizations, and states, between individuals 
and groups, individuals and organizations, and so forth. 
Exchanges may be direct, for instance, when two individuals 
express their liking for one another, conclude a contract, or 
attack one another. Exchanges may also be indirect, for in- 
stance, when an individual donates money to a charity that 
provides help to people in need, or when the state collects 
taxes from its citizens, using this income to pay for educa- 
tion, law and order, and so on. Exchange relationships may 
be sequentially chained. For instance, each adult generation 
cares for the welfare and development of the younger gener- 
ation, as well as the welfare of the aged parent generation. 
The next generation will in turn do the same, thus abiding by 
the terms of the generation contract. 


What are the contents of social exchanges? Not only prod- 
ucts and money, but also status (e.g., by a marriage or by 
granting citizenship), commitment to a relationship, atten- 
tiveness, information, services, support, good mood, love, 
loyalty, and—on a less positive note—burdens (such as health 
problems, addictions, and depression), criticism, blame, 
harm, mistrust, and hostility. 


What Is Fair and Just in Social Exchanges? 
Fair Contracts 


The contract is a prototypical form of social exchange. 
Contracts are regarded as just when the partners are equally 
informed and equally free to consent (Nozick, 1974). Justice 
is threatened if relevant information is withheld, if pressure is 
exerted, or if one party is not free to refuse to enter into the 
contract because of a certain predicament. 

Because contracts are of eminent importance in social 
life, many legal norms have been established that specify the 
obligations of the partners to provide all relevant informa- 
tion, to observe the contractual agreements, to respect social 
norms in the contents of the contract, and so forth. A contract 
only has to be fair ex ante: Valid contracts have to be ful- 
filled, even if they turn out to be unfavorable for one party 
because of circumstances beyond the control of the contrac- 
tual partner (e.g., unexpected significant changes in market 
prices). Moreover, specific legal regulations have been es- 
tablished to protect the supposedly less powerful parties— 
with respect to rent control, industrial law, and product 
liability. 


Laws and Social Role Norms 


Legal regulations have also been established for many non- 
contractual relationships, specifying the rights and duties of 
the exchange partners (e.g., married couples, parents and chil- 
dren, administrations and citizens, police and citizens, etc.). 
Most important is, of course, the criminal law (addressed in 
this chapter’s section on retributive justice) containing nega- 
tive exchanges that are banned by law in a society. 

Many exchange relationships are regulated not by laws but 
by informal social norms, for example, by the system of reci- 
procal social roles. Many normative scripts for social roles are 
conventional in character. They prescribe the rights and oblig- 
ations of the actors, and they shape the normative expecta- 
tions held by the actors and by the public with respect to the 
reciprocally related roles. Further regulations are found in the 
behavioral codes of professional associations, in public con- 
ventions regarding politically correct behavior, in the rule 
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systems of sports, in the bylaws of organizations, and so forth. 
These legal and conventional rule systems correspond to and 
are shaping the sense of justice within the population. 


Justice Principles 


Many social exchanges are not subjected to legal or role- 
bound norms but still have to be fair and just. Which stan- 
dards of justice are operating here? Modified forms of the 
equality criterion are standards for appraising the fairness of 
exchange relationships, especially the principles of reciproc- 
ity and equity. Social exchanges are regarded as just if reci- 
procity is established. This is true for positive exchanges, in 
which equal mutual advantage is the normative standard, as 
well as for negative exchanges, as portrayed in the biblical 
rule of “an eye for an eye and a tooth for a tooth.” 

Equal mutual advantages have also been postulated in 
Hobbes’ (1648/1970) social contract conception of the state 
and in the sociological theory of role-bound exchanges 
(teacher-student, physician-patient, employer-employee, 
leader-follower; cf. Parsons, 1951). However, the general 
truth of Parson’s equal—mutual-advantage assumption for 
reciprocal social roles has been criticized with good reason 
(Gouldner, 1960): Some social roles or positions are certainly 
more attractive, powerful, prestigious, and profitable than 
others. And the equal opportunity assumption, which means 
that all citizens have equal opportunities to receive the more 
favorable positions, is illusory. 

Equity has been proposed as the normative standard for 
social exchanges, implying equal ratios of investments/costs 
and outcomes/benefits for all parties involved (Homans, 
1961). Equity is to assess easily only when the investments 
and outcomes are quantifiable—for instance, in money 
equivalents. This is possible for market exchanges. 

The claim that equity is the general justice principle in 
exchange relationships (Walster et al., 1978) is exaggerated. 
One reason is that equity theory is too vaguely defined to be 
applicable in qualitatively and structurally different relation- 
ships. How should the ratio of contributions to benefits be 
calculated, for example, for the employer and the employee, 
the mother and the child, the players in a tennis match, or the 
victim of an accident and his or her rescuer? All actors in 
these relationships have obligations or entitlements, but they 
are not entitled an equal ratio of contributions to benefits. The 
entitlements are based on contracts, on particular social and 
moral norms, or on the norms of fair play. Every citizen main- 
tains an exchange relationship with the state. The citizens 
and the state (in the same way as the insured and their insur- 
ance companies) have reciprocal rights and obligations that 
cannot be represented in the equity formula. Nevertheless, the 
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relationships can be evaluated as just if all parties meet their 
obligations. 

The equity principle is less precise than it was contended 
to be in more informal social exchanges, in which its assess- 
ment is based on subjectively focused and subjectively 
valued exchanges (of goods, services, love, respect, trust, 
loyalty, harm, negligence, hostilities, etc.), and indirect and 
chained exchanges (e.g., services to third parties or to the 
community that are of indirect benefit to the exchange part- 
ner) may or may not be included in subjective evaluations. 
Nevertheless, it has been proposed as the justice principle in 
social relations, even in close personal relationships. As it is 
worthwhile to have a closer look at close relationships this 
issue will be taken up in a later section. 


Implicit Contracts Within Social Relationships 


Not every aspect of a relationship can be explicitly articu- 
lated in a contract. For instance, employment contracts imply 
that employees use working time for the employer and not for 
private interests, that they are conscientious, that they do not 
misuse the contract by spying for a competitor, and so on. In 
return, the employer does not require employees to perform 
inappropriate tasks (e.g., those that are below the employee’s 
qualification level) and also takes care to ensure a safe work- 
ing environment. 

Moreover, existing, practiced relationships create new, 
contract-like expectations. For instance, significant changes 
in the task structure that was practiced for a long time by an 
employee cannot simply be assigned by the employer to the 
employee; rather, they have to be negotiated. People believe 
that employers have obligations to their current employees 
but not to those requesting employment (Kahneman, 
Knetsch, & Thaler, 1986). There is a widespread view that se- 
niority (defined here as the length of continuous employment 
with an employer) is an important factor protecting against 
layoff in the case of workforce reduction (Elster, 1992; 
Engelstad, 1998). The period of notice that tenants must be 
given is dependent on the length of the tenancy; in many 
countries, this issue is regulated by law. People are expected 
to keep to existing exchange relationships, even if the pre- 
vailing market situation would allow one party to make more 
profit elsewhere. Rousseau has investigated the implicit psy- 
chological contracts (and the entitlements derived from 
them) that are built up in ongoing relationships (Rousseau & 
Anton, 1988; Rousseau & Parks, 1993). Respectful treat- 
ment, meaningful work, and a safe working environment 
constitute important parts of these implicit contracts. 

It may well be that the perceived quality of an existing 
relationship shapes the expectations of what would be an 


appropriate reciprocal treatment. And these expectations 
have a normative character; violations may be valuated as un- 
deserved. One limit, however, is that people, as mentioned 
previously, consider different justice rules as appropriate 
depending on the kind of the relationship (e.g., equal distrib- 
ution or distribution according to needs within close rela- 
tionships but equitable distributions in market exchange). 
Relationship issues are also an important topic of procedural 
justice research (Tyler et al., 1997) and are addressed later in 
this chapter. The way people are treated by authorities—as 
representatives of social systems, communities, or social 
groups—is informative with respect to their social status. If 
the treatment is not felt to be in accordance with their subjec- 
tive entitlements, it is considered unjust. These entitlements 
are part of implicit psychological contracts. 


Entitlement to Respectful Treatment 


One aspect of social interactions has attracted much attention 
in justice research—respect. Miller (2001) has recently pro- 
vided an excellent review. A few examples are mentioned 
here. Lind and Tyler (1988) have stressed the eminent impor- 
tance of respectful and decent treatment in their group value 
theory of procedural justice. Mikula (1986) studied unjust ex- 
periences of students in daily life and found frequently men- 
tioned unjustified accusation and blame, the giving of orders 
in an inappropriate form, and ruthless misuses of status and 
power (see also Clayton, 1992). Bies and Tripp (1996) found 
that humiliation and wrongful accusation by superiors were 
instances of reported injustices. Insult and disrespectful treat- 
ment have been identified as powerful instigators of resent- 
ment and aggression (R. A. Baron, 1993; Bettencourt & 
Miller, 1996; Folger & Skarlicki, 1998; Heider, 1958). 
People in general seem to expect respect from others in social 
interactions, and they seem to feel entitled to respectful treat- 
ment and interaction. The right of being treated with dignity 
is the first of the human rights. However, what respect and 
disrespect mean in concrete social encounters may vary a lot 
between individuals, settings, kinds of relationships, social 
groups, subcultures, and cultures. Nevertheless, what has 
been named interactional justice (Bies & Moag, 1986) seems 
to be agreed upon in psychological contracts about a code of 
conduct (Robinson, Kraatz, & Rousseau, 1994; Rousseau, 
1995) defining what is and what is not acceptable in a rela- 
tionship. As the code of conduct frequently is neither explic- 
itly articulated nor negotiated between interaction partners, 
the normative expectations have to be derived from observed 
resentment and reproaches, from aggressive responses, and 
from further behavioral manifestations of feeling vio- 
lated (e.g., withdrawal, reduced commitment, cf. Miller, 


2001). These responses function also as a measures to edu- 
cate the offender whose apologies and remorse are healing 
the relationship (Ohbuchi, Kameda, & Agarie, 1989; 
Montada & Kirchhoff, 2000). The range of normative expec- 
tations may encompass the acknowledgement of expertise, 
performances and efforts, granted supports, loyalty, consider- 
ation of one’s preferences, sympathies, aversions, fears, 
handicaps, and vulnerabilities: All these have to be respected. 
Moreover, codes of politically correct behavior and language 
that have to be respected have gained much attention. 

Disrespect is experienced as an offence to the personal 
and social identity. Some of the victims’ responses to disre- 
spect are meant as a defense or restoration of their violated 
self (Vidmar, 2000). In case of public disrespect, the aim of 
responses may be to restore social status in the eyes of others. 
Violent acts may have these functions to restore the self- 
esteem and social status (Megargee & Bohn, 1979; Toch, 
1969; for a review, see Streng, 1995)—evening the score by 
retaliation (Greenberg & Scott, 1996). 

The motive to even the score may also be given in re- 
sponses like reduction of commitment at the workplace or in 
close relationships, reduction of trust, and silent or explicit 
rejection of proposals. The positive effects of respectful treat- 
ment by authorities on the acceptance of their decisions and 
on generalized trust in them (Lind & Tyler, 1988) mirror this 
hypothesis in positive terms. 


Justice Within Close Relationships 


“The rule most frequently advocated as the rule governing all 
relationships, including intimate ones, is equity” (Clark & 
Chrisman, 1994, p. 17). Participants are expected to be more 
satisfied with the relationship, which is in turn expected to be 
more stable when equity is realized. Participants strive to 
make inequitable relationships equitable by changing their 
contributions, their expected outcomes, or both by requesting 
change in the contributions made by their partners or by reap- 
praising their own or their partners’ contributions and out- 
comes (Walster et al., 1978). Because equity is in the eye of 
the beholder, so to speak, reappraisals may be functional for 
establishing subjective equity. 

It is indisputable that exchanges can be balanced on vari- 
ous dimensions (e.g., those emphasized by Foa and Foa 
(1980)—love, status, money, material goods, services, and 
information): Information can be compensated by money, 
services by love, and so on. In our culture, parents typically 
do not expect that their investments into their children’s care, 
development, and education would be reciprocated by the 
children. Instead, they feel more than compensated when they 
are loved by their children. To assess whether a relationship is 
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considered just, one has to look at the balances that are actu- 
ally made. The global measure of equity generally used in 
research—that is, asking subjects what they contribute to a 
relationship and what they get out of it—relative to their part- 
ner(s) is unsatisfactory. The precise balances have to be spec- 
ified if we are to learn how appraisals of equity and inequity 
are generated. 

One major problem is the validity of global measures of 
equity. Respondents who do not really balance contributions 
and outcomes for themselves and for their partners may use 
the terms equity and justice synonymously. Detailed mea- 
sures across numerous exchange dimensions—which may 
additionally be weighted according to personal importance— 
are possible, however (e.g., Lujansky & Mikula, 1983; Van 
Yperen & Buunk, 1994). The correlations between detailed 
and global measures are generally modest or near zero 
(Sprecher & Schwartz, 1994). Thus, using global measures of 
equity does not really clarify which justice standards are 
actually used by respondents. 

Nevertheless, in accordance with hypotheses derived from 
equity theory, some studies have found that not only respon- 
dents who feel deprived compared to their partner but also 
those who feel advantaged are less satisfied with their rela- 
tionship than are respondents who perceive their relationship 
as equitable (Buunk & van Yperen, 1991). The effects of eq- 
uity ratings on satisfaction in the partnership and the stability 
of the relationship are, however, generally weak or nonexis- 
tent (Sprecher & Schwartz, 1994). 

Thus, we do not have robust evidence in favor of the equity 
model in close relationships, such as those between family 
members, in intimate partnerships, and best friendships. Re- 
search about justice in close relationships is reported and re- 
viewed in a volume edited by Lerner and Mikula (1994) and a 
special issue of Social Justice Research (Vol. 11, 3) edited by 
Mikula (1998). One might question whether justice actually 
matters at all in these kinds of relationships, which ideally are 
characterized by mutual love, trust, and caring. However, as 
Desmarais and Lerner (1994) argue, the degree of “closeness” 
is not the same at all times and for all parties, and it may vary 
from an identity relationship (in which the parties’ identities 
are merged), to a unit relationship (in which equal but inde- 
pendent partners cooperate), and even to a nonunit relation- 
ship (in which the parties compete with one another). 
According to Desmarais and Lerner (1994), strong effects of 
equity ratings on satisfaction are not to be expected within 
identity relationships—“where meeting a partners’ needs is 
most likely to create harmonious relations, while equal and 
reciprocal treatment may be alienating in close relationships” 
(p. 45)—in which the partners are not looking for long-term 
reciprocity. In a study with married couples, Desmarais and 
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Lerner found that, for respondents who believe that they are 
in an identity relation with their partner, satisfaction with the 
relationship correlates higher with the partner’s outcomes 
than with the respondent’s own outcomes. 

Lerner’s distinction of an identity relationship from a unit 
and a nonunit relationship (Lerner & Whitehead, 1980) corre- 
sponds to Clark and Mills’ distinction between the communal- 
versus the exchange-norm orientation in intimate partnerships 
(1979). The communal-norm orientation means feeling 
responsible for and being responsive to the other’s needs 
without expecting repayment. It is satisfying enough to meet 
the other’s needs. The partner’s outcomes are no less impor- 
tant than one’s own outcomes; on the contrary, they take 
precedence. Extending the authors’ argumentation slightly, 
the following could be stated: Whereas with exchange-norm 
orientations, outcomes (benefits) are balanced against inputs 
(costs), in communal-norm orientations the ratio between out- 
comes and costs is not decisive because one’s own inputs are 
viewed not as costs, but rather as welcome opportunities to 
meet the partner’s needs. Any ensuing rewards are interpreted 
not as profits on one’s own investments, but rather as an 
expression of the partner’s affection and love. 

However, conflicts are not unusual in close relationships, 
and because these conflicts are essentially justice conflicts, it 
is worthwhile to examine which justice principles are applied 
when conflicts occur. As long as an identity relation exists, all 
investments, all self-sacrifices, and all burdens are not bal- 
anced with one’s outcomes, but the balances may be made 
when the partner withdraws his or her love (Montada & Kals, 
2001). 

Research by Cate and colleagues (Cate, Lloyd, Henton, & 
Larson, 1982; Cate, Lloyd, & Henton, 1985) and Desmarais 
and Lerner (1989) has shown that the level of received re- 
wards (e.g., in the six resource areas of love, status, services, 
goods, money, and information) predicts relationship satis- 
faction better than does global equity (and equality as well), 
regardless of whether the relationship is traditional or modern 
in terms of gender-role orientation (M. W. Martin, 1985). Do 
these findings mean that self-interest is the dominant motive 
in close relationships? To answer this question, the partner’s 
rewards (stemming from the respondents’ own responses to 
their partner’s needs) also need to be assessed. It may well be 
that those relationships that both partners experience as re- 
warding are satisfying. One kind of empirical findings seems 
to support this interpretation: Rusbult (1987) and Hays 
(1985) found that own rewards minus costs (investments) 
predict relationship success less well than own rewards plus 
costs (which may mean the partner’s rewards). The latter 
index may reflect the mutual responsiveness of the partners to 
each other’s needs (Clark & Chrisman, 1994). 


An exchange orientation implies the normative expectation 
that one’s own investments (definable as the material, social, 
and personality resources one brings into the relationship) will 
be repaid or yield profits—if not immediately, then in the long 
run. This economic view of close relationships would suggest 
that partners keep track of their own investments and out- 
comes and—in the equity version of the model—of their part- 
ner’s as well. The few studies investigating this aspect have 
found that respondents who desire or already have communal 
relationships do not tend to keep track of the respective invest- 
ments and outcomes (Clark & Chrisman, 1994). Furthermore, 
as shown by Grote and Clark (1998), communal relationships 
represent the widely preferred ideal for partnerships. In the 
same study, these authors found the perceived fairness of the 
distribution of housework and child-care responsibilities to 
be positively related to adherence to communal norms in the 
partnership (for women) and negatively related to an adher- 
ence to exchange norms (for men and women). 

However, it is open to question whether the responsive- 
ness to the other’s needs, which is typical for a communal 
orientation, is motivated by love, sympathy, altruism, or jus- 
tice. The justice motive implies an awareness of the partner’s 
entitlements and of one’s own perceived obligations. Love 
and sympathy, same as altruism, may motivate to satisfy the 
other’s desires and needs without conceiving these as entitle- 
ments and without feeling obliged to do that. In social rela- 
tionships, applying the need principle of justice, feeling 
sympathy with the needs of the loved one, and being altruis- 
tic may motivate an actor to choose the same behavioral com- 
mitments. Nevertheless, the justice motive and sympathy, 
altruism, and love are distinct motives, and conflicts between 
them may occur or be induced (cf. Batson, 1996). The behav- 
ioral commitments are not informative with respect to the 
question how they are motivated. We need valid assessments 
of the actors’ motives behind their responses to the needs of a 
loved one. To assume a justice motive, one has, at least, to 
ask respondents explicitly about the others’ entitlements or 
their deservingness. Another approach is to observe or ask 
respondents about their emotions that imply perceived own 
violations of justice norms, namely feelings of guilt or of 
indebtedness—and to explore the justice appraisals assumed 
to be necessary components of these emotions. 

The justice motive may also be inferred from resentment 
of the partner. Freudenthaler and Mikula (1998) offer an ex- 
ample of this approach with their investigation of women’s 
sense of injustice regarding the unbalanced division of 
housework. In their interview study with employed women 
living in a partnership, perceived violations of entitlement are 
predicted by four variables related to household chores: un- 
fulfilled wants, social comparisons of their partner’s with 


other men’s commitments, normative social expectations of 
partner’s commitments, and lack of justification as to why 
their partner is contributing less than expected. Perceived 
violations of entitlement together with attributions of respon- 
sibility to the partner and lack of justification predicted 53% 
of the criterion variable blaming the partner. This set of vari- 
ables includes the key variables for the perception of injus- 
tice: entitlements on the basis of some justice norm and 
violation of these entitlements by a responsible actor who 
does not have convincing justification (cf. Lerner, 1987; 
Montada, 1991). To predict perceptions of violated entitle- 
ments, it is necessary to assess the justice norms applied by 
the individual. 

Another example based on a similar conceptual model of- 
fers a questionnaire study conducted by Reichle (1996) about 
losses and restrictions experienced by spouses after the birth 
of their first child. Anger (resentment) toward the spouse was 
predicted by attributing responsibility for one’s own losses 
and restrictions to the spouse, which was moderated by the 
perceived injustice of the losses and a negative balance 
of perceived gains and losses (explained variance = 74%). 
Marital dissatisfaction was explained by anger toward the 
spouse, extent of the experienced losses, number of losses at- 
tributed to the spouse, and attribution of responsibility for the 
losses to the spouse (explained variance = 77%). Within this 
sample, strong preferences for using the equality and the 
need principle in distributions of tasks, opportunities, and 
restrictions (in contrast to gender specific traditional norms) 
and strong preferences for negotiations as the just way to 
proceed in cases of disagreement were observed (Reichle & 
Gefke, 1998). 


The Effects of Social Exchanges on Third Parties 


Assessing the justice of social exchange relationships would 
be incomplete without examining the effects of exchanges 
on third parties. Adverse effects of exchange relationships on 
third parties raise justice problems—but to my knowledge, 
they have not yet been an object of psychological research. 

Contracts that are fair to the contractual parties may incur 
serious disadvantages for others. For instance, cartel con- 
tracts may be fair for the contract parties, but they are made at 
the expense of others. Exclusive contracts of sale put other 
suppliers at a disadvantage. Granting government subsidies 
to a big company in financial trouble may be viewed as fair by 
its employees but as unfair by its competitors. Labor contracts 
between employers’ organizations and unions may be viewed 
as a fair distribution of profits but may cause rationalization 
measures leading to the dismissal of part of the workforce, or 
they may prevent the expansion of the workforce, which 
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would have provided jobs for the unemployed. Even in close 
relationships, adverse effects on third parties are not unusual. 
Parents may enjoy the loyal support of their partner in cases 
of conflict with their adolescent child who in turn considers 
this loyalty to be a coalition at his or her own cost. 

Therefore, it is adequate to expand the view from the 
directly concerned exchange parties and to examine the con- 
sequences for others and for the social system. 


THE JUSTICE OF RETRIBUTIONS 


The phrase retributive justice normally means the justice of 
retribution for crimes and negligence, as well as the justice 
of compensations for caused damages and harm. However, 
special achievements (intellectual, artistic, moral, etc.), 
especially those that go beyond the call of duty, are also to be 
repaid or acknowledged. Because the justice of acknowl- 
edgement of special achievements has received little research 
attention thus far, it is only mentioned here; the retribution 
for crimes and negligence is treated in more detail. 


Just Retribution and Punishment 


Though just retribution for crimes is anchored in criminal 
law, in criminal justice proceedings, and in precedents in 
court judgments, empirical social and behavioral sciences 
can contribute significant insights, because the sense of jus- 
tice held by the general public does not perfectly coincide 
with that reflected by the legal code and court decisions. 

What is just retribution for the violation of criminal law? 
A first answer could be reparation for the damages caused, 
analogous to the equity principle of social exchanges. Such 
regulations exist in the civil code, as can be seen in liability 
laws: The defective product must be replaced; the damage 
must be compensated; the price must be reduced if services 
are insufficient; the caused secondary costs (expenses, oppor- 
tunity costs) are to be balanced out. Compensation for dam- 
ages 1s not a prevalent goal in criminal justice. 

For most people’s sense of justice, equitable compensa- 
tion would be an inadequate atonement for a crime. This is 
true not only for crimes which have caused irreversible 
losses: How could, for example, the murder of a person, per- 
manent health impairments following a physical injury, or 
psychological damage caused by terror or humiliation be 
compensated? This is generally true because the violation of 
criminal law means a violation of the moral consensus with- 
out which the social community cannot survive (Miller & 
Vidmar, 1981). Regardless of the amount of damage, this in- 
jury to the moral code of society demands punishment. The 
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violation of the basic values of the community, those which 
make up its identity and its self-esteem, as well as violation 
of the sacred symbols of a society (e.g., its religious tenets, its 
flag) demand retribution. The primary purpose of retribution 
is atonement. 

The rational-choice model also offers an explanation of 
why compensation alone is insufficient. If the bank robber 
could compensate the crime simply by returning the money, 
there would be no deterrent: He or she could attempt it again 
and again without risk, in the hopes of one day not being 
caught. Therefore, retribution must be independent of com- 
pensation for damages. Alongside atonement and compensa- 
tion for damages, a third purpose in legal punishment is 
deterrence, which is supposed to have an effect on the perpe- 
trator as well as on the public at large. Additional functions of 
legal punishment are considered: building up or reinforcing a 
sense of justice in the population, resocializing the perpetra- 
tor, and protecting the community by incapacitation of the 
offender (e.g., by imprisonment or death penalty). 

Analyzing the valuations of the various functions of legal 
punishment, Vidmar & Miller (1980) identified two main mo- 
tives: a controlling and a retaliation motive. The retaliation 
motive emphasizes the perpetrator’s guilt and the severity of 
the crime. The function of atonement is placed in the fore- 
ground. The controlling motive aims at keeping the criminal 
in check and protecting society. Of importance are the dan- 
gerousness of the criminal and the degree of harmfulness of 
the crime. Miller and Vidmar also differentiate the motives 
according to whether they are directed at the perpetrator or at 
a third party: A motive for retaliation can also be to dampen 
public outcry. We should take this psychological analysis a 
step further and distinguish between deterring the perpetra- 
tors and resocializing them because different measures seem 
to be appropriate for those two goals. 

It is remarkable that the victims of crimes are only mar- 
ginally considered in the functions of retribution, if at all. In 
fact, in the historical development of criminal justice in the 
modern age, the main focus was protecting society and guar- 
anteeing fairness to the perpetrator, not to providing justice to 
the victim. For a long time, the victim played exclusively the 
role of a witness in criminal proceedings. The witness does 
not have a powerful role: He or she does not control the pro- 
cedure, and his or her credibility and reputation may be 
doubted. Only recently have victims had the right to function 
as joint plaintiffs in such crimes as rape and bodily injury, 
thereby gaining a bit more control in the course of the trial; 
this should be relevant for their assessment of procedural 
justice, according to hypotheses forwarded by Thibault and 
Walker (1975). Whether the victim received justice was of 
no concern to the criminal justice system. Victimology has 


finally taken the rights of victims seriously—for example, 
their need for acknowledgement of their status as victims, 
which will be documented by the conviction of the perpetra- 
tor (Fischer, Becker-Fischer, & Diichting, 1998). 

The various goals of legal punishment may conflict with 
one another and thus create their own problems of justice: Is 
the retributive sentencing of a young offender for robbery with 
grievous bodily harm to the victim just, or is a mild sentence 
with an attempt at his resocialization more just? When general 
deterrence is the objection, the mitigating factors associated 
with the individual offense may easily be overlooked. 


Perpetrator’s Responsibility and Blameworthiness 


In criminal law the severity of the crime is the first decisive 
factor in determining the degree of guilt and blameworthiness 
of the offender (recognizable by the range of penalties for a 
crime). Moreover, mitigating and aggravating circumstances 
are considered. Special importance is given to the assessment 
of the offender’s responsibility for the criminal act and to po- 
tential justifications of this act. This practice coincides with 
the sense of retributive justice in the public. 

A guilty verdict presupposes the attribution of responsibil- 
ity to the perpetrator. The assessment of the defendant’s 
responsibility is a core problem in jurisprudence. Defendants 
may use any of eight arguments to deny or to diminish their 
responsibility for their behavior and its consequences 
(cf. Hamilton & Hagiwara, 1992; Heider, 1958; Montada, 
2001b; Semin & Manstead, 1983): 


1. Denial of agency. Persons may deny that their behavior 
was under their voluntary control. Reasons given for the 
lack of volitional control include lack of competence, fa- 
tigue, external influences, effects of drugs, intense affects, 
and so on. In the courtroom, for instance, insanity, intense 
emotional states, or being under the influence of drugs 
are sometimes accepted as factors that exclude or reduce 
actors’ responsibility for their deeds. 


2. Lack of foreseeability of consequences. Persons may deny 
that the consequences of their actions could have been 
foreseen. 


3. Lack of intent. Persons may deny that the negative effects 
of their actions were intentional; this may lower the de- 
gree of responsibility assigned to them, but does not free 
them of responsibility in every case (Heider, 1958). 
The person may be judged not to be malevolent, but he or 
she may still be blamed for carelessness. 


4 


Assigning coresponsibility to others. Persons may try to 
reduce their own responsibility by attributing responsibil- 
ity to coactors. 


5. Displacing responsibility. Persons may deny their own re- 
sponsibility by ascribing the responsibility for their actions 
or omissions to others, asserting that they were seduced, 
persuaded, misinformed, or forced by these third parties. A 
special subcategory of this type is the displacement of re- 
sponsibility to authorities who have given orders for the 
action to be taken. 


6. Mental retardation and developmental immaturity. 


7. Lack of adequate socialization and education. Disadvan- 
taged childhood and adolescence, deficient socialization, 
and a lack of education are arguments frequently used 
by lawyers to deny the defendant’s responsibility, because 
these factors are widely believed to cause behavior 
disorders. 


8. Denial of having caused damages or harm. Even if the 
person does not deny responsibility for an action or omis- 
sion, he or she may deny the presumed effects of that 
action, either by doubting the existence of any harm or 
damage or by denying that harm or damage was caused by 
the action or omission in question. 


Apart from arguments to deny or diminish responsibility, 
blameworthiness can be reduced by justifications of one’s 
action or omission. Justifications do not deny responsibility. 
Rather, they offer reasons such as the following that are 
expected to reduce their blameworthiness and liability for 
compensation: 


1. Persons may make reference to their benevolent intent or 
to the positive effects or benefits of their actions or omis- 
sions. They may claim that any negative effects are bal- 
anced out by positive ones. 


2. Persons may assert that the victim had been informed and 
had consented (e.g., to participate in a high-risk medical 
research program or to engage in sexual contact). 


3. Persons may legitimize their action as a just retaliation or 
punishment. The victim is viewed as an offender who 
deserves to be punished. 


4. Persons may derogate the victim as being inferior or dan- 
gerous, and may assert that their own actions are appro- 
priate for this kind of person (cf. Bandura, 1990, on 
dehumanization of victims). 

5. Persons may justify their actions by referring to legitimate 
self-interests, whether personal or communal. 

6. Persons may legitimize their actions by referring to their 
normative obligations or higher-order values (e.g., group 
norms, obligations of obedience, religious norms). 

7. Persons may interpret their action as defense of their rep- 
utation (e.g., their face, so to speak). 
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8. Persons may legitimize their action by referring to con- 
sensus information, either that “most people act or would 
have acted the same way” or that “most people approve of 
the legitimacy of the action.” 


Arguments to deny or diminish responsibility on the one 
hand and justifying arguments on the other may be presented 
by defendants (cf. Sykes & Matza, 1957, on the defense pleas 
of criminals in general; Deegener, 1997, on men charged with 
sexual child abuse), or by the defendants’ representatives. 
Even victims trying to cope with intense outrage and hatred 
against the offender may use such arguments to calm down 
their emotions. 

There is much empirical proof suggesting that the puni- 
tiveness of victims as well as that of observers who were not 
directly affected is dependent on the attributed level of 
offenders’ responsibility; this also holds true outside the 
courtroom. 

In people’s sense of justice, responsibility and justifica- 
tions do have much weight (Burnstein & Worchel, 1969; 
Kolik & Brown, 1979; Shklar, 1990; Zillmann & Cantor, 
1977; and others). In a vignette study, Schmitt, Hoser, and 
Schwenkmezger (1991) distinguished six grades of responsi- 
bility for an injury: (a) intended injury; (b) awareness and ac- 
ceptance of possible injury; (c) careless action; (d) impulsive 
action, forced action, or unforeseeable effects; (e) benevolent 
action performed in a clumsy manner; and (f) behavior not 
under volitional control. They found a high correlation be- 
tween these levels of responsibility and the mean state-anger 
scores of respondents taking the perspective of victims in the 
vignettes. The perceived blameworthiness of actors may be in- 
ferred from these state-anger scores. Montada and Kirchhoff 
(2000) have demonstrated that credible justifications substan- 
tially reduce victims’ anger at the offender, as well as the 
degree of punishment the victims regard as appropriate. 

Retributive justice requires a valid assessment of the of- 
fender’s responsibility and blameworthiness. The offender’s 
own views of his or her blameworthiness must be given a 
careful hearing. Giving voice to the offender is a central prin- 
ciple of procedural justice; yet other’s views and information 
by others also need to be included in the decision-making 
process. 


Blameworthiness, Apologies, and Retribution 


According to the concept of retribution as atonement, a 
penalty is just when it is proportionate to the degree of guilt. 
The penalty is reduced, however, if sincere apologies by the 
offender are offered. Goffman (1971) named the following 
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components for honest apologies in a private context: The 
perpetrator must (a) express remorse and emotional distress 
because he or she has violated a legal and moral norm and 
has harmed the victim; (b) he or she accepts responsibility 
for the violation and liability for blame; (c) he or she credi- 
bly expresses willingness to observe the moral rule in the 
future; and (d) acknowledges that it is up to the victim to 
accept or to refuse the apology, and that forgiveness is a 
grace granted by the victim which cannot be claimed by the 
perpetrator. 

It has been empirically proven that sincere apologies rec- 
oncile victims and judges (as well as observers not directly 
involved) and reduce their desire for retribution and their 
punitiveness (Miller & Vidmar, 1981; Montada & Kirchhoff, 
2000). Goffman (1971) explained this phenomenon as fol- 
lows: By showing remorse, the perpetrator accepts the valid- 
ity of the violated norm, accepts his or her own guilt and 
blameworthiness, brings him- or herself once again back to 
the normative consensus of the community, and confirms the 
views of the victim. Therefore, apologies attenuate the ret- 
ributive counteraggression of victims as well as of the puni- 
tiveness of the broader public (Ohbuchi et al., 1989). The 
perpetrator’s attempt at reparation has similar effects (Darley 
& Shultz, 1990). Some courts also reduce the penalties if in 
perpetrator-victim compensation an agreement was reached 
(Rössner, 1998). 


Victims’ Need for Retribution 


Whereas the assessment of appropriate penalties has been 
widely studied, only a few studies exist of the victim’s needs 
for retribution. It is known, however, that for the victims of 
violent crimes, retributive reactions by the state are more im- 
portant than are reparations (Baurmann & Schädler, 1991; 
Pfeiffer, 1993). The large majority of victims assess sen- 
tences as too mild in their own cases (Richter, 1997). 

A study of victims of violent crimes (rape, physical injury, 
attempted murder, robbery, kidnapping) done by Orth 
(2000)—on the average 4 years after the crime and 2 years 
after the court trial—pointed out that two thirds of the sample 
were dissatisfied with the court’s judgment; and those who 
were dissatisfied tended to react with pronounced feelings of 
indignation, disappointment, helplessness, mistrust in the 
legal system, a diminished belief in a just world, reduced 
self-esteem, and reduced trust in the future. The effects of 
dissatisfaction, however, were moderated by victims’ ap- 
praisal of the proceedings as just. Procedural justice was as- 
sessed by some of Leventhal’s (1980) criteria and by the 
relational criteria proposed by Lind and Tyler (1988) that will 
be discussed in a following section. 


THE JUSTICE OF SOCIAL SYSTEMS 
AND POLITICS 


The justice of allocations and existing distributions in general 
can be assessed for different levels, for instance, within pri- 
mary groups, within casual social formations as realized in 
most laboratory experiments, within organizations and insti- 
tutions (Elster, 1992, speaks of local justice), at the level of 
the society (Brickman, Folger, Goode, & Schul, 1981, speak 
of macrojustice), and at the international level. 

It is important to note that for assessing the justice of allo- 
cations and distributions, one needs to determine social bor- 
derlines that specify who is principally entitled to receive a 
share of the resources to distribute, who is obligated to bear a 
share of the tasks and loads to be allocated, and so on (Cohen, 
1986). However, the determined borderlines may be criti- 
cized as unjust. Should the inheritance of a deceased person 
who has not made his or her will be distributed only among 
his or her descendants, or should those persons be included 
who have self-sacrificingly cared for him or her for years? 
Should the profits of a business be distributed exclusively to 
the shareholders or should the stakeholders participate? Who 
has to bear the losses? Should the tax revenues of rich states 
be distributed exclusively within the state or should develop- 
ing states also participate? 

Because the constitution, the legal system, and the institu- 
tions of a society have an impact on distributions, these compo- 
nents of the societal system are also the objects of justice 
appraisals—for example, the economic system, the labor laws, 
the health and welfare system, the educational system, envi- 
ronmental protection laws, immigration rules, the generation 
contract; all of these may be valued as basically just or may be 
criticized as unjust, as well as the politics responsible for their 
implementation and adaptation. Constitutions may be criti- 
cized for failing to guarantee human rights, the protection of the 
environment, or animal welfare; the economic and tax systems 
may be criticized for allowing the development of huge in- 
equalities in wealth or for demotivating individual productiv- 
ity; the educational system may be blamed for failing to provide 
equal opportunities for the socially disadvantaged or failing to 
meet the needs of gifted students. Justice arguments can be 
found to support or censure any given policy, and all parties 
concerned usually seem convinced that their view is the only 
valid one; this is how normative standards are conceived—as 
generally valid and as binding for all concerned. 


Imbalances of Justice at the Societal Level 


Not every claim, not every evaluation of social conditions, 
and not every protest against injustice is justified. Claims and 


protests are often based on a one-sided subjective concept of 
justice: Unbalanced claims—for the preservation of the sta- 
tus quo, of freedom rights, of the equality principle, of the 
need or the equity principle, and so on—lead to unjust solu- 
tions because all other principles of justice have been vio- 
lated. This idea may be illustrated by a few case examples 
that are currently disputed. 


Affirmative Action 


Not seldom, efforts to correct one injustice result in the cre- 
ation of another. A prominent example is the affirmative action 
taken to correct the indisputable historical disadvantages of 
women in the labor market, problems that still persist to this 
day. For instance, women are still underrepresented in top- 
level positions. Interpreting this fact as unjust discrimination 
against women, various types of affirmative action have been 
taken to give preference to female applicants over their male 
competitors. The justice problem with this policy is that it aims 
to correct a historical problem by reversing gender privileges 
in today’s generation of students and young professionals. 
Today’s young women have certainly suffered fewer injustices 
in education and in the labor market than have earlier genera- 
tions of women, and today’s young men are less privileged 
than were their male predecessors. Therefore, it is problematic 
for these historical injustices to be corrected by affirmative ac- 
tion’s affecting only the present generation of young women 
and men, with no impact on preceding generations. The ques- 
tion is whether it is just that the costs of this measure have to be 
borne exclusively by young men. Moreover, justice—at least 
in terms of equity—is more frequently valuated at the level of 
individuals, not at the level of groups or collectives. Therefore, 
as D. M. Taylor and Moghaddam (1994) argue, many people 
have justice problems with affirmative action policies. 

This approach to correcting a historical injustice is based 
on an implicit assumption advanced by the new feminist 
movement—namely, that the gender categories male and 
female were in fact social groups. Would it not be a mistake to 
believe that the whole group of women will vicariously partic- 
ipate in and profit from the success of some—mostly already 
privileged—young women? Women are not a social group, 
but rather a social category. This distinction is psychologically 
an important one (cf. Griffith, Parker, & Törnblom, 1993). 
Individually, most women are members of gender-mixed so- 
cial groups such as families, where they are bound not only to 
daughters, sisters, and mothers, but also to sons, husbands, 
brothers, and fathers. They may be proud of the careers of the 
male as well as ofthe female members of their family. 

This is not to deny the continuing existence of unjust 
gender inequalities in the labor markets. Unjust gender 
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inequalities are recognized by many of those who doubt that 
the concrete policy of affirmative action will result in new in- 
justices. This is also true for affirmative action in the realm of 
ethnic inequalities in education and in the job market (cf., for 
instance, Bobocel, SonHing, Holmvoll, & Zanna, 2002). 


Environmental Protection Policies 


There is little doubt that current and future environmental 
pollution entails not only health risks, but also the risk of cli- 
mate changes with incalculable consequences. There is also 
little doubt that the current pollution of air, soil, and water en- 
tails gross injustices. Because pollution is a side effect of pro- 
ductive processes in agriculture, industry, and business, and 
of traffic, air conditioning, and so on, the justice concern is 
about the allocation of the profits, costs, and risks resulting 
from these processes and activities. Some people and some 
populations benefit more than do others from these processes, 
and those who profit most are unlikely to be those who have 
most to lose from the risks and disadvantages of pollution (cf. 
the concept of environmental racism, Clayton, 1996). 

The costs of pollution are still typically externalized—that 
is to say, the costs and risks are mainly borne not by those 
who have caused them, nor by those who have most to gain 
from the processes in question. The externalization of costs 
bears always the risk of an injustice. One of the policies for 
reducing pollution injustices is the internalization of the costs 
by means of pollution taxes. However, pollution taxes may 
result in new justice problems. 

First, there is no guarantee that the taxes raised will be 
used to compensate the victims of pollution. In fact, not even 
all of the victims of pollution can be identified because its 
harmful effects are long lasting, interact with many other fac- 
tors, and are cumulative. Furthermore, air and water pollution 
extend across regional and state boundaries, and such pollu- 
tion may involve delayed risks for the future when coming 
generations may be affected by the greenhouse effect. 

Second, pollution taxes may intensify social inequalities. 
They may be ruinous for poorer firms, but not for richer ones; 
they may be prohibitive for poorer car owners, but not for 
more affluent drivers who can easily afford to pay the new 
taxes. The result may be the bankruptcy of some firms, the 
consequent dismissal of employees, and the aggravation of 
social inequalities (Montada & Kals, 2000; Russell, 2000). 
Looking at the branched systemic effects of every interven- 
tion, it is not easy to avoid new injustices. 


Justice for the Defendant Versus Justice for the Victim 


The principle of giving defendants the benefit of the doubt 
is uncontested in criminal law, but the practice violates the 
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victims’ claims to retribution, at least in the case of those vic- 
tims who have no doubts as to the guilt of the defendant. 
Many such victims experience an acquittal or a light sentence 
by the judges as a secondary victimization (Orth, 2000) after 
the state was not able to protect them from primary victim- 
ization by the crime. 


Underserved Welfare Benefits 


The fact that an ostensibly indolent unemployed father prof- 
its from the generous welfare benefits accorded to his family 
is unjust. It may be justified, however, by the constitutional 
guarantee of equal opportunity for all citizens: The children’s 
opportunities have to be safeguarded, and they are not re- 
sponsible for their father’s unemployment. Furthermore, the 
economic system does not allow the father to be forcibly 
employed. The assessment of justice requires looking at the 
social system as a whole, and (frequently) considering which 
of several injustices is to be less tolerable. 


State Subsidies to Business 


Subsidizing uncompetitive business sectors aids those 
employed in these sectors and thus can be justified by the 
principle of need; but it may hinder modernization. Subsidies 
for preservation instead of investment in the future can place 
a mortgage on the future and a burden on the next generation. 
Every instance of subsidization by means of public funds that 
has no sustainable effect is an injustice to the needs of the 
next generation. The concerns of the next generation should 
carry the same weight as those of the present generation. 

In political conflicts, the principle of preserving acquired 
rights is frequently asserted. In the spring of 1997, coal miners 
in Germany took to the streets to force a continuation of the 
hitherto-granted government subsidies, which consisted of 
about LSD 55000-per job position per year. If an equal amount 
of subsidy had been demanded to create new jobs for all the 
four million unemployed persons searching for jobs (the un- 
employment rate was 10.4%), that would have required the en- 
tire federal budget. Applying the equality principle of justice 
in this conflict would have demonstrated the injustice of a one- 
sided application of the principle of acquired rights preserva- 
tion and would have undercut the miners’ demands. 


Pension Systems on a Transfer Basis 


The current pension system in Germany raises at least two 
justice problems. First, with the current shift in the age pyra- 
mid, the younger generations can no longer be guaranteed that 
they will receive a pension of an equal ratio to their 


contributions as currently is paid to the pensioners; this is tan- 
tamount to an exploitation of the younger generation. Second, 
parents with children are exploited by the pension system in- 
sofar as their investments in time and money in the develop- 
ment and education of their children represent an essential 
contribution for the future performance capabilities of the pen- 
sion system, but the parents do not receive equitable financial 
compensation for their investments. In addition, if they reduce 
their paid work to save time for caring for their children, as 
mothers frequently do, their own pensions will be lowered. 


What Is Unjust and What Would Be Just? 


It is much easier to reveal injustices than it is to establish jus- 
tice within complex social systems. Pointing to imbalances is 
not yet a generally accepted solution. As justice conflicts and 
justice dilemmas become apparent, the question is how they 
may be resolved. There are several approaches to avoid or to 
settle justice conflicts and to resolve justice dilemmas: 


e Just procedures in decision making help to avoid creating 
feelings and perceptions of injustice. 


e When more than one principle of justice is valid in a case, 
a mixed application of principles would be helpful to 
avoid gross imbalances. 


e Mediation methods can settle justice conflicts in which 
(ideally) all concerned have a guided discourse about all 
issues and perspectives at stake and have a chance to find 
a mutually accepted solution. 


These approaches are outlined next. It should become obvi- 
ous that justice is a personal and social construction. 


AVOIDING AND SETTLING JUSTICE CONFLICTS 


Procedural Justice 


Given the broad spectrum of options for appraising, con- 
structing, and realizing justice in the various fields of social 
life, conflicts about justice are unavoidable. How can these 
conflicts be settled? How can peace be preserved? One answer 
is to ensure fair decision-making—using procedures that are 
broadly accepted as just within the population (Luhmann, 
1983). This solution holds for parliamentary procedures, elec- 
tions, institutional decisions, decisions of courts, arbitration, 
and decisions of other authorities. The separation of powers 
is a crucial element in the appraisal of justice in a given 
society—this includes the right to appeal against decisions of 
authorities, also against institutional and parliamentary deci- 
sions in administrative and constitutional courts. 


Extensive empirical research has confirmed the impact of 
fair procedures on conflict resolution and on the acceptance 
of authoritative decision-making. From a normative perspec- 
tive, the question of which procedures are just is a crucial 
one. Although Thibaut and Walker’s (1975) book is entitled 
Procedural Justice, the authors address the issue of control 
more than they do that of justice. Preferences for procedures 
of conflict resolution that give more control over to the par- 
ties involved may well be motivated by self-interest—as 
Thibaut and Walker have argued—and not by a justice 
motive. Therefore, it is crucial to ask whether the parties in- 
volved claim to have control by voice only for themselves or 
equally for the other party. Leventhal (1980) has drawn up a 
set of justice rules for the procedures of decision making— 
for example, impartiality of the authorities, consistent use of 
arguments, consideration of relevant information, objectivity 
in the review of information, and revision of decisions if new 
information becomes available. The granting of a voice to all 
concerned is also essential—all concerned should be given 
the opportunity to present their views and claims and to take 
influence on the decision-making process by having their 
views and claims considered by the authorities. These are 
criteria that have emerged in Western cultures as procedural 
rights to be claimed by everyone. There is ample general 
evidence of what has been dubbed the fair procedure effect, 
which refers to the phenomenon that perceived procedural 
fairness helps the parties involved to accept even those 
decisions or outcomes that are less favorable than they had 
expected or hoped for (Greenberg & Folger, 1983). 

However, a higher level of education is probably required 
for individuals to understand exactly what is actually meant by 
Leventhal’s list of criteria—and to determine whether author- 
ities observe these rules or violate them. Intellectually less 
demanding criteria for procedural fairness would therefore be 
helpful. The concept of respectful and decent treatment by au- 
thorities is easy to grasp. In their group value theory of proce- 
dural justice, Lind and Tyler (1988) emphasize the way people 
are treated by authorities. According to their theory, people 
care about their valuation and status in their group, commu- 
nity, company, or society, and they infer this status from the 
way they are treated by authorities. Because the way one 
is treated by authorities is informative with respect to one’s 
social status and has an impact on one’s self-esteem—as is as- 
sumed by political theorists such as Lane (1988) and Rawls 
(1971), as well as by psychologists (cf. Tyler, Degouey, & 
Smith, 1996; Van den Bos, Lind, Vermunt, & Wilke, 1997)— 
the concern for respectful and decent treatment in conflict- 
resolution procedures would appear to be motivated by the 
striving to gain or to maintain high self-esteem and a positive 
self-concept. Lind and Tyler’s (1988) group value theory of 
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procedural justice has inspired a wide range of research activ- 
ities and has generated an impressive body of knowledge about 
the favorable impact that perceived respectful and decent treat- 
ment by authorities has on one’s own perceived status, on 
one’s self-esteem, on the acceptance of decisions, and on one’s 
trust in authorities’ fairness and the legitimacy of institutions 
(Tyler et al., 1997; Vermunt & Törnblom, 1996). 

One question as to the justice component of group value 
theory remains open, however: Is everyone entitled to claim 
decent, respectful treatment by authorities? Even those who 
have seriously violated the law and social norms? Respectful 
treatment by benevolent authorities may enhance self- 
esteem, and it may produce further beneficial effects, includ- 
ing the willingness to accept the authorities’ decisions, but 
that alone does not imply that justice is involved. Only if au- 
thorities were obliged to treat everyone in a respectful or 
even friendly manner, everybody would be entitled to claim 
this kind of treatment, regardless of whether they deserve it. 
But is everyone truly entitled to receive respectful treatment? 
Or is this kind of treatment only deserved and claimable by 
certain people—for example, by law-abiding citizens, by 
people who have accumulated merits, by employees who al- 
ways have given their best? 

Brockner et al. (1998) and Heuer, Blumenthal, Douglas, 
and Weinblatt (1999) have made first steps to approach this 
question empirically. They could identify subjects’ self-esteem 
as a moderator variable: Specifically, having a voice and influ- 
ential informational input into an ongoing dispute is correlated 
more closely with perceived procedural fairness among 
people with high self-esteem than it is among those with low 
self-esteem. Respondents with high self-esteem seem to as- 
sume that they deserve and are entitled to respectful treatment 
as well as to being given a voice. Consequently, they and only 
they tend to associate respectful treatment with fairness to 
resent unfairness if they do not get what they expect. 

Nevertheless, positive effects that are analogous to the fair 
procedure effect may well be observable also in cases in 
which perpetrators are benevolently treated respectfully and 
decently by legal authorities: Such defendants may gain trust 
that the sentence is fair, may accept the authority of the judges 
instead of opposing them, may feel accepted as members of 
the community, and may reciprocally accept the rules of that 
community. Benevolence and grace may well have signifi- 
cant integrating power, even if they are not to be claimed. 


Settling Justice Conflicts by Mediation 


There is good reason to assume that at their core, all hot social 
conflicts are conflicts of justice (Montada, 2000). Insights into 
procedural justice—as well as insights into the prevalence of 
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justice dilemmas and knowledge of how to deal with justice 
dilemmas—are helpful to settle social conflicts. We can dif- 
ferentiate several strategies for dealing with conflicts of jus- 
tice in a mediation setting; they are outlined in the following 
discussion. 


Articulating Conflicting Concepts of Justice 


Existing justice conflicts are often not clearly articulated at 
the beginning of a mediation. The conflicts manifest them- 
selves as emotions, specifically as resentment of the opposing 
party. It happens on occasion that the parties cannot clearly 
articulate their views of justice and require assistance with 
this articulation. To be able to assist, the mediators need an 
repertory of hypotheses concerning the principles of justice. 
Only clearly articulated concepts of justice can be communi- 
cated and reflected. 

Mediators must keep in mind that the manifest objects of a 
conflict are not necessarily identical with the underlying deep 
structure of the conflict—an example can be found, for in- 
stance, in the many conflicts between parents and their ado- 
lescent children: the demands of the children for autonomy 
and the demands of the parents for maintaining authority and 
the acceptance of their norms and values by the children. The 
manifest conflicts about dress, hairstyle, neatness, contacts 
with peers, and so on do not directly reflect their deep struc- 
ture. Thus, solutions to the manifest conflicts do not always 
lead to a lasting settlement because the deep structure of con- 
flicting demands is not articulated and not addressed in the 
agreement. 


Imparting Understanding for the Other Party’s 
Concepts of Justice 


After the proper articulation of the conflicts, the second step 
in the process is imparting an understanding of the respective 
demands and normative concepts of the other party. Under- 
standing does not mean that their demands and normative 
concepts were adopted and accepted as justified. Each party 
should, however, reformulate the other party’s views of jus- 
tice and injustice in such a way that this party feels itself to be 
correctly understood; this is analogous to giving and having a 
voice in decision making, and it is a sign of reciprocal re- 
spect. If, in addition, each party can be induced to formulate 
arguments for the views of the respective opposing party, 
then a great deal has been achieved; this is in line with 
principles of discourse ethics and the ideal communication 
situation (Habermas, 1990). 


Imparting Insight Into the Dilemma Structure 
of Conflicts of Justice 


Settling conflicts is made easier when insight is gained that 
divergent principles of justice do exist and may be valid and 
that justice dilemmas result from this fact (Montada & Kals, 
2001). The normative character of justice implies that every- 
one is convinced that his or her own conception of justice is 
valid for everyone else. Each party involved draws the unre- 
flected conclusion that others are either in error in their dif- 
fering viewpoints or are egoistic, perhaps even maliciously 
violating justice. If the person, however, recognizes that a 
justice dilemma exists, then he or she no longer views the po- 
sition of the other parties as completely illegitimate and his or 
her own position as the only legitimate one. In conflict medi- 
ation, the questioning of the exclusive validity of every single 
principle of justice by pointing to the concurrent validity of 
competing principles is an important strategy (Montada & 
Kals, 2001). 

The justification of the validity of a moral rule or principle 
is to be distinguished from the justification of decisions in 
concrete cases in which competing principles are relevant 
(Habermas, 1993). This step requires concrete attempts to 
qualify all conflicting claims for justice. Does this mean gen- 
eral relativism of values, a questioning of their validity? No! 
Not one of the aforementioned principles of justice is un- 
founded. Their validity can be established with good argu- 
ments. As this applies, however, to all conflicting principles, 
this implies that none of these principles is valid exclusively. 
We must counter a negative relativism that states Nothing is 
valid with the positive relativism No principle is valid ab- 
solutely; many principles are valid. Making one principle ab- 
solute and applying it to the exclusion of others would violate 
all other principles. 

It is the wisdom of institutions to consider various princi- 
ples of justice in their regulations and decisions. The social 
market economy, for instance, is an attempt to harmonize the 
rights of the individual citizen to free economic activity with 
the maxims of the social welfare state. Rawls’ maximin- 
principle is also a suggestion toward integration of the free- 
dom of all citizens to economic activity (which ensures best 
collective wealth) with the rights of every citizen to partici- 
pate in the general prosperity (1971). 

Regulations for decision making and individual decisions 
seem to find rather broad acceptance when several principles 
of justice are considered at the same time; this can be seen in 
the research on the distribution of scarce goods such as 
university study places, subsidized housing, and transplant 
organs, as well as in the research on the layoff of employees 


(Elster, 1992). If the decision procedures are, moreover, 
judged as being just, then their acceptance can be expected. 


Qualifying Norms of Justice Empirically 


If the parties are claiming different principles to apply in the 
pending conflict, it can be helpful to show that each party is 
applying completely different standards of justice in different 
situations and contexts; this shall demonstrate that the parties 
accept more then one standard as valid and the debate can be 
focused on the reasons that one principle is preferred in the 
specific case at conflict. A further possibility of qualifying 
principles empirically consists of pointing to the application 
of a multitude of principles and their intermixing in various 
areas of society. Here we can also point to empirical studies 
on institutional distribution procedures in the allocation or 
withdrawal of goods in short supply (Elster, 1992) as well as 
to Walzer’s (1983) observations that various norms of justice 
are used in different spheres of justice. 


Communicating the Subjective Implementations 
of Justice Principles 


If the conflict is not about which principle of justice is to 
be applied—that is, if the parties to the conflict apply the 
same principle but think they have contradicting justifiable 
claims—the justifications of these claims must be finely ana- 
lyzed. As an example may few cases of divorce mediation in 
which at least one party claims a fair balancing of the former 
exchanges. That means that everything must be disclosed— 
all resources, all achievements, all sacrifices, all the love and 
loyalty invested in the marriage, all burdens borne, and also 
all returns. There can be no guarantee that a proper balance 
will be reached in a divorce settlement. But just communicat- 
ing an understanding for the subjective assessments of the 
other party may be a step toward improving the relationship. 


Adhering to Principles of Procedural Justice 


Mediation requires that the rules of procedural justice will be 
observed—the impartiality of the mediators, extensive prob- 
ing into personal perspectives and demands, careful consid- 
eration of information presented, a respectful and polite 
interaction, and so on. In mediation, the decision is made not 
by a third party (e.g., a judge), but rather by the parties them- 
selves. For the mediation proceedings to be judged as fair, the 
mediators’ behavior is important. The mediation is conflict 
settlement worked out in common by the parties and the 
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mediators; therefore, the fairness of the proceedings depends 
also on the parties’ behavior. 

It is true that one cannot oblige parties to the conflict to be 
impartial—an essential principle of procedural justice. But 
one can oblige them not only to listen to the positions and 
arguments of the opposing party, but also to show with their 
own reformulation of those arguments that they have under- 
stood the other party’s position. When this action is coupled 
with a respectful and appreciative attitude, shown by Lind, 
Tyler, and others to be significant, a proper precondition is 
created to judge the mediation as fair proceedings. It is easier 
to come to an agreement under this precondition, and the last- 
ing acceptance of this agreement is more probable. 


JUSTICE AS A PERSONAL 
AND SOCIAL CONSTRUCTION 


Although the justice motive is universal, divergent views 
about justice are indisputable. This becomes evident in 
historical changes, cultural differences, and the ubiquitous 
debates and conflicts in the private sphere—in the political 
arenas as well as in the courtrooms. The debates and conflicts 
have two different topics: (a) Which principle or rule of jus- 
tice is the right one to be applied?, and (b) What is the right 
application of a principle? With respect to the distribution of 
a scarce good, it may (a) be debated whether the equality, the 
equity, the need, or some other principle is the right one to 
apply; and it may (b) be debated who at all is entitled to be 
considered for the distribution, who is more or less needy, 
who has contributed more, and so forth. The first topic is the 
choice of a principle (or a set of principles); the second topic 
is its implementation. 


Justice Appraisals: Intuitions or Moral Reasoning? 


In many situations we experience and perceive injustices 
spontaneously and emotionally (Lerner, 1998). Those who 
are outraged have no doubts that justice was violated; that 
means they have a sense of justice telling them what is just 
and unjust in the specific case at stake. This is not a deliber- 
ated, reflected, thoroughly proofed judgment, but rather is an 
intuitive appraisal. It may well be that the outrage will disap- 
pear when the case is reappraised, when new hypotheses 
about the facts are formed and tested (Bernhardt, 2000) or are 
offered by credible others, by the offender (Montada & 
Kirchhoff, 2000), or by observers (e.g., Zillmann & Cantor, 
1977). 
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Referring to Shweder and Haidt (1993), who have distin- 
guished between two categories of moral judgments, those 
based on cognitive intuitions and those based on reasoning, 
Lerner (1998) has distinguished between preconscious expe- 
riential processes and rational processes in coming to conclu- 
sions about justice and injustice. The moral emotions of 
resentment and guilt are mostly based on preconscious expe- 
riential processes. The belief in a just world (BJW) is a 
“fundamental delusion” (Lerner, 1980) that is preconscious 
in character and motivated by the desire for a trustworthy, re- 
liable world. When adult people answer questionnaire items 
regarding whether the world is basically a just place where 
everybody gets what he or she deserves, most respondents 
hesitate to agree with such a statement. The rational system is 
activated, and too many instances of undeserved victimiza- 
tion and discrimination begin to surface. But even if this ma- 
jority of respondents do not concede that the world is really a 
just place, there remain significant interindividual differences 
in their ratings to test hypotheses derived from BJW theory. 
In the large majority of questionnaire studies, the theoreti- 
cally expected pattern of results was found to corroborate 
BJW theory (cf. Furnham, 1998; Maes, 1998; Montada, 
1998b). 

Like the frequently preconscious cognitions implied in 
emotions (e.g., Epstein, 1984), intuitions of justice and injus- 
tice are often preconscious—not reflected, well articulated, 
or well reasoned. Nevertheless, they are cognitions that can 
be modeled as personal or social constructions (Cohen, 1989) 
of an idea about what is or what would be just in a category 
of cases or in specific cases. The large variety of criteria or 
standards of justice defining entitlements and obligations has 
already been depicted here. Some process of selection must 
be assumed—whether it occurs spontaneously, habitually, or 
done after thoughtful personal deliberation, or done by for- 
malized decision making as in court trials. Even if an indi- 
vidual person’s sense of justice was shaped by socializing 
agents and socializing experiences, the internalization of nor- 
mative standards can be understood as a personal choice 
(Montada, 1993). Moreover, entitlements and obligations 
have to be attributed in concrete cases (e.g., on the basis of 
some principle of justice). The appraisal of an injustice im- 
plies that a subject’s entitlements have been neglected by 
some agent or agency who is considered responsible. 

Looking at unreflected or reflected appraisals of justice 
and injustices as a process of selection and construction open 
the eyes for the spectrum of options given to evaluate a case. 
Psychological research has tried to identify factors that have 
impact on this process of construction. To illustrate, a couple 
of research lines shall be mentioned, beginning with dis- 
positional factors that influence the fabrication of justice 


judgment, and ending with coping processes of victims 
who try to reduce their aversive feelings of being unjustly 
victimized. 


Dispositional Factors in Appraisals of Justice 


Various dispositions have been conceived and operationalized 
that have an impact on the appraisal of justice and injustice. A 
couple of them are mentioned in this section. There are indi- 
vidual preferences for specific principles of distributive justice 
that have characteristic consequences for the appraisal of the 
reality and for the motivation of activities. Those who prefer 
the need principle tend to resent, for instance, gross social in- 
equalities more and are more ready to support socially disad- 
vantaged people than are those who prefer the merit principle. 
In contrast, the latter tend more to blame the disadvantaged for 
lacking efforts to improve their life situation and for having 
self-inflicted their problems (e.g., Montada et al., 1988). 
Those preferring the merit principle tend, for instance, to 
oppose assertive action policies that use preferential treatment 
(at least, if the unfairness of the status quo is not made salient; 
cf. Bobocel et al., 2002). 

As the justice motive may compete with self-interests, the 
centrality of justice in one’s life was measured and found to 
predict voluntary prosocial commitments (e.g., Moschner, 
1998). 

People vary in their sensitivity to befallen injustice; those 
who score high on this trait measure perceive themselves 
more frequently unfairly disadvantaged (Schmitt & 
Mohiyeddini, 1996). 

Belief in a just world is challenged by observed injustices. 
Those scoring high on BJW scales tend more to blame 
socially disadvantaged people for having self-inflicted their 
situation problems. They do this because they want to protect 
their view of a just world in cases in which equalizing unjust 
social inequalities would be very costly or even impossible 
(Lerner 1977; Reichle, Schneider, & Montada, 1998). 

As mentioned before, people with high self-esteem seem 
to feel more entitled to have a voice in decision procedures 
than do those with low self-esteem (Brockner et al., 1998; 
Heuer et al., 1999). 

A case can be viewed from the victim’s perspective, the 
observer’s perspective and the offender’s perspective. Sup- 
posing self-serving biases, one might expect that the victim is 
attributing more responsibility and blameworthiness to the 
offender than the offender is ready to concede, while the ob- 
servers’ appraisals may depend on their attitudes toward the 
victim, the offender, and the experienced threat of their BJW. 
However, the self-serving bias was not observed in harmo- 
nious intimate relationships: Questioning couples separately 


about injustices they have committed against the partner and 
about those they have suffered from the partner, Mikula 
(1998) found with harmonious couples that the appraisal of 
injustices suffered by the partner was more lenient than of 
those oneself had committed to the partner. In disharmonious 
partnerships, the self-serving bias was the rule. 

The role of attitudes toward victims and toward offenders 
was intensively studied in research on juries. Prejudices 
against defendants have a significant impact on the attribu- 
tion of blameworthiness and on sentencing. 


The Social Construction of Justice by Social Movements 


Social movements are stipulated by focusing a social 
injustice—for example, the withholding of civil rights from 
minorities, the discrimination against women in the labor mar- 
ket, the exploitation of natural resources by the living genera- 
tion at the costs of future generations, the discrimination 
against homosexuality, the exploitation of children by child 
labor, the huge inequalities between the rich and the poor 
nations, the torture practiced in many countries, and so on. 

Social movements try to appeal to and if necessary change 
the public sense of justice. Their views about injustices do not 
always reflect common sense and are not always shared by the 
majority. Even the disadvantaged (the victims themselves) 
might not share their views: The women in the workforce 
comparing themselves not with men but with other women 
(Crosby, 1982) did not feel disadvantaged. The children 
in child labor may consider their lot as normal, or they may 
even be proud to contribute a bit to the survival of their fami- 
lies. Homosexual persons in earlier times may have not 
protested against unjust discrimination but may have de- 
plored their “deviant” desires. 

The social movements must present convincing arguments 
that the status quo, the current social practice, is unjust. They 
must gain public attention in the media, but ultimately they 
have to convince the majority—at least the majority in the cen- 
ters of power. Major (1994) has outlined some of the psycho- 
logical processes and strategies to change the public 
awareness of entitlements and obligations. 


Coping With Injustice 


Persons who are suffering an injustice have (in principle) 
several options to respond: They can claim correction of the 
existing injustice, they can claim an equitable compensation 
from the wrongdoers, respectively from those who are liable; 
they can start a lawsuit; they can retaliate the victimization; 
they can claim an apology from the wrongdoer; they can 
appeal for social support and some compensation from their 
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group or community; they can try to forget the case; they can 
ruminate on the case, thus alimenting their resentment and bit- 
terness; they can pray for a just punishment; they can excuse 
and forgive the wrongdoer; and finally, they can cope with 
their resentment. With regard to the construction of justice are 
those cases of interest in which an equitable compensation and 
a just retaliation are not possible and the resentment is not 
calmed down by a honest apology of the wrongdoer. How to 
cope with the continuing feelings of resentment and bitter- 
ness. According to an analysis of the cognitions implied in the 
emotion of resentment (Montada, 1994), the coping can take 
one of four strategies: 


e The suffered harm and losses can be reappraised: Are the 
harm and the losses really that severe? Is it, for instance, 
that violating to have been insulted by a person who has 
such a bad reputation? In cases of objectively severe and 
even irrevocable losses the victims may look for com- 
pensating positive experiences; this was called search for 
meaning in experienced losses. For instance, the victims 
may have found their true friends through this event, or 
they may be proud of the way they have mastered their 
fate. Gains are counterbalancing the losses. 


e Victims may think about their entitlements or the norms of 
justice violated by the offender and may qualify the validity 
of these norms. This point is addressed in more detail in this 
chapter’s section about mediation in justice conflicts. 


e Victims may think about the offenders’ responsibility: Has 
the offender really acted malevolently, intentionally, reck- 
lessly, merely carelessly, or even with good intentions but 
clumsily? Was he or she responsible alone or were others 
responsible too? The victims can even attribute some 
responsibility to themselves in what has been called self- 
blame (Bulman & Wortman, 1977). The diffusion of re- 
sponsibility (e.g., the poor education of the offender), 
especially the attribution of a bit of coresponsibility to 
oneself calms down resentment (Montada, 1992). Acci- 
dent victims actively use reattributions of responsibility as 
a strategy to cope with their feelings of resentment and 
injustice (Montada, Schneider, & Seiler, 1999). A compre- 
hensive review of research about the effects of self- 
responsibility is provided by Dalbert (2001). 

e Victims may think about possible justifications of the 
offender, misconceptions, conflicting obligations, own 
provoking behavior, and so on. Imagined justifications 
calm down resentment and hostility (Bernhardt, 2000; 
Montada & Kirchhoff, 2000). 


These coping strategies, if effective, result in a changed view 
of the case and of its injustice. 
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Observers whose belief in a just world is shattered by 
being confronted with victims of injustice either commit 
themselves to restore justice if that would be feasible (e.g., 
Lerner & Simmons, 1966); claim justice from powerful oth- 
ers as the state (e.g. Schmitt, 1998); or they deny the injus- 
tices by derogating the victims (Lerner & Simmons, 1966), 
blaming the victims for having caused their own fate, or 
denying that harm and losses are serious (e.g., Montada, 
1998a). 


CONCLUSIONS AND OUTLOOK 


We have good reasons to assume that the concern for justice 
is universal, and we have ample empirical evidence that a 
universal consensus on what is to be considered as just or 
unjust is the exception rather than the rule. We can observe 
diverging views between individuals, groups, and cultures, 
and we know of significant historical changes. This is not to 
say that no cases would exist in which views about justice are 
shared in a culture: Experimental social psychology has no 
problems finding and arranging cases that are considered 
unjust by nearly everybody. But the omnipresence of con- 
flicts and debates about what is unjust and what would be 
just—in private lives, in public debates, and in the political 
arena on the local, national, and international levels—gives 
evidence of diverging views. 

Two kinds of social conflicts and debates should be dis- 
tinguished: (a) conflicts about the standards or principles of 
justice to be applied, and (b) conflicts about facts that are rel- 
evant when a specific standard of justice is applied. For in- 
stance, in conflicts about the layoff of employees, the choice 
and the weight of different standards or criteria of justice may 
be disputed (seniority, merit, current productivity, or needi- 
ness of the employees vs. equal chances of all or an equal 
cutback of wages and working time for all, an alternative to 
layoffs). Otherwise it may be disputed who has more or fewer 
merits, who is more or less productive, or needy, and so forth. 
The first kind of conflict is obviously a justice conflict, but 
the latter is also a justice conflict, insofar as criteria of proce- 
dural justice may subjectively seem violated when one’s own 
view of the facts is not shared by everybody—namely, the 
criteria of voice, impartiality, or objectivity. 

It is an important task of empirical research to identify the 
standards of justice that are applied and to describe the cul- 
tural and individual preferences that may be specific to con- 
texts and cases. On the basis of such information, a reliable 
assessment of the conflicting views in actual disputes is pos- 
sible; such an assessment is a precondition for a discourse 
and for conflict settlement. Until now, research has focused 


only a few standards of justice and has not paid attention to 
the whole spectrum of criteria. 

An enlightened discourse has to take several perspectives 
at a problematic case or a justice conflict. Being aware of the 
prevalence of justice dilemmas—meaning that two or more 
valid principles of justice are conflicting—should help to 
avoid and to overcome one-sided views. 

Looking at the character of justice appraisals, Lerner’s 
distinction between preconscious, intuitive, and experiential 
versus rational processes in coming to judgments about jus- 
tice and injustice is a very important one (Lerner, 1998). Many 
appraisals of injustice are intuitive and not consciously re- 
flected upon. Furthermore, the parties of justice conflicts have 
primarily intuitive beliefs about justice and rarely have 
founded their positions on reflected moral reasoning. What is 
the problem with that? Experienced and observed injustices 
may motivate actions that have good and productive conse- 
quences: Protest may assert the validity of a justice norm, stop 
further violations, motivate compensation for the victims, and 
so forth. But it is also true that experienced and observed in- 
justices may motivate terrible actions, up to homicide, war, 
and genocide. Moreover, they may mean a serious impairment 
of the well-being and mental health of individuals. Therefore, 
it might be well-advised to subject these intuitions of justice 
and injustice to reflected moral reasoning. 

For this purpose, the typical use of the singular form when 
speaking of justice may be counterproductive because it 
may suggest that a single view of justice is valid: Which view 
should be valid other than the own intuitively compelling 
one? Whenever views of justice and injustice instigate emo- 
tions or motivate actions that have negative effects for the 
subjects, for others, or for the social systems, moral reason- 
ing may help to demonstrate that no single view can claim 
absolute validity: Accepting one view, one principle absolute 
would violate all others. Trying to integrate and to balance 
various principles of justice in their decisions and regulations 
is the wisdom of institutions and authorities. 

Moral reasoning and discourses about justice are one way 
that views about justice are built up or changed; these are not 
the only ways, however. The socialization and adoption of 
views of justice takes many more ways. Social facts as well 
as the dominant ideologies and traditions have a coining 
effect on people’s minds, at least as long as they are not crit- 
icized by trustworthy people as unjust. 

A look at social movements and the psychological 
processes and strategies used to change the public awareness 
about entitlements and obligations is informative, as Major 
(1994) has shown. It is no less informative to examine 
psychological barriers against changes of the worldview—a 
consolidated belief in a just world, the conviction that the 


status quo is justified or deserved, self-serving biases, or 
prejudices. 

At the individual level, the construction of justice can also 
be observed in victims of injustice, in observers, and in per- 
petrators. The coping strategies of victims who try to avoid or 
to reduce burning outrage and hatred against their victimizers 
or who search for answers to the question Why me? are well 
described. Observers who are not able or not willing to bear 
the costs of correcting undeserved victimizations of others 
reconstruct a case in order to preserve their belief in a just 
world (Lerner, 1980), which is to be considered a personal 
resource (Dalbert, 2001). And the perpetrators try to justify 
their deeds as Sykes and Matza (1957) and others have de- 
scribed; these are examples of the motivated malleability of 
justice views. 

On the other side, we have many examples that justice 
views can impose themselves compellingly and uncontrol- 
lably to individuals, leading to intensive emotions of resent- 
ment of a perpetrator or of guilt when the victim him- or 
herself has failed. Such justice views function psychologi- 
cally as categorical (unconditional) imperatives. This is not 
proof of their indisputable exclusive validity, but it is proof of 
the moral character of justice norms, and it supports the the- 
sis that the justice motive is a primordial motive that is not 
instrumental and cannot be reduced to some other motive like 
self-interest. 

In psychological research on this topic, it is crucial to 
make sure that concern for justice is the object of investiga- 
tion and not some other concern. Entitlements and corre- 
sponding obligations shall be the focus and participants’ 
concern can be seen by looking at the emotional reactions to 
violations of justice norms like guilt and resentments. 
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Although aggression, violence, and evil are interrelated, con- 
temporary research is so specialized that it is unusual to 
group them together, and this chapter is unique in considering 
them together with ideas and research on peace. Why place 
such disparate fields in the same chapter? In part, it is because 
we study aggression and violence in order to avoid the evil 
that they occasion and to achieve peace. However, if peace 
were simply the absence of violence, it would not require 
separate treatment; and if peace were a completely indepen- 
dent field, it would be better to make use of separate chapters. 
Typically, there are separate chapters on aggression and 
prosocial behavior. Here, however, we argue that the achieve- 
ment of peace rests on an understanding of aggression, vio- 
lence, and evil, yet requires us to go beyond that material to 
include not only what is usually conceived as prosocial 
behavior but the use of aggression in a struggle with violence 
and evil. Hence, we must consider these topics in conjunction 
with one another. The attainment of peace requires us to have 
an understanding of aggression, and the pitfalls of violence 
and evil, as well as the various paths that may lead toward a 
peaceful world. 

We begin with aggression because although it is often 
violent, aspects of aggression may be necessary for the 
achievement of peace. We will then sample the voluminous 
literature on forms of violence and the ways in which it may 
be controlled. Although many of these forms are clearly 
related to aggression, Gandhi asserted that the worst form of 
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violence is poverty. Such violence is masked and occurs be- 
cause of unjust economic and political structures. And when 
we turn to examine evil we find that a difficult moral judg- 
ment is involved. Although most contemporary judgment 
considers violence to be evil, much of our striving for peace 
and justice involves us in violence that we do not acknowl- 
edge as evil. This is one of the facts that requires us to base 
any quest for peace (valued as good) on a thorough under- 
standing of violence and evil. In this way, we will not be 
naive when we finally come to consider how peace may be 
attained. 


AGGRESSION 


Aggression has different meanings that focus our attention on 
different aspects of behavior and lead to the creation of dif- 
ferent approaches to its understanding. We may distinguish at 
least three different definitions: 


1. Aggression as behavior intended to hurt an other, whether 
this intent is motivated emotionally (as by anger, pain, 
frustration, or fear) or instrumentally, as a means to an end 
(as in punishing misbehavior or intimidating an other to 
attain one’s end.) There are two caveats to this definition. 
First, the intention to hurt may be embedded in larger in- 
tentions that have quite different meanings. Although the 
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definition succeeds in avoiding the inclusion of hurting 
that is accidental or an unavoidable aspect of helping (as 
in washing a wound that needs to be cleaned), it does 
include behaviors as disparate as a swat on the bottom to 
correct a child and the dropping of a nuclear bomb to win 
a war. Second, the other who is hurt may be the self (as in 
suicide) and may include animals. 


2. Aggression as assertive, moving-out behavior that is 
aimed at getting what one desires (sometimes without 
regard for the wishes of others). 


3. Aggression as the assertion of one’s power in a rela- 
tionship and the removal of challenges to what one be- 
lieves ought to exist. 


If we work from the first definition, we may view aggres- 
sion as behavior that is clearly to be discouraged and socially 
controlled. However, the second definition is much more 
positive, at least in an individualistic culture, and we may 
want to support such behavior as long as it is balanced by a 
concern for others. The third definition raises a number of 
evaluative issues. It is morally neutral if one accepts chal- 
lenges or power as a fact of life, but its presumption of a 
power relationship between persons or groups may be 
viewed as morally repugnant. These alternatives may be kept 
in mind as we examine four major approaches to aggres- 
sion: as socially learned, as emotion based, as biologically 
grounded, or as embedded in conflict. 


Social Learning Theories 


Focusing on aggression as behavior that results in personal 
injury or property destruction, Bandura (1973) showed how 
people may learn such behavior by modeling the aggressive 
behavior of others. Shown an adult striking a large inflatable 
“Bobo” doll, children learn the observed pattern of behavior. 
The pattern is then encouraged or inhibited by what happens 
to the model. If the behavior is rewarded, the model is liked 
and chosen for emulation. Even when children are critical of 
the aggressive means that are used, the amount of their 
aggression increases. If the model is punished, the child’s 
subsequent aggression decreases. Models also function by 
suggesting the social acceptability of some forms of behav- 
ior, thus facilitating patterns of behavior that have already 
been learned. 

Since the direct punishment of aggression is itself aggres- 
sive, it models aggression at the same time that it discourages 
it. Thus, although small amounts of nonabusive spanking can 
be beneficial in disciplining children between the ages of 2 
and 6 years (Larzelere, 1996), physical punishment generally 


promotes aggression. The frequency of physical punishment 
is linearly related to the frequency of aggression toward sib- 
lings (as well as toward parents) across a wide range of ages 
(Larzelere, 1986). 

The modeling of aggression may occur in families, neigh- 
borhoods, or on TV, and in each of these cases numerous 
studies show that children exposed to aggressive models are 
more apt to engage in aggressive behavior. Children growing 
up in abusive families are apt to assault their own children 
(Silver, Dublin, & Lourie, 1969); higher rates of aggressive 
behavior occur in neighborhoods where there is a subculture 
of violence that provides models and rewards aggressive be- 
havior (Wolfgang & Ferracuti, 1967); children exposed to a 
film depicting police violence show more violence during a 
subsequent game of floor hockey than do children who had 
just watched an exciting film on bike racing (Josephson, 
1987); the general aggressiveness of teenagers (as rated by 
teachers and classmates) is correlated to the amount of vio- 
lence that they watched on TV when they were children 
(Turner, Hesse, & Peterson-Lewis, 1986); and both self- 
reported aggression and the seriousness of criminal arrests at 
age 30 is predicted by the violence level of the TV show per- 
sons watched at age 8 (as reported by their mothers 22 years 
earlier; Eron, Huesmann, Lefkowitz, & Walder, 1972). 

From the perspective of social learning theory, aggression 
is neither instinctive nor produced by frustration. It is a pat- 
tern of learned behavior that has been rewarded so that it is 
efficacious within a given society. Aggressive cultures as- 
sume that aggression is innate and natural, without realizing 
that there are other cultures where aggressive patterns of be- 
havior do not occur or occur with far less frequency. Although 
emotional conditions often precede aggression, numerous 
studies have shown that loss, frustration, or anger lead to ag- 
gression only when an aggressive pattern of behavior has 
been learned and reinforced. For example, Nelson, Gelfand, 
and Hartmann (1969) involved children in competitive or 
noncompetitive play and then had them observe either an ag- 
gressive or a nonaggressive model. Those who had lost in 
competitive play were most prone to behave aggressively, but 
only when they were exposed to the aggressive model. 

We may see aggression as a pattern of learned behavior, 
but Huesman (1986) has proposed that we may also concep- 
tualize it as a more general social script, a program of how to 
act in problematic social situations. Children learn such 
scripts by observing how others behave in life and on TV. 
Realistic violence by a perpetrator with whom the child can 
identify is highly salient and easily leads to fantasy and re- 
hearsal as a way of solving problems. Aggressive scripts may 
be used in quite different circumstances and provide ways to 
gain attention and get one’s way. Among middle-class peers, 


the use of such scripts is likely to result in unpopularity. The 
resulting social isolation may lead to increased television 
viewing, and an even greater reliance on the use of aggressive 
scripts, eventually producing the habitual use of violence. 

In Huesmann’s (1998) informational processing model, 
people use a heuristic search process to retrieve a script that is 
relevant for their situation. The use of an aggressive script 
will depend on how situational cues are interpreted, the avail- 
ability of aggressive scripts, the normative evaluation of such 
a script once it is activated, and the interpretation of conse- 
quences. Regarding the last factor, Huesmann pointed out that 
if a child is beaten for aggression, the child may feel disliked 
rather than interpret his or her behavior as unprofitable. 

Although most researchers focus on the use of aggressive 
scripts by delinquents, the scripts are as available for use 
in international conflicts as in bullying and gang wars. 
McCauley (2000) pointed out that while the least socialized 
are more involved in personal violence, it is the best social- 
ized who are often involved in the intergroup violence of war. 
In fact, personal scripts are an aspect of the societal myths 
that we shall consider when we deal with the concept of evil, 
and Schellenber (1996) pointed out that interpersonal vio- 
lence may be more influenced by the extent to which a soci- 
ety engages in war than the reverse. Thus, Ember and 
Ember’s (1994) analysis of the relationship between war and 
interpersonal violence in 186 societies suggested that social- 
ization for aggression and severity of childhood punishment 
appear to be more a consequence rather than a cause of war, 
and it is this socialization that is most directly related to 
interpersonal violence. 

Aggressive scripts are available for use in any social con- 
flict, and if aggressive behavior is perceived as justified, an 
observer is more apt to identify with the aggressor and more 
apt to model the behavior. In the context of competition, the 
goal of winning may entail a willingness to hurt, and this is 
easily conflated with a willingness to hurt in order to win. 
Paradoxically, the fact that context affects the meaning of a 
behavior is an argument for Bandura’s behavioral definition 
of aggression as that which results in (rather than intends) 
harm. For example, bomber pilots often do not intend to hurt 
civilians. Any intention to harm is embedded in the goal of 
carrying out a mission, and attention is directed toward mun- 
dane means (the pilot who carried the Hiroshima bomb was 
primarily concerned that added weight might prevent a safe 
takeoff). 

Many social forces inhibit aggression, and Bandura 
(1999) has extended his earlier work by examining how ag- 
gression is more likely to occur when a person is morally 
disengaged from the victim. Such disengagement may occur 
by justifying the aggression, by using euphemistic labels, or 
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by using advantageous moral comparison. It is facilitated by 
displacing responsibility for the damage that is done (as in 
Milgram’s, 1974, experiments), by diffusing responsibility 
(Zimbardo, 1995), and by increasing the distance between 
persons and evidence of the pain that they are inflicting 
(Kilham & Mann, 1974). Finally, moral disengagement oc- 
curs when dehumanization prevents empathic responsiveness 
(see Bandura, Underwood, & Fromson, 1975). Personally, I 
would argue that we witness the contrast between moral en- 
gagement and moral disengagement whenever we either em- 
pathize with the struggles of an ant or step on the nuisance. A 
scale measuring the extent of moral disengagement has been 
used in different nations and shown to be positively related to 
support for military action (McAlister, 2001). 


Emotional Bases 


In Bandura’s work on the modeling of aggressive behavior, 
the meaning of the behavior as intent to harm is implicit. 
Berkowitz (1993) emphasized that this meaning may be cru- 
cial. This becomes apparent if we focus on aggressive ideas 
and emotions. If one person tackles another in a football 
game, we may see the skillful, determined act of an athlete, or 
we may see a deliberate attempt to injure another person. For 
Berkowitz, it is only in the latter case that the model may 
activate the aggressive thoughts and emotional reactions that 
may lead to aggressive behavior. He pointed out that model- 
ing is more apt to occur when a person identifies with the 
aggressor and when negative emotional states exist. He also 
distinguishes between instrumental aggression that occurs as 
a means of achieving some planned end and emotional 
aggression that is grounded in passion and is typically spon- 
taneous and unplanned. The deliberate “taking out” of a star 
player is quite different from a blow thrown in temper. 
Berkowitz tends to focus on emotional aggression, which 
he sees as pushed out, impelled from within, although this 
impulsion can be influenced by external cues. He sees such 
aggression as having both a motoric component (tightened 
jaw and fists, striking out, etc.) and an urge to hurt, injure, and 
destroy. He makes two major points. First, he argues that the 
emotional state that underlies emotional aggression is not 
only anger, but all negative affect. He shows that the discom- 
fort produced by heat, cold, noise, overcrowding, frustration, 
or free-floating annoyance lead to the increased probabil- 
ity or strength of aggressive behavior. For example, when a 
student makes a mistake, other students behave more ag- 
gressively when they are in a hot room (Baron, 1977), and 
they use more punishment than reward when they are in 
pain. Riots are more apt to occur in hot spells (Baron & 
Ransberger, 1978), and domestic assaults occur more 
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frequently when air pollution is high (Rotton & Frey, 1985; 
see also Berkowitz, 1982; Anderson & Anderson, 1998). 

Second, he asserts that the emotional state consists of a 
network of feelings, ideas, memories, and expressive motor 
reactions that are associated with one another so that the acti- 
vation of one part of the net will activate other parts. Un- 
pleasant memories will promote a negative mood, and this 
will increase the probability of negative thoughts and ag- 
gressive behavior. An external cue that has an aggressive 
meaning (e.g., a weapon) may activate aggressive thoughts 
and—particularly if negative feelings are present—lead to an 
increased probability of aggressive behavior. Thus, in the 
classic experiment by Berkowitz and LePage (1967), angered 
subjects delivered more shocks to their partner when guns 
rather than badminton rackets were in the room (for further 
findings see Turner, Simons, Berkowitz, & Frodi, 1977). 
Even more disturbing, Berkowitz argued that cues associated 
with pain, frustration, suffering, and aversive stimuli in gen- 
eral may activate negative affect and increase the probability 
of aggression. Thus, Berkowitz and Frodi (1979) showed that 
when women university students were angered and dis- 
tracted, they were more punitive when the child they were 
supervising was “funny looking” and stuttered. The activa- 
tion of negative affect does not necessarily lead to aggression. 
Such behavior may be inhibited by either fear of punishment 
or empathic concerns, and a person may learn to respond with 
other behaviors. However, a person is susceptible to the influ- 
ence of negative moods and external stimulation, and aggres- 
sive behavior is a “natural” response to negative affect that 
will tend to occur whenever self-control is reduced. 


Biological Perspectives 


From a biological perspective, many species of animals ex- 
hibit aggressive behavior, and we may consider a good deal of 
this to be instinctually based. Inasmuch as this is so, there 
are constraints on the ease with which we can modify aggres- 
sive behavior. The concept of instinct can be approached from 
the view of contemporary behavioral evolutionary theory or 
from its original conception as used by Freud. 


Behavioral Evolution 


Rather than view aggressive behavior as a learned response, 
we may conceptualize aggression as the product of evolu- 
tionary processes, that is, behavior patterns based on genetic 
influences that have persisted because they have been adap- 
tive, helping members of a species survive in specific envi- 
ronments. Thus, we may find aggressive patterns of behavior 
programmed into the nervous system because the genes that 
served as the basis for these programs were selected by the 


reproduction of the organisms that possessed them. In its 
original conception, instinctual behavior was viewed as a sort 
of drive that, like hunger, was directed toward a goal, build- 
ing if it were not satisfied and becoming less and less partic- 
ular about the ideal goal object until it could be satisfied by 
something that would not ordinarily be chosen. However, it is 
difficult to imagine the goal of an aggressive drive because 
there are so many different functions for aggression. There is 
the aggression involved in predation, in the defense of the 
young, in the struggle between males for mates, and so on. It 
may be better to consider instinctual aggression as comprised 
of particular behavior patterns that can be released by partic- 
ular cues in the environment. We may then consider both 
internal and external factors that may influence these aggres- 
sive patterns, as well as how the patterns may have adaptive 
significance. For example, in most species males engage in 
more intraspecies aggression than females, and this aggres- 
sion is involved in the competition to fertilize females. In 
some species the struggle for mates involves the establish- 
ment and defense of territory and may function to spread 
members of a species out over territory, thus preserving food 
supplies. 

When we examine the human species, we find that soci- 
eties vary widely in the amount of aggression. However, 
within any given society males always appear more aggres- 
sive than females. Observational studies find boys more ag- 
gressive than girls (E. E. Maccoby & Jacklin, 1974); teenage 
males are more apt to offer more violent solutions (Archer & 
McDaniel, 1995); and violent crimes are more apt to be com- 
mitted by men (J. Q. Wilson & Herrnstein, 1985). However, 
Straus and Gelles (1990) suggested that within-family 
aggression may be as prevalent in females. 

The sex differences in aggression that are found appear to 
be related to testosterone, which seems to influence both the 
development of the brain and some of the physiology underly- 
ing current behavior (R. T. Rubin, 1987). However, the exact 
relationship between testosterone and aggression is unclear, 
in part because behaving aggressively appears to affect the 
release of testosterone and because testosterone levels may be 
related more to dominance behavior than to aggression as in- 
tent to hurt. Mazur and Booth (1998) argued that testosterone 
both rises in response to a challenge to dominance and in- 
creases in winners and decreases in losers. Alternatively, van 
Creveld (2000) argued that males gravitate toward violence 
because their bodies are better adapted for aggressive combat 
and because the exercise of this advantage is a way for men to 
solve the existential problem posed by the fact that they cannot 
bear children. They justify their existence by insisting on the 
necessity of violence and their preeminence in its exercise. 

In humans, as in other animals, it is easy to imagine how 
aggressive behavior in a conflict between males could result 


in the reproductive advantage of stronger males and, hence, 
that the genes of aggressive males would be more likely to be 
reproduced in specific environments (Daly & Wilson, 1985). 
However, if we imagine early humans as existing in hunter- 
gatherer groups, it seems clear that cooperative behavior 
within the group would also contribute to survival. Even 
when the sacrifices involved in such cooperation might result 
in a disadvantage for a particular individual and his or her 
genes, genes related to cooperative behavior might be pre- 
served if the cooperation favored kin or others who recipro- 
cated the cooperation, or if the penalty of not cooperating was 
high, or (in certain conditions) if the group itself benefited 
(see D. S. Wilson & Sober, 1994). It seems probable that the 
genetics favoring cooperative behavior may offset those fa- 
voring male in-group aggressiveness. However, this may not 
be so when we consider intergroup combat. Thus, it has been 
noted that boys evidence more intragroup cooperation than 
girls when their group is in competition (Shapira & Madsen, 
1974). In his studies on the conflict between Hindus and 
Muslims, Kakar (1996) observed that when boys and girls 
were asked to construct an “exciting” scene with toys and 
dolls identifiable as Hindu or Muslim, the boys constructed 
scenes of violence whereas the girls created scenes of family 
life. Further, Thompson (1999) has pointed out that group se- 
lection may also have favored the selection of aggressive 
individuals who are willing to die for their group when it is in 
combat. 


Freudian Theory 


Freud’s psychoanalytic theory is written from a biological 
perspective that is based on the older idea of instincts as dri- 
ves or needs. The development of his thinking about aggres- 
sion is complex and has been described and critiqued by 
Fromm (1973). Taken literally, few would agree with Freud’s 
conceptualization. However, on a metaphoric level his theory 
allows a rather elegant statement of a viable theoretical posi- 
tion, best expressed in his letter to Albert Einstein on the 
cause of war (Freud, 1933). Working in the manner of evolu- 
tionary biologists today, Freud assumed that early humans 
lived in small groups. He postulated that these groups were 
initially dominated by the compelling aggression of the 
strongest male. This dominance could eventually be over- 
come by an aggressive union of weaker males. However, 
such a union had to be maintained by the growth of law and 
feelings of unity. As Freud (1933, p. 276) put it, “Here, I be- 
lieve we have all the essentials: violence overcome by the 
transference of power to a larger unity, which is held together 
by emotional ties between its members.” As Freud surveyed 
history he was not encouraged by what he saw. Although ag- 
gression within groups is contained by laws that are enforced 
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by group union, there is no way to contain aggression be- 
tween groups. Some propose the ideal of a union between 
groups, but a mere ideal is not sufficient to overcome the ag- 
gressive drive of independent groups, and powerful groups 
are unwilling to grant sufficient power to a superordinate 
body. 

In spite of the apparent cogency of the previous argu- 
ments, it should be noted that the consensus of contemporary 
social scientists is that there is no instinctual press for war 
per se. Thus, the group gathered at Seville to examine the 
problem declared, “It is scientifically incorrect when people 
say that war cannot be ended because it is part of human na- 
ture” (UNESCO, 1991, p. 10). The reasoning behind such a 
statement is well explicated by Fromm (1973). He pointed out 
that war, like slavery, is a human institution. Early hunter- 
gatherer groups had no reason to engage in warfare because 
there were no goods to plunder. Far from being an aspect of 
“primitive” man, warfare develops along with the develop- 
ment of civilization. Agriculture and animal husbandry lead 
to surpluses and the development of specializations and hier- 
archies of power that then become involved in the conquest of 
other peoples. Fromm pointed out that there were and still are 
peaceful peoples, as well as evidence for a relatively high de- 
gree of civilization in a number of matrilocal societies that 
existed before warfare began. Of course, these facts do not 
preclude a possible instinctive base for the in-group biases 
that are so prevalent whenever groups must share resources. 
Culture interfaces with a biological base for both cooperation 
and violence. Although chimpanzees show extensive cooper- 
ative behavior, Blanchard and Blanchard (2000) pointed out 
that they also form raiding parties and that this aggressiveness 
is inhibited when they are afraid. 


Conflict Theories 


Rather than focus on aggression as the response of individu- 
als, we may examine the role it plays in the relations between 
people, as a way of wining conflicts, establishing dominance, 
or managing impression. This more sociological approach 
may be linked to both emotion and biology. Regarding emo- 
tion, I proposed that anger is best regarded as a response to a 
challenge to what a person asserts ought to exist (de Rivera, 
1977, 1981). I argued that when persons become angry, they 
are attempting to remove a challenge in a way that is analo- 
gous to an animal defending territory. Learned aggressive 
responses are recruited to serve this end. 

Considering aggression as an aspect of conflict is close to 
the biological perspective in that it considers the function of 
aggression in a specific cultural environment. In game theory 
the choice of a competitive or mistrustful, rather than cooper- 
ative, strategy may be regarded as involving aggression in the 
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sense that benefits to the self are sought without regard for in- 
jury to the other. Research shows how many situations se- 
duce persons to play aggressively even when this is not in 
their best interest (Deutsch, 1958). And the game theory used 
in the study of conflict is enriched by utilizing the perspective 
of evolutionary biology (D. S. Wilson, 1998). 

Scheff (1999) asserted that emotional sequences play a 
crucial role in both interpersonal and intergroup conflict, and 
he focused on the roles of pride and shame as signals of soli- 
darity and alienation. He argued that if shame is acknowl- 
edged, connections of solidarity and trust can be built. 
However, shame is often unacknowledged. Such unacknowl- 
edged shame may involve painful feeling with little ideation 
and is often signaled by furtiveness. When it is bypassed, it 
involves rapid thoughts that occur with little feeling and is 
accompanied by hostility or withdrawal from the other in 
ways that mask the shame. The unacknowledged shame feeds 
on itself, and the person becomes ashamed that he or she is 
ashamed, experiences a panic state, or enters a humiliated 
fury. A typical pattern is to mask the shame with anger, and 
Scheff sees such shame-anger loops at the heart of destruc- 
tive conflicts. He suggested that they account for the need for 
vengeance and are at the heart of deterrence strategy. The 
danger of appearing weak, the underlying unacknowledged 
shame, is euphemistically treated as face-saving and status 
competition. 

Seeing aggression as an aspect of conflict reminds us that 
at least two parties are involved and that aggression increases 
as the parties respond to one another. In their examination 
of conflicts between individuals, groups, and nations, J. Z. 
Rubin, Pruitt, and Kim (1994) described how conflicts esca- 
late in aggressiveness. This escalation occurs in different 
ways: Influence attempts move from light to heavy tactics, 
from persuasive attempts to threats and violence; issues pro- 
liferate from small to large so that parties become increas- 
ingly involved in the conflict and commit more resources to 
it; issues move from the specific to the general so that the 
relationship between parties deteriorates; motivation shifts 
from simply doing well for the self to winning and then 
to hurting the other; and participants may grow from few to 
many. Thus, the strength of the aggressive responses (from a 
harsh word to a physical threat), the generalization of the at- 
tack (from one aspect of behavior to a description of charac- 
ter), and the extensity of the conflict (from a disagreement 
over one thing to disagreements over many) all may increase. 

Fortunately, such conflicts often subside, decreasing as for- 
bearance prevails, tempers cool, and apologies are made. This 
is particularly true when the parties to the conflict have com- 
mon interests and a history of cooperation. However, conflicts 
that spiral and escalate may lead to structural changes that 


make it difficult for the conflict to subside. These involve psy- 
chological transformations. When groups are involved, there 
are changes in group and community dynamics. The psycho- 
logical changes that occur involve the development of nega- 
tive attitudes and beliefs about the other, the development of 
competitive and hostile goals, and the deindividuation and 
dehumanization discussed earlier. As R. K. White (1984) de- 
scribed, an intensely negative image of the other begins to de- 
velop, and the other becomes regarded as immoral, inhuman, 
and evil. When groups are involved, they become polarized. 
They become increasingly extreme in hostile attitudes and de- 
velop norms that resist compromise as well as contentious 
group goals that contribute to in-group solidarity at the ex- 
pense of the out-group. They select militant leaders, become 
more liable to the problems of groupthink (Janis, 1983), and 
initiate the development of militant subgroups. The entire 
community may become polarized as members are forced to 
choose sides and neutrality becomes impossible without one’s 
loyalty becoming questioned. 


VIOLENCE AND ITS CONTROL 


Violence seems to imply a damaging of the other, a hurting 
that is more than temporary pain. However, what is consid- 
ered to be violent often seems to depend on one’s side. Thus, 
Blumenthal, Kahn, Andrews, and Head (1972) found that 
persons tend to distinguish between the violence of those 
they dislike and the “justified force” of groups they favor. 
Further, the violence that is ordinarily considered is the direct 
violence involved in using force to injure, damage, or destroy 
a person or to violate unjustly a person’s rights. We tend to 
overlook indirect violence. In this section on violence we 
consider the many forms of direct violence that occur be- 
tween persons, within communities, and in and between soci- 
eties. However, we also consider the violence that is done 
when human beings are distorted and prevented from devel- 
oping their potential, the structural violence described by 
Galtung (1969). 


Personal Violence 


Violence often occurs in an interpersonal context. We first 
consider men who are particularly prone to violence, and 
then examine family violence, rape, and bullying. 


Violent Men 


Men committed about 90% of the violent crimes between 
1960 and 1980 in the United States (J. Q. Wilson & 


Herrnstein, 1985). Although this percentage is changing (it is 
now 83%), men are more apt to be involved, and some are 
particularly prone to violence. Thus, Berkowitz (1993) cited 
a number of studies that present evidence for a degree of vio- 
lent consistency across situations and across time. For exam- 
ple, Farrington (1989) reported on a longitudinal study of 
400 working-class males that lived in a section of London. 
Youths and their parents were periodically interviewed; 
teachers rated their behavior; and court records were exam- 
ined. Of the 100 youths who were most aggressive at age 9, 
14% had been convicted of a violent offense by age 21 as 
compared with 4% of the other boys. 

Aggressive scripts appear to be much more prevalent in 
some subcultures, are available to groups, and are promoted 
by negative affect. Thus, Dunning, Murphy, and Williams 
(1988) documented how the British football hooligans, who 
“have an aggro” and violently attack strangers, are working- 
class males who grow up in families where they witness, re- 
ceive, and are coached in violence until many enjoy the skills 
and thrills involved. However, it should also be noted that the 
studies showing how particular people are prone to violence 
also reveal the inaccuracy of prediction over time. Many peo- 
ple change to become more or less aggressive as they age. 
And it must also be realized that the reactions of others can 
amplify initial marginal deviance so that greater problems 
develop (Caprara & Zimbardo, 1996). 

What are the motivations behind violent crime? Using 
peer interviews of 69 men who had records for repeated vio- 
lence, Toch (1969/1993) distinguished nine types of motiva- 
tional processes. His most frequent classification (used to 
describe 28 cases) involved the promoting or defending of a 
self-image that seemed to be constructed as a compensation 
for the fact that the person was not convinced of his own 
worth. By contrast, another 16 cases involved a basic ego- 
centrism that evidenced a complete lack of empathy with 
others, who were simply seen as objects. Although Toch 
demonstrated that his classification can be used with some 
reliability, it is clear that often more than one process is in- 
volved, and his main concern was to point out that violence 
should not be treated as homogeneous. He also pointed out 
that once a man has been involved in violence, he may de- 
velop a habit of using violence. Violence creates its own 
needs and reinforces the very insecurities and egocentricity 
that was its source, and a person may define the self in a way 
that makes violence more probable in the future. 

Baumeister and Campbell (1999) distinguished three dis- 
tinct processes that they believe may be involved in an intrin- 
sic appeal to commit violent acts (as opposed to instrumental 
motives for either selfish or idealistic ends). They differenti- 
ated sadism (the pleasure of inflicting suffering or terror 
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involved in Toch’s bullying category) from violent thrill 
seeking. They argued that the former is an addictive process, 
while the latter is more the sporadic sensation seeking of the 
bored (and often drunk and impulsive). They estimated that 
sadism may develop in about 5% of persons who are repeat- 
edly violent and possibly explained by Solomon’s (1980) 
opponent process theory. By contrast, they saw violent thrill 
seeking as motivated by boredom combined with high sensa- 
tion seeking and low impulse control (and more likely to 
occur under the influence of alcohol). Both processes are, of 
course, opposed by any guilt feelings an individual may have. 

A third process involves threatened egoism and, at first 
glance, appears to be related to Toch’s category of self-image 
promoting and defending. However, Toch views his category 
as involving persons who are not convinced of their own 
worth, while Baumeister and Campbell feel that the process 
they are describing is more a reaction when a narcissistic 
view of the self is threatened. This disparity may involve 
some important distinctions (along with some semantic and 
measurement issues). Baumeister, Smart, and Boden’s (1996) 
literature review seemed to show that aggressors tend to have 
favorable, even grandiose, views of the self, and Bushman 
and Baumeister’s (1998) laboratory study showed the highest 
aggression coming from the injured self-esteem of insulted 
narcissists. However, the review article often involved the 
aggression of psychopaths and rapists whom Toch would 
classify as egocentric, and narcissism also implies egocen- 
tricity. We may need to distinguish the inflated (and narcis- 
sistic) self-esteem of the bully from the poor self-esteem of 
other violent persons. 

The previous analyses are useful in understanding persons 
whose violence is not condoned by the society in which they 
live, but how may we understand the tolerated violence of 
leaders such as Hitler, Stalin, or Poi Pot, lieutenants such as 
Himmler or Goering, or the masses involved in publicly 
sponsored killings such as those that occurred in the Roman 
coliseum or any number of blatant purges and massacres? 
Although we may assume a certain amount of egocentricity, 
objectification, and lack of empathy, there appear to be 
deeper motivational reasons, the sort of instinctual tendency 
for destruction postulated by Freud. An alternative is sug- 
gested by Fromm’s (1973) analysis of destructive character 
structure. Fromm distinguished between a biologically adap- 
tive aggression that humans share with other animals and a 
malignant aggression that is distinctly human and maladap- 
tively destructive, a point discussed more fully when we con- 
sider the nature of evil. One of the character structures he 
describes involves the worship of technique and the fusion of 
technique with the destructiveness seen in modern warfare. 
This may be objectively measured and related to political 
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attitudes that favor military power and the repression of dis- 
sent (see M. Maccoby, 1972). 


Family Violence 


Although there are obvious differences between child abuse 
and domestic violence, it seems important to relate these 
two types of violence to each other and to the rather ne- 
glected topic of sibling abuse. For example, Patterson (1982) 
showed that the families of antisocial and abused children 
fail to provide consistent and effective discipline when chil- 
dren are aggressive, fail to monitor their whereabouts, and 
do not provide positive reinforcement for prosocial skills. He 
suggested that in such families, a problem child learns to be 
aggressive by attacking and dominating siblings, and a sam- 
ple of such children shows that they attack siblings at almost 
10 times the normal rate, as well as being involved in hitting 
their fathers and mothers and being hit by them (Patterson, 
1986). 

One often thinks of child abuse as involving physical or 
sexual abuse. However, by far the most prevalent form of 
abuse involves neglect. Thus, Sedlak (1990) reported an inci- 
dence rate of 14.6 per 1,000 as contrasted with rates of 4.9 for 
physical abuse and 2.1 for sexual abuse. Parental neglect usu- 
ally occurs in situations of low family income and education 
and often where there is a high level of stress and a lack of so- 
cial support (Garbarino, 1991). In such situations there is 
probably less parental maturity, less knowledge about child 
development, and a greater degree of attachment distur- 
bances. Further, mothers (and one presumes fathers) may be 
quite depressed, and this may contribute to the neglect of 
children (see Pianta, Egeland, & Erickson, 1989). Poverty 
and the lack of social support also appear to be a factor in 
both physical and sexual abuse. 

Azar (1991) pointed out that in order to understand fully 
how abuse occurs we must look at the interpersonal dynam- 
ics that occur within the social context. Her investigations of 
abusive mothers reveal that they often misperceive a child’s 
behavior. If a 3-year-old spills a glass of milk, the mother 
may perceive willful disobedience and may lack the skills to 
cope with what she perceives as a challenge to her authority. 
Learned patterns of aggression that occur in a situation per- 
ceived as a struggle for dominance may account for a good 
deal of physical abuse. In any case, in their review of theories 
of child abuse, Azar, Povilaitis, Lauretti, and Pouquette 
(1998) argued that the best way to understand child abuse is 
to focus on how a parent interacts with a child in a social sit- 
uation that is influenced by both societal and cultural factors. 
The interaction will be influenced by a child who may be 
more or less pleasing and difficult and by a mother who may 


have more or less parental and social skills, impulse control, 
and ability to manage stress. And the interaction occurs in a 
context that may or may not provide helpful or aggressive 
models, be stressful, or lend social support. To some extent 
this overall model for understanding may also be applicable 
to sexual abuse. However, sexual abuse seems less dependent 
on societal stress and more on personality factors such as 
high familial dependence, psychopathy, or pedophilic ten- 
dencies (Rist, 1979), or, more generally, as having its origin 
within perpetrators rather than in the interaction between per- 
petrator and victim (Haugaard, 1988). 

Domestic abuse in the sense of partner abuse is often 
attributed to male batterers, and the U.S. Department of Justice 
(1995) reported about twice as many wives and girlfriends 
killed by husbands and boyfriends as the converse 
(1,500:700). However, a telephone survey of 6,000 married or 
cohabiting couples found that as many females as males ap- 
peared to be involved in partner violence (Straus & Gelles, 
1990). In many cases of extreme violence the superior physi- 
cal strength of the males was offset by the more frequent use of 
weapons by females. In a similar vein, surveys of lesbian cou- 
ples have found as much or more violence as in heterosexual 
couples (Waldner-Haugrud, Gratch, & Magruder, 1997). 

Regardless of the degree to which violence in perpetrated 
by males, it seems important to distinguish between differ- 
ent sorts of perpetrators. Holtzworth-Munroe (2000) distin- 
guished between three groups of men involved in marital 
violence: Those who become violent within the family only 
as a result of an inability to manage conflict escalation; those 
who have difficulty with trust issues and become overly de- 
pendent on their wives, resorting to violence when their 
needs are not met; and those who are antisocial and violent in 
all relationships. Clearly, the management of domestic abuse 
in each of these cases requires quite different strategies. In 
some cases couples therapy seems appropriate, but in others 
it would simply prolong abuse. In some cases separation of- 
fers a solution. However, Hart (1992) reported that about 
75% of emergency room visits and calls to law enforcement, 
and 50% of the homicides, occur after separation. 

Intervention programs attempting to teach men anger 
management and conflict resolution skills in small groups 
typically report a 53% to 83% success rate (Edleson, 1996). 
Although these rates are encouraging, the lower percentages 
occur when there is a longer follow-up time and when suc- 
cess is based on the reports of the victims or on arrest rates. 
Further, the rates are based on men who complete the pro- 
grams (which last from 10 to 36 sessions) and appear to 
ignore differences in different types of abusers. In one eval- 
uation, of about 500 men who contacted the program, only 
283 attended the first session, and only 153 completed the 


sessions. Programs are also available for the treatment of 
aggressive women (see Leisring, Dowd, & Rosenbaum, 
in press), although these may face the same attendance prob- 
lems as do the men’s programs. 


Rape 


Estimation of the prevalence of rape depends a great deal 
on how it is defined and how the statistics are collected 
(Muehlenhard, Powch, Phelps, & Giusti, 1992). In a well- 
designed study involving over 3,000 college women, 15% 
reported that they had experienced unwanted sexual penetra- 
tion because a man had used physical force or given them 
alcohol or drugs, and an additional 12% reported having had 
to resist physical force (Koss, Gigycz, & Wisniewski, 1987). 
Few of these instances were reported, and many may not 
have been termed “rape” or recognized as such. 

After years of study, Malamuth (1998) presented data 
showing that coercive sex is most apt to be perpetrated by 
males who have both an orientation toward impersonal sex 
(often related to a violent childhood) and a hostile masculin- 
ity (involving feelings of rejection and a desire to dominate 
women). To some extent these risk factors may be mitigated 
by the ability to empathize. 

If the problem of sexual aggression were only a problem 
of restraining a relatively small percentage of males, it might 
be fairly easily addressed. Unfortunately, it seems clear that 
larger percentages can be influenced by subcultural norms in 
some gangs, military units, sports teams, and fraternities. 
These subcultures suggest that what seems like rape to some 
is merely normal masculine action. Such norms encourage 
the objectification of women as sexual objects and the rein- 
forcement of rape myths (O’ Toole, 1997). Fortunately, it ap- 
pears that it may be possible to create intervention programs 
that decrease the acceptance of such norms (Flores & 
Hartlaub, 1998). However, it may also be possible to create 
conditions such as war that lead men to rape. 

Although we have been examining rape as a form of 
personal violence, rape may also be used impersonally as an 
instrument of war (Copelon, 1995). Enloe (2000) pointed out 
the different ways in which such militarized rape may be 
used to achieve political objectives and may become institu- 
tionalized. The brutality of such rape is particularly devastat- 
ing because victims are often subsequently rejected by their 
own communities (Turshen, 2000). 


Bullying and Malicious Gossip 


A defining aspect of bullying is that the behavior occurs re- 
peatedly so that there is a pattern of abuse and intimidation 
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(Boulton & Underwood, 1992). So defined, Bernstein and 
Watson (1997) reported that from 7% to 10 % of U.S. school 
children are victimized by about another 7% of the children 
who are active bullies. Similar percentages are found in Great 
Britain (Atlas & Pepler, 1998), with prevalence highest in the 
middle-school years. The average bully appears to have nor- 
mal self-esteem (see Bernstein & Watson, 1997), and the 
average victim has less self-perceived social competence 
compared with other members of their peer group (Egan & 
Perry, 1998). There are a number of detrimental aspects to 
bullying. Those who are bullied feel unsafe at school and ap- 
pear to be at risk for illness, failure, and depression (Wylie, 
2000). The bullies begin to think that they can get what they 
want by using power to dominate others, something that 
often fails to work in adult life (Oliver, Hoover, & Hazler, 
1994). In addition, the majority of children who witness the 
bullying typically feel fear and fail to intervene. Thus, they 
may be being trained to be ineffective bystanders. Fortu- 
nately, it is possible to train teachers, students, and parents in 
ways to intervene. Such training, involving the entire com- 
munity, has proven effective in Scandinavia, where programs 
have reduced bullying by 50% (Olweus, 1991, 1993). 


Community Violence 


Although it sometimes takes personal forms, some types of 
violence are more communal than interpersonal. Among such 
forms are riots, gang extortion and warfare, and police vio- 
lence. Although space considerations prevent an adequate 
discussion of these forms of violence, some contemporary 
work may be noted. 


Riots 


The failure of structural factors in predicting which cities 
would have the most commodity riots, together with difficul- 
ties in satisfactorily predicting which individuals would par- 
ticipate in them, has led McPhail (1994) to abandon both 
structural strain and relative deprivation as predictors and to 
advocate extending Snyder’s (1979) work of examining the 
factors that affect the interpersonal processes that assemble a 
riot. Such an approach inquires into communication patterns 
and the motives of individuals as they assemble to engage in 
collective goals. McPhail argued that it is important to distin- 
guish between collective goals that do not intend violence 
(although violence may result) and collective goals that do 
intend violence (as is the case with England’s football hooli- 
gans). It would also seem wise to consider the group emo- 
tions that can occur when group members share attitudes and 
have a common sentience (see K. K. Smith & Crandell, 
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1984). Such group emotion may play a role in the generation 
of crowd violence in cases where police suppositions of vio- 
lence may create a self-fulfilling prophecy of violence in 
sports fans (Stott & Reicher, 1998). 

A nuanced account of the group emotions involved in 
riots is provided by Kakar’s (2000) description of the pro- 
tracted communal clashes between Hindus and Muslims in 
Hyderabad, India. After a skillful delineation of economic, 
political, historical, demographic, social-psychological, and 
psychoanalytic accounts, he focused on the psychological 
shifts that occur at the outbreak of violence. He noted how the 
character of rumors begins to change from general threats to 
rumors that the body is threatened by previously benign sub- 
stances, how the boundaries of individuals with peaceful reli- 
gious identities and a basic sense of trust become replaced by 
a transcendent communal identity with a propensity for anxi- 
ety and violence, how individual behavior becomes governed 
by a different sort of morality, and how a history of coexis- 
tence is replaced by a history of violence. Of course, these 
shifts are perpetrated by demagogues, and much of the vio- 
lence is perpetrated by gangs of young people, but Kakar’s 
point is that the entire community is caught up in altered 
identities and that certain norms are still in existence that en- 
able people to return to their traditional religious identities 
and live in relative harmony after the violence subsides. 


Gangs and Gang Warfare 


In 1996 there were about 31,000 gangs with approximately 
846,000 members in the United States (Office of Juvenile Jus- 
tice and Delinquency Prevention, 1998). It seems likely that 
gangs develop whenever societies fragment and lower-class 
males lack access to legitimate sources of power and prestige. 
It might be interesting to study gangs as quasi states. 
Certainly, delinquent gangs involve symbols of identification 
for group membership, territorial claims, leadership power 
struggles, in-group protection, and out-group antagonism 
(Capozzoli & McVey, 2000). When the Soviet Union 
collapsed, hundreds of violent groups emerged who, as 
Volkov (2000, p. 709) noted, “intimidated, protected, gath- 
ered information, settled disputes, gave guaranties, enforced 
contracts and taxed.” He argued that these entrepreneurs 
of violence created organizations that were essentially 
“violence-managing agencies.” The more successful gradu- 
ally became legitimized by becoming involved in prosocial 
activities and absorbed in the process of state formation. 
Hill, Howell, Hawkins, and Battin-Pearson (1999) noted 
four risk factors that predict adolescent involvement in 
gangs. On the community level these include poverty, high 
rates of mobility, and dysfunctional norms. A crucial problem 


created by gang warfare (and by civil war in general) is the 
impact that violence has on children. Kostelny and Garbarino 
(2001) pointed out that in some Chicago neighborhoods, 38% 
of elementary school children have seen a dead body outside, 
and 21% have had someone threaten to shoot them. They 
have noted how repeated violence often leads to regression, a 
loss of trust, sense of no future, and increased aggressive be- 
havior. They propose a series of measures to counteract these 
affects, including home visiting and early education pro- 
grams, as well as specific violence prevention programs at 
both the elementary- and middle-school levels. 

One might think that children who have suffered the sort 
of violence that occurs when civil society has disintegrated 
would themselves become violent. However, this is not nec- 
essarily so. What sort of moral character might develop in 
South Africa, where, after the violence of apartheid, children 
were subjected to criminal, domestic, and vigilante violence, 
often with a lack of clarity about the reasons for the violence? 
Dawes (1994) reported that the majority of children do not 
appear to have become violent or even to seek retaliation. In 
fact, many evidence an increased empathy. He points out that 
moral behavior is learned in a sociocultural context. People 
construct their identities and reputations as members of 
groups and learn moral conduct in settings that assign re- 
sponsibility to the roles that people chose to play. Hence, 
children may learn that violence is called for in one situation 
but immoral in another. Although a culture may arise and lead 
some to assume violent roles, such violence is not automati- 
cally produced by being exposed to violence but is subject to 
the rhetoric and morality developed by a group in a situation. 

Merely increasing the number of police in an area high in 
gang crime does not seem to be effective, but a study by 
Fritsch, Caeti, and Taylor (1999) suggested that if the addi- 
tional police focus on curfew and truancy enforcement, it is 
possible to reduce gang-related crime. However, many inves- 
tigators argue that suppression is less effective than social in- 
terventions that offer centers of activity for at-risk youth 
before they become involved in gangs (Spergel & Grossman, 
1997), and there is some evidence that such centers are effec- 
tive alternatives (Thurman, Giacomazzi, Reisig, & Mueller, 
1996). 


The Use of Violence in Social Control 


Unfortunately, the process of maintaining social control often 
involves violence and far more violence than appears justifi- 
able. In fact, it may be argued that punishment and the use of 
any violence, as opposed to an aggressive use of force and 
physical restraint, fails to deter violence and cannot be justi- 
fied (Gilligan, 2000). When violence is used, it may occur in 


the form of police violence in the streets or violence within 
prisons in the form of torture or cruel and unusual punish- 
ment. In the United States the reported instances of death 
from police violence are relatively low, about 300 deaths per 
year. It seems evident that laws and strong civilian control 
must demand a professional force that minimizes the use of 
violence and that excellent training is crucial. Toch, Grant, 
and Galvin (1974) argued that the best way to achieve control 
over unacceptable police violence is to have peer review pan- 
els who review all arrest reports, tally deployed violence, and 
work with those officers who exceed a predefined number of 
incidents. Such officers are helped to understand their behav- 
ior and create alternative approaches to handling the situa- 
tions that provoked their violence. When their incidents 
decrease, they themselves are enlisted to become members of 
the panel. 

In the United States an increasing number of persons are 
being imprisoned: The prison population in maximum- 
security prisons more than doubled from 1987 to 1997. There 
are now over 1.6 million persons in federal and state prisons, 
and inmates outnumber guards 38 to 1 (U.S. Department of 
Justice, 1999). Note that this figure does not include persons 
in county and city jails. Much of this increase is due to non- 
violent drug offenders and targets African American and 
Black men. As Haney and Zimbardo (1998) pointed out, the 
increased use of imprisonment reflects a policy choice to im- 
prison individual lawbreakers rather than to correct the social 
conditions that contribute to crime. There are troubling indi- 
cations that the increased privatization of prisons is leading to 
abusive practices such as the increased use of stun belts and 
solitary confinement. Further, there is every indication that 
prisons are failing to rehabilitate a majority of those who are 
incarcerated. Thus, Beck and Shipley (1997) reported that an 
estimated 62.5% of the prisoners released from 11 state pris- 
ons were rearrested for a felony or serious misdemeanor 
within three years of their release and that 41.4% were re- 
turned to prison or jail. Many prisons appear to be dominated 
by gangs (Lerner, 1984), and violence and domination occur 
between inmates and are often used by guards. It is evident 
that most prisons are providing an environment that encour- 
ages learning violence rather than responsibility (Haney & 
Zimbardo, 1998). 

There are managerial approaches that reduce violence 
(Reisig, 1998), vocational programs that offer structure and 
reduce assault rates (McCorkle, Miethe, & Drass, 1995), and 
educational programs that reduce violence and recidivism 
(Matthews & Pitts, 1998). However, neither governmental 
officials nor the American public seems willing to spend 
money on what is often seen as coddling prisoners. There ap- 
pears to be a general attitude that favors punishment over re- 
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habilitation, a general failure to distinguish between what 
might help different types of prisoners, and a general lack of 
compassion for those who find themselves in prison. 

Torture is the most troublesome form of police violence, 
and it often leaves its victims crippled both physically and 
psychologically. This is particularly true when, as is often the 
case, the aim of the torture was not to obtain information but 
to intimidate and destroy a person so that he or she could no 
longer function as a leader of resistance to those in authority. 
While victims differ widely in their posttorture symptoms 
and some have demonstrated an incredible capacity to for- 
give and heal, many need both physical and psychological 
treatment. Elsass (1997) has described effective treatment 
methods, and the journal Torture is devoted to the prevention 
of torture and the rehabilitation of its victims. 

As defined in the 1984 United Nations Convention 
Against Torture, torture involves the intentional infliction of 
severe pain or suffering, by or with the agreement of a public 
official, in order to obtain information or a confession, or to 
punish, intimidate, or coerce. Perhaps the most evident indi- 
cation of the extent of the problem is that by the year 2000 
only 119 of the 188 member states had endorsed the rules 
against torture promulgated by the UN convention. 

Amnesty International (2000) enumerated a 12-point pro- 
gram to eliminate torture. These include calling for every 
nation to officially condemn and enact laws against torture, 
refuse evidence obtained under torture, make the location of 
all prisoners known, allow prisoners to communicate, have 
all allegations of torture investigated by an authority inde- 
pendent of the prison system, have authorities clearly state 
their opposition to the use of torture, punish torturers, and 
compensate victims. 


Societal Violence 


Although community violence often reflects what is happen- 
ing within a society, there are forms of violence that occur 
throughout the society in which communities are embedded. 
This include the violence in its media and the violence that 
occurs when a society engages in war, is subjected to civil war, 
ethnic violence, and genocide, or must deal with terrorism. 


Media Violence 


We saw how aggressive behavior can be learned by following 
the models provided on film and TV. Hence, it is troubling to 
note that Hepburn (1997) reported that 57% of the TV pro- 
grams monitored at four different locations in the United 
States contained some sort of violence, whereas only 4% pre- 
sented an antiviolence theme. American children are exposed 
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to vast amounts of violence on TV; Signorelli, Gerber, and 
Morgan (1995) estimated that the average 12-year-old has 
seen over a 100,000 acts of violence. There is little doubt that 
this sort of exposure contributes to violence (see Eron, 
Heusmann, Lefkowitz, & Walder, 1996). The violence is 
most apt to be learned when an attractive perpetrator with 
whom the viewer can identify engages in justified and re- 
warded violence that fails to depict the harm suffered by the 
victim of the violence (S. L. Smith & Donnerstein, 1998). 

Media violence appears to promote violence in a number 
of different ways (see Berry, Giles, & Williams, 1999). Be- 
sides modeling violent behavior and weakening inhibitions 
about violence, it numbs or desensitizes reactions to violence 
and decreases empathy for victims. Similar negative effects 
occur as a consequence of playing violent video games 
(Anderson & Bushman, 2001). Although the evidence for the 
danger of viewing violence is increasing, warnings against 
viewing such violence appear to be decreasing in the U.S. 
mass media (Bushman & Anderson, 2001). 


Interstate Warfare 


Richardson (1960a) began the statistical study of war when 
his concern for human life led him to define war in terms of 
human deaths rather than in terms of declarations or histori- 
cal significance. Setting 1,000 deaths as a lower limit and the 
log of deaths as a scale of magnitude (thus 1,000 deaths is a 
magnitude-3 war), he argued that counting was the best anti- 
septic for prejudice and proceeded to count the wars between 
1820 and 1945. He showed that many magnitude-6 wars 
(about a million deaths) were not remembered because they 
lacked political significance (e.g., the Taiping rebellion, the 
war in La Plata), and many of the 188 magnitude-3 wars were 
completely overlooked. He also established that the nation 
responsible for the most wars keeps changing in different 
periods so that focusing on containing any given aggressor 
cannot prevent war. 

Richardson proposed a sort of molecular model of war 
that imagined nations as bumping up against each other, with 
some of these conflicts resulting in war. Those with more bor- 
ders and energy have a greater chance of collisions. In accord 
with such a model, he shows that the number of wars that a 
nation fights correlates highly with its number of borders (he 
includes colonies in this count), and the number of wars 
breaking out in any given year follows a Poisson (chance) 
distribution. Factors we might think of as lessening the prob- 
ability of war, such as common language or religion, do not. 
What does lessen the probability of war between peoples is 
the number of years in which they live under a common gov- 


ernment. The probability of war decreases geometrically with 
each decade of common government. 

Which disputes result in war? Vasquez and Henehan 
(2001) showed that the probability of war is greater when 
there is a territorial dispute than when there is a policy dis- 
pute. Wallace (1979), who investigated 99 serious interna- 
tional disputes occurring between 1815 and 1965, reported 
that 26 resulted in war and that in 23 of these cases the war 
was preceded by an arms race. There were only five cases 
where an arms race did not lead to war, and we are probably 
fortunate that the arms race between the United States and the 
Soviet Union proved to be in this category. 

When arms races occur, there is instability in the balance 
of power, and the race accelerates exponentially in a way that 
Richardson (1960b) can describe with a simple pair of differ- 
ential equations. Basically, this elegant mathematical model 
reveals that races occur when a pair of nations are more afraid 
of each other than they are concerned with the cost to their 
own economy. A more complex model dealing with more 
than two nations and chaotic transitions is described by 
Behrens, Feichtinger, and Prskawetz (1997). Richardson’s 
model stresses deterministic factors, and Rapoport (1960) 
has pointed out how such an approach may be contrasted 
with an approach that involves strategic gaming over inter- 
ests or struggles involving different ideologies. Applying this 
latter approach, R. Smith, Sola, and Spagnolo (2000) demon- 
strated that in the conflict between Greece and Turkey, the 
amount each nation spends on arms does not depend on what 
the other is spending but is a function of bureaucratic and 
political inertia. Current spending by the United States also 
appears to evidence this pattern. 

Subsequent to Richardson’s work, the Stockholm Interna- 
tional Peace Research Institute has kept an ongoing account of 
wars that focuses on number of deaths. Their statistics reveal 
millions of largely overlooked deaths, with 10 magnitude- 
6 wars that have occurred since 1945 and about 30 wars going 
on in any given year. Over recent years the number of inter- 
state wars has decreased while the number of intrastate (civil) 
wars has increased. There have also been an increasingly large 
percentage of civilian deaths, which now account for about 
85% of the casualties. One ray of hope for decreasing inter- 
state conflict is offered by statistics that demonstrate that 
fewer militarized disputes occur when nations have important 
trade relations and when nations are democracies (Oneal, 
Oneal, Maoz, & Russett, 1996). Under such conditions war is 
not in the interest of those in power. However, these statistics 
do not consider support for covert interference, as in the U.S. 
involvement in overthrowing the democratically elected gov- 
ernments of Guatemala and Chile. Nor do they consider that 


the number of reasonably democratic nations is not high and 
that the conditions for democracy may be difficult to achieve 
(see de Rivera, in press). Nevertheless, we are reminded that 
interstate war is not inevitable. 


Civil War, Ethnic Violence, and Genocide 


Although civil wars sometimes simply reflect a struggle for 
power within a dominant group, they often involve ethnic 
group interests or ideological differences that become in- 
volved in a struggle for power. Their complexity is nicely 
captured in a series of case studies that deal with the wars in 
Central America, Ireland, Israel, Rwanda, and Sri Lanka from 
a psychological point of view. In Ireland (Cairns & Darby, 
1998), Sri Lanka (J. D. Rogers, Spencer, & Uyangoda, 1998), 
and many other nations, important ethnic groups have both 
inflicted and experienced the sort of prejudicial treatment 
so aptly demonstrated in Tajfel’s (1982) studies. Niens and 
Cairns (2001) applied social identity theory to the under- 
standing of ethnic conflict and have concluded that overcom- 
ing the stereotypes that are involved requires contact 
situations in which people’s group memberships are more 
rather than less emphasized. 

In considering these disputes it is important to note that 
there are often many people within each group who are will- 
ing to treat the other group fairly. However, extremists within 
each group oppose any efforts to take the interests of the 
other group into consideration. Rather than creating a com- 
mon intergroup political front, the moderates appear con- 
strained by their intragroup identity with their extremists so 
that rational compromises that would be in the interest of 
both groups are impossible to achieve. Some methods that 
may be helpful in resolving these conflicts will be discussed 
when techniques of negotiation are considered. 

Gurr (1996) identified 268 politically significant national 
and minority peoples (about 18% of the world’s population), 
three fourths of whom experienced political disadvantages. 
Almost 100 of these groups participated in violent conflict 
between 1945 and 1990. He argued that it is important to rec- 
ognize the grievances of minorities, the fact that cultural 
identities are important aspects of human being, and that it is 
not always possible to assimilate a minority culture. A critical 
problem is posed by the fact that conflicting parties often find 
it difficult to create a common historical narrative. An exam- 
ple is furnished by Rouhana and Bar-Tal’s (1998) balanced 
account of the Israeli-Palestinian conflict. 

Gottlieb (1993) argued that some conflicts may be man- 
aged by allowing people to have two identities, a formal na- 
tional identity and a state identity. The national identity could 
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control some language use in education, local law, and mar- 
riage rites. The state identity could control currency, border 
defense, and other factors necessary for a nation-state to sur- 
vive in a global economy. Psychologically, such an arrange- 
ment makes sense because, as Brewer (1999) notes, it is quite 
possible to have a positive in-group identity that is indepen- 
dent of negative attitudes toward out-group members. Such 
in-group identities may be less threatened when clear bound- 
aries between groups are recognized within the bounds of a 
common state. 

In some nation-states, such as in Turkey in 1915, Germany 
in the 1930s, Iraq in the 1980s, and Rwanda in the 1990s, po- 
litical decisions lead to genocide. Most students of genocide 
argue that genocides are not the inevitable results of ethnic 
differences. They point to the fact that people often have 
lived together peacefully for years, often with a considerable 
amount of intermarriage. The genocide occurs when leaders 
emphasize group identity, often in order to consolidate power 
or mobilize support in a power struggle. Yet the genocide is 
only possible when rapidly arousing fear and hatred. In the 
case of Rwanda, D. N. Smith (1998) argued that official hate 
propaganda combined with projective sexual envy, a belief 
in sorcery, authoritarianism, and a breakdown in traditional 
restraints and opportunities. 

Staub (1989) examined a number of genocides in an 
attempt to conceptualize the common processes involved. He 
finds that they occur under circumstances of material depri- 
vation and social disorganization that frustrate basic human 
needs. In such circumstances, individuals feel helpless and 
increasingly rely on their group membership. The seeds of 
genocide are sown if the group develops a destructive ideol- 
ogy in which an enemy group is perceived to stand in the way 
of the fulfillment of a hopeful vision. The conditions for the 
genocide evolve as violence begins to occur and is justified 
by an increasing devaluation of the enemy group, a devalua- 
tion that may easily be mobilized for political purposes. 
Although Staub emphasized that genocide is the outcome of 
normal group processes, he noted that there appear to be cul- 
tural preconditions. These include prejudices that become 
part of a cultural background, an ideology of antagonism, and 
the lack of a pluralistic culture. In many cases there also 
appears to be a particularly strong respect for authority that 
makes it difficult to resist immoral orders and may contribute 
to the threat and anxiety experienced when authority is 
unable to fulfill basic needs. 

Once civil war has occurred, processes of reconciliation 
must restore the fabric of the society. The difficulties are im- 
mense, requiring a balance between needs for justice, the sav- 
ing of face, and the support of sources of power that may be 
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implicated by revelations of human rights abuses. Above all, 
as Lederach (1997) observed, the relationship between 
groups must change so that out-group members are no longer 
excluded from one’s moral framework, and this must occur at 
the grassroots level as well as at the level of top leadership. 
Lederach stressed the need for years of work in rebuilding 
trust in teams from different strata of the society. He showed 
that such reconciliation requires the assistance of third parties 
who can accompany disputants with an attitude of humility as 
they, both individuals and communities, wrestle with the 
problems of combining truth, justice, and mercy. 
Governmental initiative is often required, and certainly 
the most successful effort to date has been the South African 
government’s establishment of its Truth and Reconciliation 
Commission. Rather than attempting to punish those respon- 
sible for the torture and murders that occurred during the 
maintenance of apartheid, the commission was charged with 
establishing what happened—making known the fate of vic- 
tims and providing them with the opportunity to relate their 
accounts and achieve some measure of reparation, facilitat- 
ing the amnesty of offenders who made full disclosure, and 
recommending measures to prevent future violations (de la 
Rey, 2001). Although justice was not achieved in the sense of 
adequate reparations, of punishment of the guilty, or even of 
an adequate admission of guilt or request for forgiveness, the 
public hearings held by the commission provided a forum 
that allowed a public acknowledgment of what had happened 
and the establishment of a common moral framework. In 
contrast to the situation in Argentina, where there is still no 
public recognition for the abuses under the military dictator- 
ship or condemnation of those involved in torture and disap- 
pearances, the South African public can speak of what 
occurred and move on with a new public identity and history. 


Terrorism 


Terrorism may be distinguished from the guerilla tactics of 
rebels who are fighting a military opponent within their own 
territory. Terrorism involves random attacks on civilians as a 
means of gaining political ends and has been used by both 
states and revolutionaries (J. R. White, 1998). In the former 
case, a government that is engaged in a war attempts to 
destroy its opponent’s will to fight, or a despotic government 
maintains its power by creating an emotional climate of ter- 
ror that prevents the organization needed for political opposi- 
tion (de Rivera, 1992). In the latter case, groups without 
access to political power use terror to publicize their griev- 
ances, extort concessions, or overthrow a regime that is expe- 
rienced as repressive. All cases involve the training for 
aggression and moral desensitization described in the section 


on aggression. However, terrorism is situated in historical 
circumstances that have interesting and largely unexplored 
psychological aspects. 

An example is provided by the September 11, 2001, attack 
on the United States. Most of the terrorists were from Saudi 
Arabia. Although the government is repressive in that there 
are no ways to express discontent, the United States supports 
the regime in exchange for access to oil. The alternative 
to the king Monarchy would probably be an Islamic state 
rather than a secular democracy. Such a state is fundamen- 
tally religious and is conceived hierarchically rather than de- 
mocratically. Vatikiotis (1986) noted that it is not based on 
the skepticism, experimentation, and tolerance essential to 
pluralistic politics. It is based on a different psychology, and 
its stability would require the cultivation of a different set of 
emotional relationships and customs (de Rivera, in press). 
Hence, we are dealing with the problems of psychological 
identity and the ambiguous role of religion that will be con- 
sidered when we discuss the nature of evil. An examination 
of past attempts to deal with revolutionary terrorism suggests 
that the more successful have involved meeting the underly- 
ing needs that fuel the terrorism, as well as the suppression of 
terrorist elements. 


Structural Violence 


The concept of structural violence has been articulated by 
Galtung (1969, 1975/1980, 1996) to capture how economic 
and political structures may place constraints on the human 
potential. It sees violence as present when humans are dimin- 
ished and points to the fact that this occurs when social struc- 
tures prevent the meeting of human needs. Galtung pointed 
out that modern society is organized hierarchically and that 
those on top often use their position in ways that exploit those 
below, preventing them from having the resources they need. 
The top dogs are in control of resource distribution, and their 
decisions determine who has access to education, health care, 
and good jobs. Further, he argued that the top dogs maintain 
their power by a series of devices that work against the 
underdogs organizing a resistance. 

One measure of structural violence is furnished by the 
human poverty index (HPI; United Nations Development Pro- 
gramme, 1999). This index uses five variables that reflect the 
loss of potentials that could be resolved by public policies. 
These are the percentage of the population dying before age 40, 
the percentage of underweight children (under age 5), the per- 
centages of the population without access to potable water and 
without health care, and the percentage of illiterate adults. 

It is important to realize that the hierarchies of power and 
privilege that exist within each society are connected to those 


in other societies in ways that support one another. The top 
dogs in a poor nation are often quite wealthy and well con- 
nected to the top dogs in other nations, so they are positioned 
to use aid in ways that maintain their power. Although 
Galtung does not deny that domestic problems may generate 
international conflict, he stresses that many domestic prob- 
lems are exacerbated by the policies of exploitation of the 
elites in powerful countries. The entire system—the hierar- 
chies and the connections between them—completely masks 
the responsibility for the terrible violence that it occasions. 
For specific examples from Sub-Saharan Africa see Nathan 
(2000) and Tandon (2000). 

While we have discussed both direct violence and the 
structural violence that can be attributed to greed and the fear 
of losing power, a considerable amount of violence on both 
the personal and state level is motivated by what can only be 
considered as “good” motives. As Butigan (1999, p. 13) 
pointed out, “Violence is often motivated by fear, unre- 
strained anger, or greed to increase domination or power over 
others. It can also be motivated by a desire for justice in the 
face of injustice: a longing to put things right, to overcome an 
imbalance of power, or end victimization or oppression.” 
This fact requires us to look at the nature of evil. 


EVIL 


While aggression and violence are largely matters of fact, 
evil involves a moral judgment. We must consider how it is 
conceptualized and how religions attempt to contain it. 


Conceptions of Evil and Its Experience 


What is meant by evil? Berkowitz (1999) argued that it 
should be distinguished from mundane badness and that there 
is a commonly shared prototype for evil. This prototype re- 
flects action that not only is morally wrong but also reveals 
an excessive departure from social norms. The judgment of 
evil has to do with the helplessness of victims, the responsi- 
bility of the perpetrator, and the imbalance between the great 
wrong that is done for a relatively small gain. 

Staub (1999) argued that a conscious intention to destroy 
is not a necessary aspect of evil. Rather, the word evil is ap- 
propriately used to categorize the repetition of intensely 
harmful actions that are not commensurate with instigating 
conditions. He recognizes that the term communicates horror, 
and although he is opposed to romanticizing evil as mythic 
and incomprehensible, he believes that the concept of evil 
may be a useful way to mobilize prosocial group norms. As 
an example of evil and the need for the concept, he discusses 
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the evil involved in bystanders who allow genocides to occur. 
Stohl (1987) showed that nations are typically bystanders, 
and this is reflected in the minimal news devoted to accounts 
of genocidal actions. However, the judgment that genocide is 
evil is reflected by the development of an international norm 
against genocide that may eventually be enforced by the 
establishment of a permanent international court. 

Without the concept of evil it might be easy for people to 
avoid making judgments that need to be made. In this regard, 
Miller, Gordon, and Buddie (1999) have evinced concern that 
situational explanations of criminal actions may result in per- 
sons’ condoning such actions. They demonstrate that when 
persons make judgments after situational explanations, they 
have less unfavorable attitudes and punitive responses to- 
ward the perpetrator. However, they do not show that the 
criminal action is condoned, and many would argue that the 
action rather than the person should be considered evil. To 
consider a person or group evil may evade an examination of 
the situational conditions. 

Although psychologists have considered the concept of 
evil, they have not yet addressed evil as an experience. The 
concept of evil implies an objective judgment; the evil is 
experienced as “real.” Of course, one may argue that evil 
and all values are really subjective and relative—simply 
what a person likes or wants. Value in this view is reduced 
to what someone is willing to pay. Yet we continue to expe- 
rience value as existing apart from ourselves and as differ- 
ent from mere taste. As F. Heider (1958) asserted, value 
differs from what we want. It is characterized by what an 
objective order wants. We experience goodness and evil as 
objective in nature, as existing apart from our judgment of 
them, although we may recognize that our judgment may be 
faulty and may change with time. In the latter regard, Rozin, 
Markwith, and Stoess (1997) pointed out that smoking has 
recently become moralized. That is, smoking is now re- 
garded by many as bad in a moral sense, an object of dis- 
gust. Note that the process of moralization involves 
emotional responses that help constitute the very value that 
is perceived as objective. 

Evil exists in relation to what is Good, and the latter is 
what is necessary for life, for fertility, health, and success in 
getting food and outwitting enemies. In any society that is 
not completely secularized, Goodness exists because hu- 
mans exist and could not exist without it. Evil is more prob- 
lematic. Although some persons and religions regard Evil as 
essential and in primary opposition to the Good, others view 
it as secondary and existing because of the actions of hu- 
mans; still others view it as illusory, as existing only as an 
object of our perception. Likewise, the relationship between 
Good and Evil may be seen in different ways. Evil may be 
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viewed as a malevolent force or as ignorance, as repellant to 
Goodness or as the simple absence of Goodness. Thus, it 
may be symbolized as an active Devil or as darkness, as de- 
structive choice or as the obstacles between humans and 
Goodness. Ricoeur (1967) pointed out that humans have 
symbolized evil in three quite different ways that reflect dif- 
ferent experiences and conceptualizations. As stain, evil is 
contagious, and one may unintentionally become contami- 
nated by its impurity. As sin, evil is a ruptured relationship 
with God, a departure from a path or missing of the mark 
that may be affected by the actions of one’s people. As guilt, 
evil is a personal responsibility that occurs because of one’s 
intentions. In all cases, one is removed from Goodness and 
must be relived of the stain, sin, or guilt in order to reconnect 
with Goodness. 

K. G. Heider (1991) argued that in some cultures the basic 
moral conflict of life is more between order and disorder than 
between good and evil. Thus, in Indonesian and Japanese 
films the dominant concern appears to be the restoration of 
order rather than the triumph of the good over the bad. This 
may be related to a cultural tendency to see persons as more 
socially embedded than individually autonomous. The “vil- 
lain” is not inherently bad but is more an agent of disorder 
who is easily welcomed back into the fold once order is re- 
stored. However, the restoration of order may involve the 
recognition of evil and its removal. Thus, Wessells and 
Monteiro (2001) described how child soldiers who have 
engaged in unjustified killings may participate in purification 
ceremonies to be reintegrated into the community. 

Every society, and certainly our own, appears to have 
myths about evil, perhaps because we humans seem to need 
to give meaning to our suffering. In Western society, Ricoeur 
(1967) distinguished four such myths that continue to influ- 
ence our thinking: the Greek tragic and Platonic myths and 
the Babylonian and Judaic creation myths. Each views the 
source of evil quite differently. In the Babylonian myth, the 
world is created in the process of a power struggle between 
the gods; violence is used to create the order that prevents the 
agony of chaos; and humans must serve the state in order to 
prevent chaos. By contrast, in the Judaic myth a God peace- 
fully creates an essentially good world in which evil enters 
when people do what they are not supposed to do. Both of 
these myths are operative in our contemporary society. On the 
one hand, Wink (1992) has pointed out that much of the vio- 
lence portrayed on TV exemplifies the Babylonian myth. That 
is, there is a power struggle between bad chaotic forces and 
good order, and the good guys use violence to restore order. 
Likewise, the strategic policy of the (putatively Christian) 
United States is actually based on the use of violence to main- 
tain peace. On the other hand, the nation as a whole still 


subscribes to Judeo-Christian ideals of justice and believes in 
the freedom to choose between good and evil. 


The Ambiguous Role of Religion 


Taking Otto’s (1923) idea of the Holy as a starting point, 
Appleby (2000) argued that the sacred can either be the locus 
of violence as a sacred duty or a militant nonviolence dedi- 
cated to peace. Defining religion as a response to a reality that 
is perceived as sacred, he showed that it gives the authority to 
kill or to heal and argued that religious leadership determines 
which course is taken, appealing to religious identity either to 
exploit or to transcend ethnic animosities. On the one hand, 
Appleby showed how religion was an important element in 
the destruction of Bosnia and the development of Islamic ter- 
rorism. He distinguished fundamentalism as a response to sec- 
ularization (describing the terrorist violence that developed in 
2 of 10 such movements) from the ethnonationalistic use of 
religion (and often violence) to unify a state and discussed 
how both differ from cult violence. On the other hand, he gave 
concrete examples of dozens of Gandhi-like figures who have 
worked for peace and discussed the role that religious organi- 
zations have played in peace meditations. He convincingly 
demonstrated that religion is always a construction of a sacred 
past and has the potential to inculcate nonviolence as the reli- 
gious norm. He argued that religious education should be de- 
voted to this end and supported with the technical skills and 
material resources it needs to organize peace. 

The choice between good and evil is central to Fromm’s 
(1955, 1973) analysis of evil. He pointed out that human be- 
ings, as distinct from all other animals, are aware of them- 
selves as apart from nature and aware of their ultimate death. 
This existential dilemma creates common needs that must be 
met. These include needs for an object of devotion and for 
affective ties, unity, effectiveness and stimulation. Each 
can be met in either life-enhancing or life-destroying ways. 
An object of devotion can be an ideal or an idol; affective ties 
can be of love or sadomasochism; unity can be achieved by 
practicing an open religion or by losing the self in a trance 
state or a social role, effectiveness by creating or destroying, 
stimulation by active or passive excitation. Fromm sees these 
choices as determining whether a society and individual will 
become good or evil. 

While Fromm emphasized the role of choice in determining 
how to meet basic needs, both Staub (1999) and Burton (1990) 
saw evil as stemming from the frustration of basic needs such 
as security, identity, connective ties to others, effectiveness, 
control, and autonomy. They believe that if persons cannot ful- 
fill these constructively, they will engage in destructive behav- 
ior. Such an analysis lies in our understanding of some of the 


conditions that promote destructive behavior and in encourag- 
ing those with power to consider the needs of others. However, 
the emphasis on need fulfillment appears to neglect the role of 
personal responsibility and the fact that a large amount of vio- 
lence stems from greed. It reflects a liberal view of basic 
human goodness (if only needs were met by the state) as op- 
posed to a conservative view that sees everyone as basically 
selfish (and needing the state to enforce law and order). 

The previous analyses begin with the needs of individuals. 
By contrast, Macmurray (1961) argued that individuals exist 
only in relationships with others. He sees these relations as 
composed of two strands: a love (caring) for the other and a 
fear (concern) for the self. Although both strands are always 
present, one always dominates. When a caring for the other 
dominates, the person is unified. However, any real or per- 
ceived hurt, betrayal, or abandonment causes the fear for the 
self to dominate, and when this occurs a person suffers dual- 
istic splits (mind from body, reason from emotion, the practi- 
cal from the ideal, the self from the other). At this point a 
person (or society) may focus either on individualism (“if the 
other doesn’t care for me, I’d better care for myself”) or on a 
conforming collectivism (“if Um good, then they will care 
for me”). Because people assume that others are similar, the 
former leads to a Hobbesian analysis (the need for a strong 
state to enforce contracts between basically selfish people), 
whereas the latter leads to Rousseau, Marx, and the idea that 
people are basically good and will agree about basic needs. 
However, Macmurray asserted that people must continually 
wrestle with the choice as to whether concern for the other or 
concern for the self will dominate action. In his view, self- 
development occurs only when acceptance, understanding, 
forbearance, and forgiveness lead to the restoration of the 
dominance of caring for the other. Then, a person’s unity is 
restored and, with it, the ability for genuine freedom and co- 
operation (see de Rivera, 1989). 

How may we relate aggression to values of good and evil? 
In an attempt to distinguish between a “good” aggressive au- 
dacity, necessary for human progress, and an aggression that 
intends to destroy, Kelly (1965) proposed that the latter in- 
volves hostility. Hostility occurs when there is a threat to a 
person’s belief system and the person extorts evidence in an 
attempt to maintain beliefs and the way that one is living 
one’s life in the face of contrary evidence. Examples include 
fundamentalist terrorists or the middle-class Germans who 
became Nazis. Kelly might assert that the latter were not 
simply frustrated by inflation. Rather, they saw their belief 
system—their commitment to the value of hard work and 
thrift—crumple as the savings from their hard work were 
wiped away by inflation (see Moore, 1978). For Kelly, the 
alternative to hostility is to allow the experience of tragedy. 
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It is this experience, rather than the certainty that one’s be- 
liefs are valid, that is the basis of hope. 

Kelly’s analysis is supported by aspects of Peck’s (1983) 
examination of the “group evil” involved in the MyLai mas- 
sacre. On one level, Peck pointed out that it is easier for 
groups to commit atrocities because of the diffusion of re- 
sponsibility and the normal narcissistic influences of group 
pride and out-group denigration. However, on a societal 
level, Peck argued that the group that killed innocent vil- 
lagers manifested a broader societal problem. The group 
contained men who had been rejected from the broader soci- 
ety to do the dirty work that others did not want to see. The 
war itself was an attempt to defend a narcissistic image of 
American perfection, and when the situation in Vietnam pre- 
sented evidence of the fallibility of the American worldview, 
the government was willing to destroy Vietnam rather than 
acknowledge this error. It may be noted that the research that 
was recommended to prevent future atrocities was rejected 
on the grounds that it might prove embarrassing. 

The unwillingness to admit the tragic is an aspect of 
refusing to acknowledge evil, and Macmurray (1944) argued 
that a major problem is posed by the fact that one may do what 
one ought to do and yet still be involved in evil. A “just war” 
may be necessary; but when thousands of innocents are killed, 
the war is still evil, and the morally correct action of partici- 
pating in the war does not absolve a person from having par- 
ticipated in that evil. Note that a person holding such a point of 
view is protected from the sort of dissonance reduction that is 
involved when a person hurts another and then justifies the ag- 
gression by devaluating the other. This suggests that public 
ceremonies of atonement might protect a society from becom- 
ing involved in any more evil than is necessary. Perhaps if 
Americans had the opportunity to mourn the deaths of all the 
Koreans and Chinese killed in the Korean War, there would 
have been less readiness to become involved in Vietnam or 
continuing sanctions against Iraq. In any case it seems desir- 
able to confront the evil that is within as well as without. Such 
a confrontation leads us to examine the possibility of peace. 


PEACE 


By peace we do not mean the “negative” peace that is the 
absence of war but a “positive” peace (Barash, 1991) that is 
the opposite of evil—not the absence of conflict but the reso- 
lution of conflict in creative rather than destructive ways. We 
may imagine different aspects of this peace: the personal 
peace of inner harmony and compassion, the communal 
peace that exists when social norms and institutions promote 
a concern for the welfare of others and a peaceful resolution 
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of conflict, and the peace that results from an environment 
that allows people to satisfy their basic needs. There are at 
least four different paths to peace: paths of strength, negotia- 
tion, justice, and personal transformation. Each may be 
viewed as involving types of aggression. 


Peace Through Strength 


It is said that the sword is the olive leaf’s brother, and it 
seems self-evident that weakness invites attack while 
strength discourages it. Bullies pick on the insecure; crimi- 
nals flourish in the absence of police; and history is filled 
with one people’s expanding at the expense of another. Few 
would argue against the idea that some sort of strength is nec- 
essary for peace, and some, like Sumner (1911), would argue 
that peace is attained only by the imposition of order that oc- 
curs when states use their strength to expand their dominion. 
However, there are some problems with conventional inter- 
pretations of this path or with relying on it to produce posi- 
tive or even negative peace. Empirically, Singer and Small’s 
(1979) statistics, examining 59 recent wars, fail to show a 
significant relationship between strength and the probability 
of being attacked. Consider three problems: 

First, it is not clear how much strength is sufficient to pro- 
vide a sense of security. Surveys repeatedly show that a ma- 
jority of the American public feels secure against foreign 
attack and favor nuclear disarmament (Kay, 1998), and in 
1997 U.S. military expenditures were 172% of all its possible 
enemies combined (Council for a Livable World Education 
Fund, 1998). However, the government continues to spend far 
more than appears necessary (Defense Monitor, 2000). In part, 
the excess funding is due to economic pressure from the mili- 
tary-industrial complex (Fogarty, 2000), the need to maintain 
a weapons industry, and the desire to export weapons to main- 
tain a favorable balance of trade. However, to a large extent the 
extraordinary funding seems driven by an underlying insecu- 
rity that was not present before the beginning of the Cold War. 

Second, if we assume that the weak will be attacked, the 
obvious converse is that the strong will expand and attack. 
Hence, those who build strength will become involved in 
using power to impose their will. This appears to be true of 
the United States. 

Third, when two powers come into conflict with each 
other, they each build strength so that the other will not dom- 
inate them, and the resulting conflict is simply more deadly. 
History gives us Athens versus Sparta, Rome versus 
Carthage, the United States versus the Soviet Union, and 
dozens of other examples, and we saw earlier that structural 
changes in conflict spirals often have disastrous outcomes. In 
the future we may witness a race to dominate space weaponry. 


These problems have given rise to two quite different so- 
lutions: the development of nonviolent defense systems and 
the strengthening of the United Nations so that it could begin 
to function as a world government with an international 
police force. 


Nonviolent Defense 


Nonviolent defense may not be as impracticable as one might 
imagine. There are effective nonviolent self-defense forms 
such as aikido and tai chi, in which the defense maintains a 
calm center of gravity to take advantage of the momentum of 
an attack and the fact that the attacker is likely to be un- 
balanced. The defender gains control of the attack and turns it 
aside (Ueshiba, 1921). There are forms of community polic- 
ing in which the community prevents violence by maintain- 
ing civilized norms (J. Q. Wilson & Kelling, 1989), and 
Canada (1995) has called for the use of unarmed peace offi- 
cers trained and organized by local colleges. Finally, there 
are many examples of the successful use of nonviolent resis- 
tance against dictatorial governments. Sharp (1973) pub- 
lished the results of a historical survey that carefully 
examines the methods and dynamics of nonviolent action to 
influence political decisions. He gives specific examples of 
198 techniques that have been used, ranging from public 
assemblies and marches, through boycotts and strikes, to 
noncooperation, civil disobedience, and the establishment of 
alternative structures of government, including successful 
uses against the Russian and British empires, the Nazis, Latin 
American dictators, and the Soviet Union. Sharp’s (1990) 
pragmatic examination of when nonviolent defense has 
worked and what factors make such resistance possible 
distinguishes between situations conducive and noncon- 
ducive to nonviolent resistance. Relative power is not as im- 
portant as one might imagine. The contest is really one of 
wills, and a central factor is the cohesiveness of the nonvio- 
lent group and the ability to maintain communications so that 
tactics can be adapted to the changing situation. 

Although civilian defense may be an alternative to mili- 
tary might, it may be argued that any defense that is orga- 
nized by the state will be used to maintain structural violence. 
Citizens give the state a monopoly of violence so that it may 
maintain order and curb crime. And it may be argued that a 
democratically run state succeeds in having adequate police 
control and adequate control over its police. However, from 
an anarchist standpoint, states—at least nation-states based 
on centralized power—commit far more violence than their 
citizens do. Hence, Martin (1984) argued that working with 
state systems will never abolish war because states them- 
selves are the problem. His anarchist solution is to use 


grassroots strategies to build alternative institutions to the 
state and its existing bureaucracies. 


Developing the United Nations 


To some extent we already have the rudiments of a democra- 
tic world police force. The United Nations forces have been 
engaged in over 50 missions. These have included the moni- 
toring of elections, the provision of the international police 
presence needed after civil turmoil, the maintenance of buffer 
zones between former combatants, and armed interventions 
needed to prevent extensive civilian casualties. 

Clearly, the last case, armed interventions, is the most 
problematic form of intervention. Studies of the military in- 
terventions in northern Iraq, Somalia, Bosnia, Rwanda, and 
Haiti (Weiss, 1999) have attempted to assess the degree of the 
civilian costs incurred before intervention, the cost of mili- 
tary intervention, and the civilian benefits of the intervention. 
Weiss discussed the quandaries faced by those hoping to use 
military forces to achieve humanitarian assistance and rec- 
ommended careful “conflict impact assessment” before at- 
tempting to use military force in situations where a presence 
is not desired by both sides of the conflict. 

Although many problems are posed by military interven- 
tion once armed conflict has erupted, it may be argued that to 
have the possibility of military intervention may be helpful in 
influencing decisions in the early stages of a conflict that 
threatens to degenerate into military struggle. This is the 
position taken by Jentleson (2000) in his analysis of the possi- 
bilities of preventative diplomacy. He argued that the parties to 
a conflict are often driven to military action by the uncertainty 
of a situation in which the other side may strike first. In such 
situations, diplomacy—with the possibility of intervention 
and rewards—may be used to influence the calculus of 
whether to attack or negotiate. The participants in the volume 
edited by Jentleson present 10 cases where preventative diplo- 
macy either succeeded in averting potential disaster (as in the 
Baltics and North Korea) or missed opportunities (as in 
Chechnya and Yugoslavia). They discuss the use and misuse 
of intelligence; the strategy of using mixes of deterrents, in- 
ducements, and reassurances; and the necessity for fast action. 

Unfortunately, fast action is currently limited by the fact 
that there is no permanent UN military force so that each UN 
action requires the new recruitment of troops, equipment, and 
money from whatever nations are willing to donate (Holt, 
1995). It would be easy to create a small standing force, but 
the major powers are reluctant to set a precedent and begin an 
international force that could conceivably challenge their 
military preeminence. Given the fact that the United States 
has a veto power in the Security Council (which must concur 
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in the use of any UN forces), an interesting psychological 
problem is posed by why conservative representatives feel 
the need to maintain tight national control by blocking any 
permanent UN forces. This need for the maintenance of con- 
trol is manifested also in the reluctance to endorse a nuclear 
test ban treaty or an international criminal court for war 
crimes. 


Peace Through Negotiation 


Instead of regarding the other as an enemy, it is often possible 
to search for mutual gains, and trade has often been an alter- 
native to war. While horse trading has been known for mil- 
lennia, there have been a number of advances in the tactics 
and strategies of negotiation. One promising approach that 
has been advanced by Fisher and Ury (1981) is “principled” 
negotiation. Rather than either strongly maintaining a bar- 
gaining position or softly compromising in order to maintain 
a valued relationship, they argue that one should search for 
the interests that underlie the bargaining positions. The nego- 
tiator than attempts to create a solution that meets the inter- 
ests of both parties and searches for objective criteria to 
determine what is fair. Note that this approach uses aggres- 
sion in the sense of attempting to get what one wants and 
insisting on fairness. 

Although principled negotiation is a practical approach 
that can often be used, it assumes that the conflict to be nego- 
tiated is essentially a conflict about interests. However, some 
conflicts involve past wounds, different values, and the very 
identities of the parties to the conflict. This is often true when 
ethnic conflicts are involved. The Israelis and Palestinians, 
for example, do not simply have conflicting interests con- 
cerning security and sovereignty, but issues about the identity 
of the Jewish and Palestinian peoples as well. To deal with 
these sorts of conflicts, Burton (1990) advanced a form of 
“transformative” negotiation in which the negotiating 
process deals with the sharing of underlying needs and iden- 
tities as well as interests. Such negotiations require a deeper 
level of trust and, when successful, involve a transformation 
of identities so that definitions that reflect enmity or involve 
devaluations of the other are no longer aspects of identity. A 
discussion of such needs may be helpful in intractable con- 
flicts. For example, Cross and Rosenthal (1999) randomly 
paired 20 Jewish and 20 Arab students to discuss the dispute 
over the control of Jerusalem, and they contrasted different 
methods of negotiation. Participants who used a method in 
which they identified needs and fears about identity, recogni- 
tion, and security before they attempted to generate ideas for 
mutual satisfaction became less pessimistic about the conflict 
and showed a more positive attitude change toward the other. 
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In order to deal with the pain and shame of past injuries and 
the conflicts between identity needs, C. R. Rogers and Ryback 
(1984) and Kelman (1996) used problem-solving workshops 
in which people may share underlying pains, fears, and needs. 
In the approach used by Rogers and Ryback, a facilitator 
models the role of respectfully listening and accepting the 
initial hostility expressed by both parties. Without the accept- 
ing presence of the mediator, the hostility would be responded 
to defensively, but the authors write that after the mediator 
accepts the hostility, the underlying pain is expressed, and this 
is responded to sympathetically. This approach should be 
contrasted with the reframing of hostile statements in couples 
therapy, in order to avoid shame-hostility cycles, which is ad- 
vocated by Scheff (1999). 

Kelman’s workshops are carefully structured in ways that 
lead both parties to share their underlying needs. Since polit- 
ical leaders are usually hindered by the demands of their 
position and constituency, he attempted to work with opinion 
leaders and those who may come into political power in the 
future. These workshops are quite effective in getting partic- 
ipants to understand and empathize with opposing views. 
However, the participants are then confronted with the prob- 
lem of explaining their new tolerance to their compatriots in 
ways that avoid an accusation of being traitors. From my out- 
sider perspective it appears that these sorts of workshops 
need to occur between the liberal and conservative parties 
within the opposing sides of a conflict. It would be fascinat- 
ing to see the extent to which transformative negotiation 
could be used to arrive at creative solutions to the sort of clas- 
sic problems that have divided political parties. 

Differences in values are usually expressed in the rhetoric 
of political parties and typically are debated by having oppo- 
nents state their conflicting views and then rebut the views of 
their opponent as both attempt to create a rhetoric that will in- 
fluence third parties and capture their support. However, 
Rapoport (1960) has advocated another strategy, which he 
believes is more apt to produce creative solutions and mini- 
mize devaluation of the opponent. He suggested that each 
opponent should state the other’s point of view until the other 
agrees that it has been correctly presented. Then, rather than 
rebutting the other’s view, the opponent should create ways 
to agree with the other’s view, not by role playing the other 
side but by honestly finding points of agreement. Rapoport 
pointed out that any statement has a region of validity. Thus, 
if the other says, “11 plus 2 is 1,” a person may respond by 
agreeing to the extent that one is referring to clock time. Pre- 
liminary studies (de Rivera, 1968) have shown that the tech- 
nique is usable, and it would be interesting to see if it could 
be used to create acceptable public policies for divisive issues 
such as legalized abortion. 


When a negotiation between parties can be arranged, it is 
more successful than third-party mediation. Jackson (2000) 
studied 295 conflicts that occurred between 1945 and 1995 
and found 1,154 negotiation efforts, with 47% success (82% 
lasting more than 8 weeks), and 1,666 mediations, with 
39.4% success (51.7% lasting). Of course, mediation is prob- 
ably more often attempted when the level of hostility is high 
and interferes with negotiation, but it seems clear that third 
parties should first encourage direct negotiation. When hos- 
tility is high, there are innovative approaches to conflict man- 
agement that stress the use of third parties as go-betweens. 
Galtung and Tschudi (2001) argued that when emotions hin- 
der the ability of conflicting parties to dialogue with one an- 
other, it is often possible to create better dialogue with neutral 
conflict workers who can then work separately with the con- 
flicting parties to create a solution that transcends deep dif- 
ferences. Patai (1973) pointed out that in Arabic cultures a 
mutually respected third party may be used to request solu- 
tions that conflicting parties can grant out of generosity and 
respect, without appearing to give in to the other party with 
whom they are in conflict, and Pedersen (2001) reminded 
conflict workers that collectivistic cultures may manage con- 
flicts in ways that are substantially different from those 
favored in the West. 

Negotiation is increasingly being used to settle civil dis- 
putes, and in the future it may be used increasingly in crimi- 
nal cases. Zehr (1990) convincingly argued that many crimes 
rupture human relationships and that it is these relationships 
that need to be repaired. Currently, crime is viewed as con- 
trary to the state, and the state punishes an offender (who is 
made into a “criminal,” who often attempts to avoid respon- 
sibility by offering a defense) and largely ignores the victim. 
Zehr suggested that, as an alternative, the state ask the victim 
if he or she would like to meet the offender and see what the 
offender could do to restore the human relationship between 
them. Studies of trial programs of such “restorative justice” 
have found that about 50% of victims want to meet the per- 
son who wronged them, that it is usually possible to negotiate 
a way to restore the human relationship, and that in such 
cases there is much less recidivism. Justice has been attained 
nonviolently, by restoration rather than retribution. 


Peace Through Justice 


There are times when repressive forces are so strong that a 
completely unjust “peace” may exist for years. However, sit- 
uations inevitably change, and when opportunities arise, re- 
bellion and revolution occur, often killing some who opposed 
the injustice that existed. Yet the path of peace through justice 
is much more than an attempt to stave off violent revolution. 


It is an attempt to achieve a positive peace, a fundamentally 
just world. 

The conflict between those desiring justice and those re- 
sisting it could be achieved by negotiation. However, this can 
only occur if parties are willing to negotiate, and there are 
many situations in which people with property and power do 
not wish to negotiate, particularly when oppression and ex- 
ploitation are involved. Hence, the building of structural 
peace often requires the creation of social strain and disequi- 
libria (Montiel, 2001). Attempts at organization are often met 
with violence, and this may result in counter violence. Even 
when the oppressed succeed in using counter violence, it 
often happens that the success only replaces one system of 
exploitation with another. Knowing these facts and facing in- 
justice within the context of the British colonialism led 
Gandhi to create his method of nonviolent resistance. 

By nonviolence (literally, satyagrha or “truth power’) 
Gandhi (1983) did not mean either passivity or the use of so- 
cially acceptable nonviolent tactics to coerce his opponent to 
give in. He meant asserting the truth as one saw it while being 
open to the perceptions of opponents and their interests, treat- 
ing them with respect and attempting to convince them, and 
accepting suffering rather than inflicting it or giving in to in- 
justice. Inherent in his approach are the unity of means and 
end and the unity of all life. Gandhi’s approach is aggressive 
in the sense of asserting one’s will, and some have argued 
that his methods were coercive. However, Burrowes (1996, 
chap. 7) established that he always attempted to change the 
heart of his opponent and that any coercion that existed was a 
coercion for a negotiation that could satisfy the needs of both 
parties. Several of his nonviolent campaigns have been de- 
scribed and evaluated by Bondurant (1965); a general history 
of nonviolent methods and the dynamics of how they influ- 
ence political decisions has been presented by Sharp (1973); 
and the psychology of nonviolence, in both its positive and 
problematic aspects, has been discussed by Pelton (1974). A 
recent history of nonviolent social movements since 1970 
may be found in the volume by Zunes, Kurtz, and Asher 
(1999), and Sutherland and Meyer (2000) contrasted the role 
played by both nonviolence and violence in the struggle for 
freedom and social justice in Africa. 

Peace Brigades International (PBI) is an example of current 
nonviolence in practice. This nonhierarchical organization 
furnishes unarmed volunteers who accompany human rights 
workers who are committed to nonviolence but have received 
death threats because of their work. Working in a nonpartisan 
way with the government of the nation where atrocities are 
being committed, and backed by an international emergency 
response network that communicates with embassies through- 
out the world, PBI has been successful in the effort to open the 
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political space essential for democracy (see Mahony & 
Egeren, 1997). Building on such efforts, Hartsough and 
Duncan (2001) are attempting to organize a volunteer nonvio- 
lent global peace force that could be used in situations where 
unarmed peacekeepers could function as neutral observers. 


Peace Through Personal Transformation 


Although each of the previous paths toward peace have 
merit, it may be argued that correctly following any of them 
requires a sort of personal development that can only be 
called transformational. This is perhaps most evident in the 
person who commits to Gandhi’s path of nonviolent action 
and develops a willingness to suffer rather than inflict injury 
as he or she acts to further justice. However, it is also required 
in the negotiator who develops an ability to acknowledge 
shame and refuses to allow egoistic needs to interfere with 
the skillful conduct of a negotiation, or the practitioner of de- 
fense who remains centered and balanced in dealing with a 
situation of potential violence, the type of self required if we 
are to have a more peaceful world (de Rivera, 1989). Form- 
ing such a self constitutes another path toward peace that 
takes personal transformation as its means. Rather than fo- 
cusing on strength, negotiation, or justice, it emphasizes the 
development of an inner peacefulness that may spread out- 
ward to influence the conduct of others. An exemplar is the 
Vietnamese monk Thich Nhat Hanh (1991). 

Thich Nhat Hanh, who has helped thousands of refugees 
and has established a number of meditation centers, is com- 
mitted to living in the sacredness of each moment and devel- 
oping a compassion for all being. He and dozens of other 
activists who have published accounts, along with hundreds 
of less well known activists (see True, 1985), have dedicated 
their lives to a practical living of nonviolence that they 
assume will influence the people around them and gradually 
create an atmosphere of peace that will affect the communities 
in which they live and, eventually, national policy. Although 
there are numerous anecdotal accounts of the sorts of effects 
they have generated, there has been little systematic study 
of their influence. The personal transformations involved in 
the development of nonviolence may be examined in the con- 
text of character development or in uses of the imagination to 
promote the development of peaceful action. 


Character Development 


Although the public is acquainted only with a few well- 
known peace activists, there are many persons who have 
risked or devoted their lives to working for peace and justice, 
and one may ask how such commitment develops. Oliner and 
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Oliner’s (1988) inquiry into the background of people who 
risked helping Jews during the Holocaust reveals a family 
background that combined both warmth and caring within 
the family with the welcoming of people from different 
groups into the family. Both these factors seem important in 
Hannon’s (1990) study of a sample of 21 Boston peace ac- 
tivists. He found that many had some sort of religious social- 
ization that provided a moral basis that was challenged by 
radicalizing college experiences. He argued for an identity 
crisis that could in part be understood in terms of Erikson’s 
(1963) fifth stage, in which the adolescent seeks an ideology 
that can be affirmed by peers and that defines what is good 
and evil. However, in these activists the resolution of the cri- 
sis also involved a transition to Kohlberg’s (1973) postcon- 
ventional moral reasoning and Fowler’s (1981) transition 
from conventional to individualistic/reflective faith. This was 
often influenced by one or more adults who served as a sort 
of sponsor to the new identity, which usually also involved 
participation in a network of like-minded peers. In their study 
of 28 moral exemplars, Colby and Damon (1999) reported 
that the exemplars did not begin as exceptional people but be- 
came increasingly caring as their goals were transformed by 
their interactions with others. They found that the exemplars 
were characterized by an absence of conflict between selfish 
and moral goals. In accord with Macmurray’s (1961) concep- 
tualization, the absence of the more typical split between self 
and other coincided with a faith in the eventual triumph of 
goodness for humanity. 


The Use of Imagination 


Boulding (1988) pointed out that action is guided by a vision 
of the future and that many people lack a vision of what a 
peaceful world would be like. (In my own experience, students 
find it much easier to imagine alien abductions than a peaceful 
world.) Accordingly, she has experimented with workshops in 
which people are asked to imagine a future world that is peace- 
ful. She has found that such a world needs to be placed about 
30 years in the future so that it seems possible but not too re- 
mote. After imagining some of the details of a peaceful world, 
participants are asked to imagine the steps that enabled such a 
world to come into being and, finally, to come up with a plan 
for the steps they might personally take. 

Working from a neo-Jungian perspective, Watkins (1988) 
postulated that the imagination needed to work for peace is 
checked by a conflict with other aspects of the self. She asked 
persons to imagine the part of them that wants to work for 
peace and to construct a character to represent that part (Is it 
a man or a woman, rich or poor, how old, how dressed?). 
Similarly, persons create a character to represent the part of 
them that has other interests and things to do. Watkins postu- 


lated that it is the relationship between these characters that 
governs whether a person’s energy is available for peace 
work. Accordingly, she asked her subjects to imagine the two 
characters meeting each other and attempts to structure these 
meetings so that the two accept rather than reject one another. 
Preliminary evidence suggests that when persons are able to 
imagine a friendly meeting, they are more likely to engage in 
actions that promote peace. 

Macy (1983) used imagination in still other ways in the 
course of the workshops she has created to deal with the de- 
spair that she believes prevents many persons from taking 
action to stop the use of nuclear weapons. After exercises de- 
signed to help people feel and express pain and despair, she 
involved participants in empowering exercises. For example, 
persons may be asked to imagine themselves before they 
were born, looking at the earth and deciding to help. They ev- 
idently choose a particular time to be born, a nation and fam- 
ily to be born into, and a specific gender and personality so 
that they could act for peace. Next, they were asked to re- 
member why they made the choices they did. What are they 
here to do? Although Macy’s and Boulding’s workshops have 
clear immediate effects, we lack data on whether they affect 
long-term commitments. 


Developing Cultures of Peace 


Each of the four paths toward peace may be seen as ways to 
develop cultures of peace that could replace the cultures of 
violence that exist in many contemporary societies. Although 
such a goal is idealistic, it is not unrealistic. Peaceful cultures 
have existed in the past, and there are small peaceful cultures 
that exist today. An examination of such cultures reveals a 
number of interesting characteristics. Bonta’s (1993) anno- 
tated bibliography describes over 60 traditional peoples and 
contemporary subcultures. Although they differ in many 
ways, they all emphasize cooperative rather than competitive 
relationships, dislike power and downplay individual recog- 
nition and wealth, have many ways to prevent and resolve 
conflict, value group harmony over abstract concepts of jus- 
tice, and think of themselves as essentially peaceful. 

Ross (1993), who has contrasted the extent of conflict in a 
sample of 90 preindustrial societies, showed that the level of 
conflict is related to socialization practices. Cultures without 
much conflict tend to place a high value on children, are high 
in warmth and affection, and are low in male gender identity 
conflict. These psycho-cultural roots of peace are orthogonal 
with the way a society is structured, and Ross showed that the 
extent to which aggression is directed out at external targets, 
rather than expressed within the society, depends on the ex- 
tent to which there are strong cross-cutting interest ties within 
the society. 


Turning to modern societies, the Peace Forum "e (2000) so- 
phisticated index of the peacefulness of contemporary na- 
tions is based on a combination of measures of external and 
internal conflict and measures of domestic justice. The index 
reveals the relative peacefulness of the developed but small 
nations such as Denmark, the Netherlands, and Portugal, as 
contrasted with many of the less developed nations and the 
powerful permanent members of the Security Council (the 
United States ranks 51st among the 74 nations for which data 
are available). The people of these smaller nations may feel 
more secure because their life is more predictable. People can 
trust one another and their social institutions (Fogarty, 2000), 
and they may find it easier to accept the need for the mutual 
obligations and responsibilities stressed by Hearn (1997). 

Although powerful societies are often relatively violent, 
Boulding (2000) has described how there are always many 
peaceful elements mixed in with the violent components. In 
religion, for example, there is the idea of a holy war but also 
the image of the peaceful garden. And it may be noted that 
after the putatively Islamic terrorist attack on the United 
States, the president of Iran phoned the pope to discuss the 
importance of Christian-Muslim dialogue (Catholic Free 
Press, 2001). Boulding argued that these peaceful elements 
make it possible to conceive realistically of developing 
peaceful cultures in modern society. 

How might we conceptualize what such cultures could be 
like? One way is to consider the transformations that would 
be involved in moving from the culture of violence to which 
many of us have become accustomed to a culture of peace. 
Adams and True (1997) suggested that these transformations 
might be characterized as follows: 


1. The redefinition of power so that it was understood to 
involve joint problem solving and active nonviolence 
rather than the use of hierarchies that require violent 
domination. 


2. The mobilization of people and the attainment of solidar- 
ity by building relationships of understanding and trust 
between groups rather than having one group dominate 
another or by achieving solidarity by focusing on the 
defeat of a common enemy. 

3. The participation of all people in the decisions that affect 
their lives. 

4. The open sharing of information in the press and in civic 
society. 

5. The development and empowerment of the caring and 
nurturing qualities traditionally associated with the role of 
women. 

6. The development of a cooperative and sustainable (rather 
than exploitative) economies. 
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We may imagine a global culture of peace involving the 
previous transformations along with an environment in 
which armaments were controlled and human rights were en- 
sured. Such a culture has been advocated by 20 Nobel peace 
laureates and promoted by UNESCO. To assist this develop- 
ment, the General Assembly of the United Nations has 
launched a decade of initiatives to achieve a culture of peace 
and requested a progress report from the secretary general 
(see Adams, 2000). Current research is attempting to develop 
indicators for the eight aspects of such a culture so that it will 
be possible to assess progress toward its development. 

Of course, a global culture of peace both influences and is 
dependent upon the specific cultures of peace developed by 
different societies. Each nation has its own particular chal- 
lenges, and it seems clear that peace, like human rights, must 
be developed by a discourse between groups from within each 
society as theses groups dialogue with groups from without 
(An-Naim, 1992). The movement toward a culture of peace is 
the first social movement that includes nation states as well as 
people. However, progress toward its goal cannot depend on 
the initiative of those powerful states whose interest is in 
maintaining the status quo. Rather, the development of each 
of the components of cultures of peace will depend on the less 
powerful nations and on the hundreds of grassroots initiatives 
by nongovernmental organizations that are constructing the 
paths of peace described previously. Each of these paths, 
along with an understanding of aggression, violence, and evil, 
is critical to developing the aspects of peaceful culture. 

International arms control and the maintenance of human 
rights require some system of international security. Given 
the current state of human development, this security must 
rest on the strength of some international authority that can 
take aggressive action when it is required, but whose vio- 
lence is checked by a division of power and civilian control. 
Whenever that system of authority is reduced to the use of vi- 
olent means, this must be publicly acknowledged as an evil. 
Such an authority will develop only when the strengthening 
of emotional ties leads powerful nations to surrender their 
monopoly of violence. NATO and other regional forces are 
steps in this direction, and we may see a strengthening of UN 
police forces in an effort to control terrorism. 

The challenges to achieving a consensus about interna- 
tional norms on terrorism involve issues that must be aggres- 
sively negotiated. The path of negotiation, as well as an 
understanding of the structural changes that perpetrate con- 
flict, is also involved in attempts to increase democratic par- 
ticipation, the sharing of information, and intergroup trust. 
The latter rests on a mastery of transformative as well as prin- 
cipled negotiation. Such negotiation will be much easier if 
synergistic societal structures lead those who want power to 
meet the needs of those without it. 
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The transformation of hierarchies of power and the attain- 
ment of an equitable and sustainable economy require the de- 
velopment of justice by nonviolent action. The evil of 
structural violence can only be overcome by methods that 
employ the creative use of aggression pioneered by Gandhi. 
This will require a learning of a different set of scripts and 
myths, heroes, and heroines who overcome negative emo- 
tions and moral disengagement, as well as the development 
of norms for intervening when violence occurs. 

Gender equality and the development of the nurturance 
and caring that is required for domestic and civil peace will 
require personal transformations. This path will also be 
needed to develop the sense of security that constrains the de- 
sire for power of those in authority, to restrict the egoism that 
can hinder negotiation, and to develop the compassionate 
nonviolence needed to attain justice. These personal transfor- 
mation need not depend solely on individual efforts. If a cul- 
ture of peace develops, there will be ceremonies that 
remember all of the victims of war, a honoring of tragedy will 
replace claims of goodness, and signs will ask God to bless 
the world. These will help develop the sorts of persons re- 
quired by the culture. 
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PERSONALITY IN POLITICAL PSYCHOLOGY 


Political psychology “has a long past, but as an organized 
discipline, it has a short history,” wrote William F. Stone in 
The Psychology of Politics (Stone & Schaffner, 1988, p. v). 
Niccolò Machiavelli’s political treatise, The Prince 
(1513/1995), an early precursor of the field, has modern-day 
echoes in Richard Christie and Florence Geis’s Studies in 
Machiavellianism (1970). The formal establishment of polit- 
ical psychology as an interdisciplinary scholarly endeavor 
was anticipated by notable precursors in the twentieth century 
with a focus on personality, among them Graham Wallas’s 
Human Nature in Politics (1908); Harold Lasswell’s Psy- 
chopathology and Politics (1930) and Power and Personality 
(1948); Hans Eysenck’s The Psychology of Politics (1954); 
Fred Greenstein’s Personality and Politics (1969); and the 
Handbook of Political Psychology (1973) edited by Jeanne 
Knutson, who founded the International Society of Political 
Psychology in 1978. 

The purpose of this chapter is to sketch the rich history of 
personality in political psychology, to take stock of the cur- 
rent state of personality-in-politics inquiry, and to map out 
new directions for this emerging application of personality 
theory informed by the rich possibilities of contextually adja- 
cent scientific fields such as evolution, of which Theodore 
Millon wrote in the opening chapter of this volume. 


THE EMERGENCE OF PERSONALITY INQUIRY 
IN POLITICAL PSYCHOLOGY 


In the present chapter, the terms personality and politics are 
employed in Greenstein’s (1992) narrowly construed sense. 
Politics, by this definition, “refers to the politics most often 
studied by political scientists—that of civil government and 
of the extra-governmental processes that more or less directly 
impinge upon government, such as political parties” and 
campaigns. Personality, as narrowly construed in political 
psychology, “excludes political attitudes and opinions . . . and 
applies only to nonpolitical personal differences” (p. 107). 


Origins of Personality-in-Politics Inquiry 


Knutson’s 1973 Handbook, most notably the chapter “From 
Where and Where To?” by James Davies, defined the field at 
the time of its publication (Stone & Schaffner, 1988, p. v). 
Davies (1973) credits political scientist Charles Merriam of 
the University of Chicago with stimulating “the first notable 
liaisons between psychology and political science” (p. 18) in 
the 1920s and 1930s. Though Merriam did not personally 


exploit the fruitful possibilities he saw for a productive union 
of the two disciplines, his “intellectual progeny,” Harold 
Lasswell, “was the first to enter boldly into the psychological 
house of ill repute, establish a liaison, and sire a set of ideas 
and influences of great vitality” (p. 18). 

Machiavelli’s famous treatise serves as testimony that, 
from the beginning, the study of personality in politics consti- 
tuted an integral part of political-psychological inquiry. In the 
modern era, the tradition dates back to Sigmund Freud, who 
collaborated with William Bullitt on a psychological study of 
U.S. president Woodrow Wilson (Freud & Bullitt, 1967). 


Types of Personality-in-Politics Inquiry 


In examining the state of the personality-in-politics litera- 
ture, Greenstein (1969) proposed three types of personality- 
in-politics inquiry: individual, typological, and aggregate. 

Individual inquiry (Greenstein, 1969, pp. 63-93), which is 
idiographic in orientation, involves single-case psychological 
analyses of individual political actors. Although the single- 
case literature historically comprised mostly psychological 
biographies of public figures, such as Alexander and Juliette 
George’s Woodrow Wilson and Colonel House (1956) and 
Erik Erikson’s Gandhi's Truth (1969), it also encompassed 
in-depth studies of members of the general population, such 
as Robert Lane’s Political Ideology (1962). With increasing 
specialization in political psychology since the 1960s, the 
focus has shifted progressively to the psychological examina- 
tion of political leaders, while single-case studies of ordinary 
citizens have become increasingly peripheral to the main 
focus of contemporary political personality research. 

Typological inquiry (Greenstein, 1969, pp. 94-119), 
which is nomothetic in orientation, concerns multicase analy- 
ses of political actors. This line of inquiry encompasses the 
main body of work in political personality, including the in- 
fluential work of Harold Lasswell (1930, 1948), James David 
Barber (1965, 1972/1992), Margaret Hermann (1974, 1980, 
1986, 1987), and David Winter (1987, 1998) with respect to 
high-level political leaders; however, part of this literature 
focuses more on followers (i.e., mass politics) than on leaders 
De, elite politics)—for example, Theodor Adorno, Else 
Frenkel-Brunswik, Daniel Levinson, and Nevitt Sanford’s 
classic The Authoritarian Personality (1950) and Milton 
Rokeach’s The Open and Closed Mind (1960). Greenstein 
(1992) has submitted that typological study “is of potentially 
great importance: if political actors fall into types with known 
characteristics and propensities, the laborious task of analyz- 
ing them de novo can be obviated, and uncertainty is reduced 
about how they will perform in particular circumstances” 
(p. 120). 


Aggregate inquiry (Greenstein, 1969, pp. 120-140) in- 
cludes a large and diverse body of work on national charac- 
ter, conflict among nations, behavior in groups, and global 
psychologizing about humanity and society (pp. 15-16). 
Greenstein (1992) has written that the impact of mass publics 
on politics, except for elections and drastic shifts in public 
opinion, “is partial and often elusive,” in contrast to the 
political impact of leaders, which tends to be “direct, readily 
evident, and potentially momentous in its repercussions” 
(p. 122). 

In his review of “Personality and Politics” in the Hand- 
book of Personality (Pervin, 1990), Dean Keith Simonton 
(1990) observed that the psychometric examination of politi- 
cal leaders represents the leading edge of current personality- 
in-politics research (p. 671). Moreover, by 1990 the dominant 
paradigm in the psychological examination of leaders had 
undergone a shift from the earlier preponderance of qualita- 
tive, idiographic, psychobiographic analysis, to more quanti- 
tative and nomothetic methods—in other words, Greenstein’s 
(1969) typological inquiry. Simonton’s assessment is as 
valid now as it was more than a decade ago. Contemporary 
personality-in-politics inquiry focuses almost exclusively on 
the psychological examination of high-level political leaders 
and the impact of personal characteristics on leadership per- 
formance and policy orientation. 

Its other principal avenue of inquiry, the study of ordinary 
citizens, has retreated from the political personality landscape, 
although it left a legacy of momentous works such as Adorno 
et al. (1950), Rokeach (1960), and others. As Simonton (1990) 
has noted, “the heyday of personality studies conducted on the 
typical citizen is past; the personality traits germane to citizen 
ideology and candidate preferences have been inventoried 
many times” (p. 671). This trend represents a distinct shift 
from the personality-and-culture era of the 1940s and 1950s 
(McGuire, 1993), in which psychobiography, studies of 
national character, and research involving the authoritarian 
personality syndrome flourished (Levin, 2000, p. 605). In this 
regard, Greenstein (1992) pointed to “the vexed post-World 
War II national character literature in which often ill- 
documented ethnographic reports and cultural artifacts... 
were used to draw sweeping conclusions about modal national 
character traits,” with the result that by the 1950s, “there was 
broad scholarly consensus that it is inappropriate simply to 
attribute psychological characteristics to mass populations on 
the basis of anecdotal or indirect evidence” (p. 122). Accord- 
ingly, political personality inquiry became more leadership 
oriented in emphasis, with the study of followers (or mass 
publics) in the domain of political psychology increasingly 
shifting to cognate areas such as political socialization, 
political attitudes, prejudice and intergroup conflict, political 
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participation, party identification, voting behavior, and public 
opinion, which could be studied more systematically than the 
impalpable notion of national character. 


THE EVOLUTION OF PERSONALITY INQUIRY 
IN POLITICAL PSYCHOLOGY 


Political psychology, as much as any social-scientific en- 
deavor, has evolved in sociohistoric context. Accordingly, the 
evolution of personality-in-politics inquiry in the second half 
of the twentieth century can be viewed against the backdrop 
of three defining events: the legacy of the Nazi Holocaust and 
World War II; the Cold War and the threat of nuclear annihi- 
lation; and the collapse of communism in Central and Eastern 
Europe and the Soviet Union, with its attendant new world 
order. 


The Postwar Era 


The rise of Hitler and the Nazi Holocaust stimulated person- 
ality research in the areas of authoritarianism, belief systems, 
and ideology, as represented in the work of Adorno et al. 
(1950) and Rokeach (1960), noted previously—precisely the 
historical juncture that in the domain of social psychology 
stimulated vigorous research programs in conformity (e.g., 
Asch, 1955) and obedience (e.g., Milgram, 1963). 

In a definitive 1973 review of research developments in 
political psychology since Lasswell (1951), Davies identified 
four distinct lines of inquiry in post-World War II political 
psychology: (a) the study of voting behavior in stable democ- 
racies, the dominant trend, which had become “increasingly 
dull, repetitious, and a precious picking of nits’; (b) cross- 
national comparative research in relatively stable, democratic 
polities (which included “the vexed post-World War II 
national character literature” noted by Greenstein, 1992, 
(p. 122); (c) the genesis of behavioral patterns established in 
childhood (i.e., political socialization), which, along with 
cross-national research, “provided some relief from the 
{dominant trend’s] rather static study of behavior under stable 
circumstances”; and (d) psychological political biography 
(p. 21). Concerning the latter, which is most closely allied to 
contemporary political personality inquiry, Davies (1973) 
noted the futility of attempting to ascertain the psychological 
determinants of why some individuals emerge as leaders, 
given the rudimentary nature of available conceptual tools 
and measuring devices. More useful, according to Davies, 
would be analysis and description of leadership style, which 
had become increasingly sophisticated, as evidenced by 
the work of Barber (1972-1992)— "the boldest step yet in 
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establishing a typology applicable to all American presi- 
dents,” successfully making a case for “the predictability 
of. . . . how presidents will act” (Davies, 1973, p. 25). 


The Cold War Era 


By the 1960s, the Cold War, punctuated by the 1962 Cuban 
missile crisis, brought about an important shift in the direc- 
tion of political personality research. In the shadow of the 
nuclear sword, the focus of interest shifted from the mass 
politics of followers to the elite politics of foreign-policy 
decision making. In social psychology, this trend was paral- 
leled by research endeavors such as Charles Osgood’s (1962) 
explication of graduated reciprocation in tension-reduction 
(GRIT) and Irving Janis’s (1972/1982; Janis & Mann, 1977) 
influential work on groupthink and decision-making fiascoes. 
In his review of advances in the study of personality and 
politics, Greenstein (1992) noted that the 1970s and 1980s 
were marked by “burgeoning inquiry into political perception 
and cognitive psychology more generally” (p. 112), as repre- 
sented by Robert Jervis’s (1976) text on threat perception and 
deterrence and Richard Lau and David Sears’s (1986) edited 
collection of papers on political cognition. 

As a field, political psychology thrived in the sociohistoric 
environment of the Cold War, as witnessed by the publication 
of the Handbook of Political Psychology in 1973, with an 
important chapter on “Personality in the Study of Politics” by 
its editor, Jeanne Knutson; William F. Stones (1974) ground- 
breaking introductory political psychology textbook; and the 
founding of the International Society of Political Psychology 
in 1978. Greenstein, in his now classic Personality and Poli- 
tics (1969), set about the task of clearing a path “through the 
tangle of intellectual underbrush” (Greenstein, 1987, p. v) of 
conflicting perspectives on whether personality in politics 
was amenable to, and worthy of, disciplined inquiry. 

Well into the 1980s, however, three powerful influences 
would subdue the impact of Greenstein’s (1969) and Knutson’s 
(1973) important work in mapping out a conceptual frame- 
work conferring figural status upon the personality construct in 
the evolving study of personality in politics: the dominant in- 
terest in foreign-policy decision making against the backdrop 
of the Soviet-U.S. struggle for superpower supremacy; the 
cognitive revolution (see McGraw, 2000; Simon, 1985), which 
extended its reach from its parent discipline of psychology into 
mainstream political science; and the person-situation debate 
(see Mischel, 1990) then raging in personality psychology. 

In a preface to the new edition (1987) of Personality and 
Politics, Greenstein observed that “one kind of political 
psychology—the cognitive psychology of perception and 
misperception—has found a respected niche in a political 


science field, namely international relations” (p. vi). Ole 
Holsti (1989) asserted that the psychological perspective 
constituted a basic necessity in the study of international 
politics. As the 1980s drew to a close, Jervis (1989), in a 
paper outlining major challenges to the field of political psy- 
chology, wrote, “The study of individual personalities and 
personality types has fallen out of favor in psychology and 
political science, but this does not mean the topics are 
unimportant” (p. 491). Significantly, two decades earlier 
George (1969) and Holsti (1970) had published influential 
papers that revived the World War II-era operational code 
construct, in part because perception and beliefs were viewed 
as more easily inferred than personality—given “the kinds of 
data, observational opportunities, and methods generally 
available to political scientists” (George, 1969, p. 195). 

The renewed focus on operational codes—beliefs about 
the fundamental nature of politics, which shape one’s world- 
view, and hence, one’s choice of political objectives—steered 
political personality in a distinctly cognitive direction. 
Stephen Walker (1990, 2000) and his associates (Dille & 
Young, 2000; Schafer, 2000) would carry this line of inquiry 
forward to the present day. Moreover, Hermann (1974) 
initiated a research agenda that accorded cognitive variables 
a prominent role in the study of political personality. 
Hermann’s (1980) conceptual scheme accommodated four 
kinds of personal characteristics hypothesized to play a cen- 
tral role in political behavior: beliefs and motives, which 
shape a leader’s view of the world, and decision style and 
interpersonal style, which shape the leader’s personal 
political style. Hermann’s model warrants particular attention 
because of the degree to which it integrated existing perspec- 
tives at the time, and because of its enduring influence on the 
study of personality in politics. 

Conceptually, Hermann’s notion of beliefs is anchored to 
the philosophical beliefs component of the operational code 
construct. Her interest in motives stems from Lasswell’s 
Power and Personality (1948) and Winter’s The Power 
Motive (1973)—an approach to political personality that 
Winter (1991) would elaborate into a major political person- 
ality assessment methodology in its own right. Hermann’s 
construal of decision style overlaps with the instrumental be- 
liefs component of George’s (1969) operational code con- 
struct and aspects of Barber’s (1972/1992) formulation of 
presidential character, focusing particularly on conceptual 
complexity (see Dille & Young, 2000)—once again, an ap- 
proach to political personality that would later develop 
into a major branch of political personality assessment, as 
represented in the work of Suedfeld (1994) on integrative 
complexity. Finally, Hermann’s interpersonal style domain 
encompasses a number of politically relevant personality 


Obstacles to the Advancement of the Personality-in-Politics Enterprise 603 


traits such as suspiciousness, Machiavellianism, and task 
versus relationship orientation in leadership (see Hermann, 
1980, pp. 8-10). 

Methodologically, a common strand of cognitively and 
motivationally oriented trait approaches—such as those of 
Hermann (1987), Suedfeld (1994), Walker (1990), and 
Winter (1998)—is their reliance on content analysis of public 
documents (typically speeches and other prepared remarks or 
interviews and spontaneous remarks) for the indirect assess- 
ment of political personality (see Schafer, 2000, for a recent 
overview of issues in at-a-distance methods of psychological 
assessment). 

As Simonton (1990) has noted, “The attributes of charac- 
ter that leave the biggest impression on political affairs in- 
volve both cognitive inclinations, which govern how an 
individual perceives and thinks about the world, and motiva- 
tional dispositions, which energize and channel individual 
actions in the world” (pp. 671-672). Hermann’s model, in 
capturing cognition (including beliefs or attitudes) and moti- 
vation (recognizing the importance of affect in politics and 
checking the tendency in political psychology toward 
overemphasis of human rationality), clearly fills Simonton’s 
prescription. On the other hand, Hermann’s construal of deci- 
sion style as a personality (or input) variable is problematic. 
Renshon’s (1996b) integrative theory of character and politi- 
cal performance, for example, specifies political and policy 
judgments and decision making, along with leadership, as 
performance (output) variables. Finally, Hermann’s construal 
of personality in terms of interpersonal style is too restrictive 
for a comprehensive theory of personality in politics. 


The New World Order 


Epochal events such as the fall of the Berlin Wall in 1989, the 
collapse of communist rule in Central and Eastern Europe 
in 1989-1990, the disintegration of the Soviet Union in 
1991-1992, South Africa’s transition from apartheid state to 
nonracial democracy in 1994 following Nelson Mandela’s 
release from prison in 1990, and the Persian Gulf War in 1991 
marked the beginning of a new world order, which stimulated 
renewed research interest in psychometric inquiry—an area 
that contemporaneously began to emerge as a new paradigm 
for the study of personality in politics (Immelman, 1988, 
1993; Simonton, 1990). In psychometric personality-in- 
politics inquiry, standard psychometric instruments were 
adapted to “derive personality measures from biographical 
data rather than through content analysis of primary materials” 
(Simonton, 1990, p. 678), although some investigators (e.g., 
Kowert, 1996; Rubenzer, Faschingbauer, & Ones, 2002), 
though similar in intent, opted for indirect expert ratings 


instead of direct analysis of biographical data. The focus of 
psychometric inquiry is less on cognitive variables and 
foreign-policy decision making and more on a personological 
understanding of the person in politics, his or her personality 
attributes, and the implications of personality for leadership 
performance and generalized policy orientation. 

George and George’s (1956) psychoanalytically framed 
study of Woodrow Wilson, which relied on clinical insights 
rather than psychometric evaluation of biographical data, is 
the best known precursor of the personological trend in polit- 
ical personality research. In Simonton’s (1990) judgment, 
qualitative, nonpsychometric psychobiographical analyses 
“have leaned heavily on both theoretical perspectives and 
methodological approaches that cannot be considered a cen- 
tral current in mainstream personality research” (p. 671). Al- 
though some highly informative personological studies (e.g., 
Glad, 1996; Post, 1991; Renshon, 1996a, 1998) continued in 
the older psychobiographic tradition, the twentieth century 
closed with a distinct shift in a psychometric direction (Im- 
melman, 1998, 2002; Kowert, 1996; Lyons, 1997; Rubenzer 
et al., 2002). 

Although some contemporary psychobiographically ori- 
ented studies are theoretically eclectic (e.g., Betty Glad’s 
1996 study of the transfer of power from Gorbachev to 
Yeltsin in Russia and from De Klerk to Mandela in South 
Africa), the modern psychoanalytic reformulations of Heinz 
Kohut (1971, 1977) and Otto Kernberg (1984) have acquired 
considerable cachet in political psychology. Swansbrough 
(1994), for instance, conducted a Kohutian analysis of 
George Bush’s personality and leadership style in the Persian 
Gulf war. Similarly, Stanley Renshon’s (1996a) psychobiog- 
raphy of Bill Clinton is informed primarily by Kohutian self 
psychology. Jerrold Post’s (1991) psychobiographical analy- 
sis of Saddam Hussein is more indebted to Kernberg’s notion 
of narcissistic personality organization (see Post, 1993). 
Despite Simonton’s (1990) grim prognostication and Jervis’s 
(1989) observation that “Freudian analysis and psychobi- 
ographies are out of fashion” (p. 482), the psychobiographic 
tradition has been revitalized by the analytic insights of 
scholars such as Post and Renshon. 


OBSTACLES TO THE ADVANCEMENT OF THE 
PERSONALITY-IN-POLITICS ENTERPRISE 


Greenstein (1992) has formulated what may be the most con- 
cise statement of the case for studying personality in politics: 
“Political institutions and processes operate through human 
agency. It would be remarkable if they were not influenced by 
the properties that distinguish one individual from another” 
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(p. 124). Yet, specialists in the study of politics “tend to con- 
centrate on impersonal determinants of political events and 
outcomes” or define away personal characteristics, “positing 
rationality . . . and presuming that the behavior of actors can 
be deduced from the logic of their situation” (p. 106). The 
relevance of the study of personality with respect to political 
leadership is nicely captured in Renshon’s (1996b) con- 
tention that 


many of the most important aspects of presidential performance 
rely on the personal characteristics and skills of the president. . . . 
It is his views, his goals, his bargaining skills . . . , his judgments, 
his choices of response to arising circumstance that set the levers 
of administrative, constitutional, and institutional structures into 
motion. (p. 7) 


In this regard, Glad (1996), writing about the collapse of the 
communist state in the Soviet Union and the apartheid state 
in South Africa, has shown convincingly that the personal 
qualities of leaders can play a critical role at turning points in 
history. 


Scholarly Skepticism and Inadequate Conceptual 
and Methodological Tools 


Despite the conviction of personality-in-politics practitioners 
in the worth of their endeavor, the study of personality in pol- 
itics is not without controversy (see Lyons, 1997, pp. 792— 
793, for a concise review of “controversies over the presi- 
dential personality approach”). Greenstein (1969, pp. 33-62) 
offered an incisive critique of “two erroneous” and “three 
partially correct” objections to the study of personality in pol- 
itics, lamenting that the study of personality in politics was 
“not a thriving scholarly endeavor,” principally because 
“scholars who study politics do not feel equipped to analyze 
personality in ways that meet their intellectual standards. . . . 
[thus rendering it primarily] the preserve of journalists” 
(p. 2). The optimistic verdict more than three decades later is 
that political personality has taken root and come of age as a 
scholarly endeavor, as evidenced by the inclusion of the 
present chapter in this volume. 


Inadequate Transposition From Source to 
Target Discipline 


Although the enterprise of studying personality in politics 
has largely succeeded in countering common objections to its 
usefulness, it has been hampered by inadequate transposition 
from the source discipline of personality assessment to the 
target discipline of political psychology. For political person- 
ality inquiry to remain a thriving scholarly endeavor and 


have an impact beyond the narrow confines of academic 
political psychology, it will need to account, at a minimum, 
for the patterning of personality variables “across the entire 
matrix of the person” (Millon & Davis, 2000, pp. 2, 65). Only 
then will political personality assessment provide an ade- 
quate basis for explaining, predicting, and understanding 
political outcomes. Moreover, political personologists will 
need to advance an integrative theory, not only of personality 
and of political leadership, but also of the personality-politics 
nexus. In The Psychological Assessment of Presidential Can- 
didates (1996b), Stanley Renshon provides a partial blueprint 
for this daunting task. 


Inadequate Progress From Description of Observable 
Phenomena to Theoretical Systematization 


Ultimately, scholarly progress in personality-in-politics in- 
quiry hinges on its success in advancing from the “natural 
history stage of inquiry” to a “stage of deductively formu- 
lated theory” (Northrop, 1947). The intuitive psychologist’s 
“ability to ‘sense’ the correctness of psychological insight” 
(chapter by Millon in this volume) presents an easily over- 
looked obstacle to progress in political-personological in- 
quiry. Early in the development of a scientific discipline, 
according to philosopher of science Carl Hempel (1965), in- 
vestigators primarily strive “to describe the phenomena 
under study and to establish simple empirical generalizations 
concerning them,” using terms that “permit the description of 
those aspects of the subject matter which are ascertainable 
fairly directly by observation” (p. 140). Hermann’s (1974, 
1980) early work illustrates this initial stage of scientific 
development. In the words of Hempel (1965), 


The shift toward theoretical systematization is marked by the in- 
troduction of new, “theoretical” terms, which refer to various 
theoretically postulated entities, their characteristics, and the 
processes in which they are involved; all of these are more or 
less removed from the level of directly observable things and 
events. (p. 140) 


Hermann’s (1987) proposal of a model suggesting how lead- 
ers’ observable personal characteristics “link to form role 
orientations to foreign affairs” (p. 162) represents consider- 
able progress in this direction; however, it lacks systematic 
import. 


A Lack of Systematic Import 


Theoretical systematization and empirical import (opera- 
tional definitions) are necessary but not sufficient for 
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scientific progress. 


To be scientifically useful a concept must lend itself to the for- 
mulation of general laws or theoretical principles which reflect 
uniformities in the subject matter under study, and which thus 
provide a basis for explanation, prediction, and generally scien- 
tific understanding. (Hempel, 1965, p. 146) 


The most striking instance of this principle of systematic 
import, according to Hempel (1965), “is the periodic system 
of the elements, on which Mendeleev based a set of highly 
specific predictions, which were impressively confirmed 
by subsequent research” (p. 147). Hempel chronicled similar 
scientific progress in biological taxonomic systems, which 
proceeded from primitive classification based on observable 
characteristics to a more advanced phylogenetic-evolutionary 
basis. Thus, “two phenomenally very similar specimens may 
be assigned to species far removed from each other in the 
evolutionary hierarchy, such as the species Wolf (Canis) and 
Tasmanian Wolf (Thylacinus)” (Hempel, 1965, p. 149). 

For personality-in-politics inquiry to continue advancing 
as a scholarly discipline, it will have to come to grips with the 
canon of systematic import. At base, this means that theoret- 
ical systematizations cannot be constructed on the foundation 
of precisely those personal characteristics from which they 
were originally inferred (see chapter by Millon in this vol- 
ume). As Kurt Gödel (1931) demonstrated with his incom- 
pleteness theorem, no self-contained system can prove or 
disprove its own propositions while operating within the 
axioms of that system. 


TOWARD A GENERATIVE THEORY 
OF PERSONALITY AND POLITICAL 
PERFORMANCE 


Ideally, conceptual systems for the study of political person- 
ality should constitute a comprehensive, generative, theo- 
retically coherent framework consonant with established 
principles in the adjacent sciences (particularly the more 
mature natural sciences; see Millon’s chapter in this volume), 
congenial with respect to accommodating a diversity of 
politically relevant personal characteristics, and capable of 
reliably predicting meaningful political outcomes. In this 
regard, Renshon (1996b) is critical of unitary trait theories 
of political personality (such as those relying primarily on 
isolated personality variables, motives, or cognitive vari- 
ables), noting that “it is a long causal way from an individual 
trait of presidential personality to a specific performance out- 
come” and that unitary trait theories fail to contribute to the 


development of an integrated psychological theory of leader- 
ship performance. He ventures that “more clinically based 
theories ... might form the basis of a more comprehensive 
psychological model of presidential performance” (p. 11). 
The problem bedeviling contemporary personality-in- 
politics inquiry, however, is more profound than the precari- 
ous perch of leadership performance theories on a fragmented 
personological foundation. In his critique of postwar research 
directions in political psychology, Davies (1973) declared: 


There is... a kind of atrophy of theory and research that can help 
us link observable acts with their deeply and generally an- 
tecedent causes in the human organism, notably the nervous and 
endocrine systems. Aristotle sought such relationships. So did 
Hobbes, whose Leviathan (1651) founded its analysis of politi- 
cal institutions on a theory of human nature. And likewise, 
Lasswell has sought to relate fundamental determinants to 
observable effects—and vice versa. (p. 26) 


Similarly, but with greater theoretical precision, Millon 
(1990), in explicating his evolutionary theory of personality, 
distinguished between “true, theoretically deduced” nosolo- 
gies and those that provide “a mere explanatory summary of 
known observations and inferences” (p. 105). He cited Hempel 
(1965), who proposed that scientific classification ought to 
have an “objective existence in nature, . . . ‘carving nature at the 
joints,’ in contradistinction to ‘artificial’ classifications, in 
which the defining characteristics have few explanatory or 
predictive connections with other traits” (p. 147). Ultimately, 
“in the course of scientific development, classifications defined 
by reference to manifest, observable characteristics will tend to 
give way to systems based on theoretical concepts” (Hempel, 
1965, pp. 148-149). 

Greenstein (1987), pointing to the work of Gangestad and 
Snyder (1985) and Morey (1985), acknowledged the substan- 
tial progress since the publication of his seminal Personality 
and Politics (1969) “in grounding complex psychological 
typologies empirically,” yet pessimistically proclaimed that 
“complex typologies are not easily constructed and docu- 
mented” (Greenstein, 1987, p. xiv). Although Greenstein was 
clearly correct on both counts, he failed to report that these 
typologies had already been constructed and empirically 
documented (see, for example, Millon, 1986). Greenstein’s 
(1987) conclusion, that the difficulty of constructing a com- 
plex typology renders it “productive to classify political 
actors in terms of single traits that differentiate them in 
illuminating ways” (p. xiv), is therefore patently founded 
on a false premise. This pitfall of overlooking parallel de- 
velopments in clinical science is reminiscent of Barber’s 
(1972/1992) construction, de novo, of a rudimentary 2 x 2 
model for assessing presidential character, which yields little 
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more systematic import or prototypal distinctiveness than the 
humoral doctrine of Hippocrates, 24 centuries earlier. 


Toward a Politically Relevant Theory 
of Personality in Politics 


Renshon (1996b) has argued persuasively that a president’s 
character serves as the foundation for leadership effective- 
ness, in part because political parties (in the United States) 
have lost much of their ability to serve as “filters” for evalu- 
ating candidates, who are no longer mere standard-bearers of 
party platforms and ideologies (pp. 38-40). Renshon exam- 
ines the psychology of presidential candidates using theories 
of character and personality, theories of presidential leader- 
ship and performance, and theories of public psychology. For 
a concise, schematic outline of Renshon’s model, which is 
anchored to Kohut’s (1971, 1977) psychoanalytic self theory, 
the reader is referred to appendix 2 (pp. 409-411) of his 
book, The Psychological Assessment of Presidential Candi- 
dates (1996b). 

For the great majority of psychodiagnosticians, who are 
more familiar with Axis II of the Diagnostic and Statistical 
Manual of Mental Disorders (4th ed.; DSM-IV) of the 
American Psychiatric Association (APA; 1994) than with 
Kohutian self psychology as a framework for recording per- 
sonality functioning, Renshon’s (1996b) particular clinically 
based theory of political personality may be somewhat 
restrictive, if not arcane. Fortunately, the value of Renshon’s 
work with respect to mapping out an integrated theory of char- 
acter and leadership for political personality assessment is not 
contingent upon the utility of the personological component 
of his model; it can easily be molded to the theoretical pro- 
clivities of the practitioner, including—perhaps especially— 
those favoring a theoretical orientation more compatible with 
the DSM-IV. 


Toward a Psychologically Grounded Theory of 
Political Performance 


In developing a psychologically grounded theory of political 
performance, Renshon (1996b) distinguished between two 
key elements of presidential role performance: “making good 
policy and political decisions” and “pursuing and realizing 
policy purposes” (p. 12). With regard to the former, Renshon 
(1996b) proposed a model of judgment and decision making 
(pp. 206-223, 411) capable of accommodating those cogni- 
tive constructs that became popular in Cold War-era political 
psychology (e.g., integrative complexity). Concerning the 
second aspect of political performance, Renshon (1996b) pro- 
poses “three distinct aspects” (p. 226) of political leadership 


shaped by character: mobilization, the ability to arouse, en- 
gage, and direct the public; orchestration, the organizational 
skill and ability to craft specific policies; and consolidation, 
the skills and tasks required to preserve the supportive rela- 
tionships necessary for implementing and institutionalizing 
one’s policy judgments (pp. 227, 411). 

However, those seeking to develop a generative theory of 
personality and political performance confront a conceptual 
minefield—a problem highlighted previously with respect to 
the overly restrictive, psychodynamically framed character 
component of Renshon’s model, which limits its integrative 
potential. This issue is examined more closely in the next 
section. 


CONCEPTUAL PROBLEMS IN THE STUDY OF 
PERSONALITY IN POLITICS 


Unresolved conceptual problems that cloud personality-in- 
politics inquiry include a lack of agreement about the appro- 
priate levels of analysis; a lack of clarity about the requisite 
scope of inquiry; theoretical stagnation; and a failure of some 
approaches to satisfy basic standards for operationalizing 
the personality construct. 


Levels of Analysis 


In his early efforts to chart a course for the field’s develop- 
ment, Greenstein (1969) noted that the personality-in-politics 
literature was “formidably gnarled—empirically, method- 
ologically, and conceptually” (p. 2). He identified three oper- 
ational levels for the assessment of personality in politics: 
phenomenology, dynamics, and genesis. In Greenstein’s 
opinion, these distinctions are useful 


for sorting out the different kinds of operations involved in the 
psychological diagnosis of political actors, and for ordering 
diagnostic operations in terms of both the directness of their 
bearing on explanations of political action and the degree to 
which they can be carried out in a more or less standardized 
fashion. (p. 144) 


Phenomenology—regularities in the observable behavior 
of political actors—according to Greenstein, is “the most im- 
mediately relevant supplement to situational data in predict- 
ing and explaining the actor’s behavior” (p. 144), whereas 
explanations of genesis are “remote from the immediate 
nexus of behavior” and pose “difficult questions of valida- 
tion” (p. 145). With the increasing dominance of descriptive 
approaches and the dwindling influence of psychoanalysis in 


contemporary personality assessment (Jervis, 1989, p. 482; 
Simonton, 1990, p. 671), preoccupation with personality 
dynamics can be expected to wane, while psychogenesis 
already occupies a peripheral role in political personality, of 
primary interest to psychohistorians. 

Millon’s (1990) evolutionary model refines Greenstein’s 
three operational levels of analysis (phenomenology, dynam- 
ics, and genesis) by redefining genesis as a conceptual 
construct, relabeling dynamics as the intrapsychic level of 
analysis, disaggregating phenomenology into phenomeno- 
logical and behavioral data levels, and adding a fourth, 
biophysical, data level. 

The critical operational constructs are the clinical domains 
(or personality attributes), which provide an explicit basis for 
personality assessment. Millon’s (1990) evolutionary model 
specifies four structural domains (object representations, 
self-image, morphologic organization, and mood or tempera- 
ment) and four functional domains (expressive behavior, 
interpersonal conduct, cognitive style, and regulatory mecha- 
nisms) encompassing four data levels: behavioral (expressive 
behavior, interpersonal conduct); phenomenological (cogni- 
tive style, object representations, self-image); intrapsychic 
(regulatory mechanisms, morphological organization); and 
biophysical (mood or temperament). 


Scope of Inquiry 


Beyond simply refining Greenstein’s (1969) specification of 
operational levels for personality-in-politics inquiry, the 
scope of this endeavor must be elucidated if political person- 
ality is to extricate itself from the “tangled underbrush.” The 
requisite scope of inquiry is implied in the organizational 
framework of a representative undergraduate personality text 
(Pervin & John, 2001), which presents theory and research in 
terms of structure, process, development, psychopathology, 
and change—a formulation consistent with the organizing 
framework of structure, dynamics, development, assessment, 
and change that Gordon Allport employed in his seminal text, 
Personality: A Psychological Interpretation (1937). Millon’s 
(1990, 1996) contemporary clinical model of personality fol- 
lows this time-honored tradition by construing personality in 
terms of its structural and functional domains, normal and 
pathological variants, developmental background (including 
hypothesized biogenic factors and characteristic develop- 
mental history), homeostatic (self-perpetuation) processes, 
and domain-based modification strategies and tactics. 


Theoretical Orientation 


In an important recapitulation nearly a quarter-century after 
his landmark work in Personality and Politics (1969), 
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Greenstein (1992) resolved, “The study of personality and 
politics is possible and desirable, but systematic intellectual 
progress is possible only if there is careful attention to prob- 
lems of evidence, inference, and conceptualization” (p. 105). 
He went on to assert, however, that “it is not appropriate to 
recommend a particular personality theory,” suggesting that 
the theories of “Freud, Jung, Allport, Murray, and . . . many 
others” (p. 117) are all potentially useful. Although there is 
merit in Greenstein’s (1973) counsel to “let many flowers 
bloom” (p. 469), professional psychodiagnosticians—who 
tend not to treat the classic schools of personality theory as 
templates for tailoring their assessment tools—might find 
this assertion quite striking. Burgeoning scientific and tech- 
nological progress in clinical science over the past half- 
century practically dictates that assimilating contemporary 
approaches to psychodiagnostics and personality assessment 
provides a less obstacle strewn passage for personality-in- 
politics practitioners than steering a course illuminated solely 
by the radiance of the great pioneers of personality theory. 
Despite major advances in behavioral neuroscience, evolu- 
tionary ecology, and personality research in the past two 
decades (see chapter by Millon in this volume), personality- 
in-politics inquiry arguably has become insular and stagnant, 
with few fresh ideas and—with the exception of cognitive 
science—little indication of meaningful cross-pollination of 
ideas from adjacent disciplines. 


Necessary Conditions for Operationalizing 
Research Designs 


In the original Handbook of Political Psychology (1973), 
Knutson implored that, to be feasible for studying personality 
in politics, conceptual models should fulfill three critical re- 
quirements for operationalizing research designs in political 
personality: Clearly conceptualize the meaning of the term 
personality; delineate attributes of personality that can be 
quantified or objectively assessed, thereby rendering them 
amenable to scientific study; and specify how the personality 
attributes subjected to scientific inquiry relate to the per- 
sonality construct (pp. 34-35). As shown next, Millon’s 
(1990, 1996) evolutionary model of personality satisfies all 
three of Knutson’s criteria, making it eminently useful for 
studying personality in politics. 


Defining Personality 


From Millon’s evolutionary-ecological perspective, person- 
ality constitutes ontogenetic, manifest, adaptive styles of 
thinking, feeling, acting, and relating to others, shaped by 
interaction of latent, phylogenetic, biologic endowment and 
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social experience (chapter by Millon in this volume). This 
construal is consistent with the contemporary view of per- 
sonality as 


a complex pattern of deeply embedded psychological character- 
istics that are largely nonconscious and not easily altered, 
expressing themselves automatically in almost every facet of 
functioning. Intrinsic and pervasive, these traits emerge from a 
complicated matrix of biological dispositions and experiential 
learnings, and ultimately comprise the individual’s distinctive 
pattern of perceiving, feeling, thinking, coping, and behaving. 
(Millon, 1996, p. 4) 


Delineating the Core Attributes of Personality 


In constructing an integrated personality framework that ac- 
counts for “the patterning of characteristics across the entire 
matrix of the person” (Millon & Davis, 2000, p. 2), Millon 
(1994b) favors a theoretically grounded “prototypal domain 
model” (p. 292) that combines quantitative dimensional 
elements (e.g., the five-factor approach) with a qualitative 
categorical approach (e.g., DSM-IV). The categorical aspect 
of Millon’s model is represented by eight universal attribute 
domains relevant to all personality patterns, namely expres- 
sive behavior, interpersonal conduct, cognitive style, mood 
or temperament, self-image, regulatory mechanisms, object 
representations, and morphologic organization. 


Assessing Personality on the Basis of Variability 
Across Attributes 


Millon specifies prototypal features (diagnostic criteria) 
within each of the eight attribute domains for each personal- 
ity style (Millon, 1994a; Millon & Everly, 1985) or disorder 
(1990, 1996) accommodated in his taxonomy. The dimen- 
sional aspect of Millon’s schema is achieved by evaluating 
the “prominence or pervasiveness” (1994b, p. 292) of the 
diagnostic criteria associated with the various personality 


types. 


Additional Considerations 


Traditionally, political personality assessment has borne little 
resemblance to the conceptualization of personality shared by 
most clinically trained professional psychodiagnosticians, or 
to their psychodiagnostic procedures. In satisfying Knutson’s 
three criteria, Millon’s personological model offers a viable 
integrative framework for a variety of current approaches to 
political personality, thus narrowing conceptual and method- 
ological gaps between existing formulations in the source dis- 
ciplines of personology and personality assessment and the 


target discipline of contemporary political personality— 
specifically the psychological examination of political 
leaders. 

Although necessary for operationalizing research designs, 
Knutson’s (1973) three criteria provide an insufficient basis 
for applied personality-in-politics modeling. A theoretically 
sound, comprehensive, useful personality-in-politics model 
with adequate explanatory power and predictive utility must 
meet additional standards. I propose the following basic stan- 
dards for personality-in-politics modeling: 


1. The meaning of the term personality should be clearly 
defined. 


2. Quantifiable personality attributes amenable to objective 
assessment should be clearly specified. 


3. The personality attributes subject to inquiry should be 
explicitly related to the personality construct as whole. 


4. The conceptual model for construing personality in 
politics should be congruent with personality systems 
employed with reference to the general population. 


5. The conceptual model for construing political personal- 
ity should be integrative, capable of accommodating 
diverse, multidisciplinary perspectives on politically 
relevant personal characteristics. 


6. The conceptual model should offer a unified view of 
normality and psychopathology. 


7. The conceptual model should be rooted in personality 
theory, with clearly specified referents in political leader- 
ship theory. 


8. The personality-in-politics model should be embedded 
in a larger conceptual framework that acknowledges 
cultural contexts and the impact of distal and proximal 
situational determinants that interact with dispositional 
variables to shape political behavior. 

9. The methodology for assessing political personality 
should be congruent with standard psychodiagnostic 
procedures in conventional clinical practice. 

10. The assessment methodology should be inferentially 
valid. 

11. The assessment methodology should meet acceptable 
standards of evidence for reliability. 

12. For purposes of predictive utility, the assessment 
methodology should be practicable during political 
campaigns. 

13. For considerations of efficiency, the assessment method- 
ology should be minimally cumbersome or unwieldy. 

14. For optimal utility, the assessment methodology should 
be remote, indirect, unobtrusive, and nonintrusive. 


15. For advancing theoretical systematization, the concep- 
tual model should be nomothetically oriented, permit 
typological inquiry, and posit a taxonomy of political 
personality types. 


A PERSONALITY-IN-POLITICS AGENDA 
FOR THE NEW CENTURY 


In the new world order of the twenty-first century, personal- 
ity-in-politics inquiry is poised to reclaim personality as the 
central organizing principle in the study of political leader- 
ship, informed by insights garnered from the cognitive revo- 
lution preceding the close of the twentieth century and 
energized by the quickening evolutionary reconceptualiza- 
tion of personology at the dawn of the new millennium. 


From Cognitive Revolution to Evolutionary Psychology 


On the crest of major breakthroughs in evolutionary biology 
during the preceding quarter-century, the emerging evolution- 
ary perspective in psychology since the mid-1980s (see Buss, 
1999; Millon, 1990; Millon, this volume) represents the first 
major theoretical shift in the discipline since the cognitive rev- 
olution of the 1950s and 1960s. Conceptually, the integrative 
capacity of Millon’s (1990; Millon, this volume) evolutionary 
model renders it sufficiently comprehensive to accommodate 
major tenets of psychodynamic, behavioral, humanistic, inter- 
personal, cognitive, biogenic, and trait approaches to person- 
ality. Methodologically, Millon’s framework provides an 
empirically validated taxonomy of personality patterns com- 
patible with the syndromes described in DSM-IV, Axis II 
(APA, 1994). 

No present conceptual system in the field of political 
personality rivals Millon’s model in compatibility with con- 
ventional psychodiagnostic methods and standard clinical 
practice in personality assessment. Moreover, no current sys- 
tem matches the elegance with which Millon’s evolutionary 
model synthesizes normality and psychopathology. In short, 
Millon offers a theoretically coherent alternative to existing 
conceptual frameworks and assessment methodologies for 
the psychological examination of political leaders (see Post, 
2003, for an up-to-date collection of current conceptualiza- 
tions; see Kinder, 1999, for a series of reviews, both critical 
and laudatory, of “Millon’s evolving personality theories and 
measures”). 


The Utility of Millon’s Model as a Generative 
Framework for the Study of Personality in Politics 


The work of Millon (1990, 1994a, 1996, and his chapter in 
this volume; Millon & Davis, 2000; Millon, Davis, & Millon, 
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1996; Millon & Everly, 1985) provides a sound foundation 
for conceptualizing and assessing political personality, clas- 
sifying political personality types, and predicting political 
behavior. 

Epistemologically, it synthesizes the formerly disparate 
fields of psychopathology and normatology and formally 
connects them to broader spheres of scientific knowledge, 
most notably their foundations in the natural sciences 
(Millon, this volume). Diagnostically, it offers an empirically 
validated taxonomy of personality patterns congruent with 
the syndromes described on Axis II of DSM-IV (APA, 1994), 
thus rendering it compatible with conventional psychodiag- 
nostic procedures and standard clinical practice in personal- 
ity assessment. 

Millon (1986) uses the concept of the personality prototype 
(paralleling the medical concept of the syndrome) as a global 
formulation for construing and categorizing personality sys- 
tems, proposing that “each personality prototype should com- 
prise a small and distinct group of primary attributes that 
persist over time and exhibit a high degree of consistency” 
(p. 681). To Millon, the essence of personality categorization 
is the differential identification of these enduring (stable) and 
pervasive (consistent) primary attributes. This position is con- 
sistent with the conventional view of personality in the study 
of politics (see Knutson, 1973, pp. 29-38). In organizing his 
attribute schema, Millon (1986) favors “an arrangement that 
represents the personality system in a manner similar to that of 
the body system, that is, dividing the components into struc- 
tural and functional attributes” (p. 681; see Millon, 1990, 
pp. 134-135, for a concise summary of these attribute 
domains). 


The Core Characteristics of a Comprehensive 
Model of Personality in Politics 


Acomprehensive model for the study of personality in politics 
(see Fig. 24.1) should account for structural and functional 
personality attributes, at behavioral, phenomenological, in- 
trapsychic, and biophysical levels of analysis; permit supple- 
mentary developmental causal analysis (i.e., genesis or 
etiology); provide an explicit framework for risk analysis (i.e., 
account for normal variability as well as personality 
pathology); and provide an assessment methodology. Further- 
more, the personality model should be linked with perfor- 
mance outcomes, recognize the impact of situational variables 
and the cultural context on political performance, and 
allow for personological, situational, and contextual filters 
that may modulate the impact of personality on political 
performance. 
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Developmental causal analysis 
Biogenic factors 
Experiential history 
(Millon, 1996) 
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Structural personality attributes 


Functional personality attributes 


Self-image 
Object representations 
Morphologic organization 
Mood or temperament 
(Millon, 1990, 1996) 


Expressive behavior 
Interpersonal conduct 
Cognitive style 
Regulatory mechanisms 
(Millon, 1990, 1996) 
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Assessment 


Experiential filters 
Training or expertise 
Specific interests 
(Hermann, 1978, 1980, 1987) 


Risk analysis 
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Strategic performance modalities 


Tactical performance modalities 


Existence: pain-pleasure 
Adaptation: passive—active 
Replication: other-self 
Abstraction: cognitive 
(Millon, 1990, 1994a, 1996) 


Biophysical 
Behavioral 
Phenomenological 
Intrapsychic 
(Millon, 1990, 1996) 


| 


Situational constraints 
Historical antecedents 
Immediate situation 
(Smith, 1968, 1973) 
Cultural context 


| 


Psychological political forecasting 


Figure 24.1 A generative conceptual model for assessing personality and political 


performance. 


STRUCTURAL ATTRIBUTES OF PERSONALITY 


Structural attributes, according to Millon (1990), “represent a 
deeply embedded and relatively enduring template of im- 
printed memories, attitudes, needs, fears, conflicts, and so on, 
which guide the experience and transform the nature of ongo- 
ing life events” (p. 147). Millon (1986, 1990) has specified 
four structural attributes of personality, outlined in the follow- 
ing subsections. Where relevant, equivalent or compatible 
formulations in the field of political psychology are noted. 


Self-Image 


Self-image, located at the phenomenological level of ana- 
lysis, denotes a person’s perception of self-as-object or the 
manner in which people overtly describe themselves (Millon, 
1986; 1990, pp. 148-149). 


This domain accommodates self-confidence, an element of 
decision style in Hermann’s (1980, 1987) conceptual scheme. 
It also offers an alternative theoretical basis for construing 
Renshon’s (1996b) character domain of ambition, derived 
from Kohut’s (1971, 1977) psychoanalytic self theory. 


Object Representations 


The domain of object representations, located at the phenome- 
nological level of analysis, encompasses the inner imprint left 
by a person’s significant early experiences with others—in 
other words, the structural residue of significant past experi- 
ences, composed of memories, attitudes, and affects, which 
serves as a substrate of dispositions for perceiving and respond- 
ing to the social environment (Millon, 1986, 1990, p. 149). 
This domain accommodates Renshon’s (1996b) character 
attribute of relatedness, which is steeped in object-relations 


theory, including Kohut’s (1971) selfobject construct and 
Karen Horney’s (1937) interpersonal tendencies. 


Morphologic Organization 


Morphologic organization, located at the intrapsychic level 
of analysis, embodies the overall architecture that serves as 
framework for a person’s psychic interior—the structural 
strength, interior congruity, and functional efficacy of the 
personality system (Millon, 1986, 1990, pp. 149, 157). 

This domain, roughly equivalent to the notion of ego 
strength, provides a good fit for Renshon’s (1996b) realm of 
character integrity, derived from Kohut’s (1971) self theory 
and elaborated in terms of Erikson’s (1980) notions of ego 
identity and ego ideal. 


Mood or Temperament 


Mood or temperament, located at the biophysical level of 
analysis, captures a person’s typical manner of displaying 
emotion and the predominant character of an individual’s 
affect, and the intensity and frequency with which he or she 
expresses it (Millon, 1986, 1990, p. 157). 

This domain provides a suitable fit for Barber’s (1972/1992) 
construal of presidential character along positive-negative 
(i.e., affective) and active-passive (i.e., predisposition to activ- 
ity, or temperamental) dimensions. In conjunction with the 
domain of cognitive style, mood or temperament also provides 
a conceptual frame of reference for the so-called pessimistic 
explanatory style of stable (vs. unstable), global (vs. specific), 
and internal (vs. external) causal attribution with respect to ad- 
versity, which, in combination with excessive rumination 
about problems, has been shown to predict not only suscepti- 
bility to helplessness and depression, but the electoral defeat of 
presidential candidates (Zullow & Seligman, 1990). 


FUNCTIONAL ATTRIBUTES OF PERSONALITY 


Functional attributes, according to Millon (1990), “represent 
dynamic processes that transpire within the intrapsychic 
world and between the individual’s self and psychosocial en- 
vironment” (p. 136). Millon (1986, 1990) has specified four 
functional attributes of personality, outlined in the next sec- 
tions. Where relevant, equivalent or compatible formulations 
in the field of political psychology are noted. 


Expressive Behavior 


Expressive behavior, located at the behavioral level of analy- 
sis, refers to a person’s characteristic behavior—how the 
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individual typically appears to others and what the individual 
knowingly or unknowingly reveals about him- or herself or 
wishes others to think or to know about him or her (Millon, 
1986, 1990, p. 137). 

Numerous personality traits commonly used to describe 
political behavior are accommodated by this domain, includ- 
ing assertiveness, confidence, competence, arrogance, suspi- 
ciousness, impulsiveness, prudence, and perfectionism. 


Interpersonal Conduct 


Interpersonal conduct, located at the behavioral level of 
analysis, includes a person’s typical style of interacting with 
others, the attitudes that underlie, prompt, and give shape to 
these actions, the methods by which the individual engages 
others to meet his or her needs, and the typical modes of cop- 
ing with social tensions and conflicts (Millon, 1986, 1990, 
pp. 137, 146). 

This domain accommodates the personal political charac- 
teristic of interpersonal style in Hermann’s (1980, 1987) con- 
ceptual scheme, including its two operational elements, 
distrust of others and task orientation. The domain of inter- 
personal conduct also offers a conceptual niche for Christie 
and Geis’s (1970) operationalization of Machiavellianism, 
which remains popular as a frame of reference for describing 
political behavior. 


Cognitive Style 


Cognitive style, located at the phenomenological level of 
analysis, signifies a person’s characteristic manner of focusing 
and allocating attention, encoding and processing informa- 
tion, organizing thoughts, making attributions, and communi- 
cating thoughts and ideas (Millon, 1986, 1990, p. 146). 

This domain accommodates the personal political charac- 
teristics of beliefs and decision style in Hermann’s (1980, 
1987) framework, most notably the conceptual complexity 
component of decision style, and integrative complexity (e.g., 
Suedfeld & Tetlock, 1977; Tetlock, 1985), which rose to 
prominence during the Cold War era as a major construct for 
operationalizing personality in politics. The domain of cogni- 
tive style is also compatible with the notions of nationalism 
and belief in one’s own ability to control events (the two key 
operational elements of beliefs in Hermann’s conceptual 
framework) and her operationalization of several beliefs as- 
sociated with contemporary reformulations of the operational 
code construct (George, 1969; Holsti, 1970; Walker, 1990), 
such as belief in the predictability of events and belief in the 
inevitability of conflict. 
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Regulatory Mechanisms 


The domain of regulatory mechanisms, located at the in- 
trapsychic level of analysis, involves a person’s characteristic 
mechanisms of self-protection, need gratification, and con- 
flict resolution (Millon, 1986, 1990, pp. 146-147). 

The need-gratification facet of the regulatory mechanisms 
domain provides a potential fit for Winter’s (1973, 1987, 
1991, 1998) approach to political personality, which empha- 
sizes needs for power, achievement, and affıliation, and for 
the related motives aspect of the personal characteristics com- 
ponent of Hermann’s (1980, 1987) conceptual scheme. 


PERSONALITY DESCRIPTION, PSYCHOGENETIC 
UNDERSTANDING, AND PREDICTIVE POWER 


The practical value of conceptual systems for assessing per- 
sonality in politics is proportionate to their predictive utility 
in anticipating political behavior. Moreover, there is consid- 
erable merit in a personality model’s capacity to promote ac- 
curate understanding of the developmental antecedents of 
political personality patterns. 


Developmental Causal Analysis 


The importance of a developmental component in a com- 
prehensive model of personality is implicit in Millon and 
Davis’s (2000) contention that, “once the subject has been 
conceptualized in terms of personality prototypes of the clas- 
sification system, biographical information can be added” 
to answer questions about the origin and development of the 
subject’s personality characteristics (p. 73). Greenstein (1992) 
cautions against “the fallacy of observing a pattern of behav- 
ior and simply attributing it to a particular developmental 
pattern, without documenting causality, and perhaps even 
without providing evidence that the pattern existed” (p. 121). 

Millon (1996, chapter 3) frames developmental causal 
analysis in terms of hypothesized biogenic factors and the 
subject’s characteristic developmental history. For the major- 
ity of present-day personality-in-politics investigators, who 
generally favor a descriptive approach to personality assess- 
ment, developmental questions are of secondary relevance; 
however, an explicit set of developmental relational state- 
ments is invaluable for psychobiographically oriented analy- 
sis. Moreover, precisely because each personality pattern has 
characteristic developmental antecedents, in-depth knowl- 
edge of a subject’s experiential history can be useful with re- 
spect to validating the results of descriptive personality 


assessment, or for suggesting alternative hypotheses (Millon 
& Davis, 2000, p. 74). This benefit notwithstanding, genetic 
reconstruction does not constitute an optimal basis for per- 
sonality assessment and description. 


A Framework for Risk Analysis 


As Sears (1987) has noted, a problem with existing concep- 
tualizations of personality in politics is the dichotomy be- 
tween pathology-oriented and competence-oriented analyses. 
Millon’s evolutionary theory of personality bridges the gap 
by offering a unified view of normality and psychopathology: 
“No sharp line divides normal from pathological behavior; 
they are relative concepts representing arbitrary points on a 
continuum or gradient” (Millon, 1994b, p. 283). The synthe- 
sis of normality and pathology is an aspect of Millon’s prin- 
ciple of syndromal continuity, which holds, in part, that 
personality disorders are simply “exaggerated and pathologi- 
cally distorted deviations emanating from a normal and 
healthy distribution of traits” (Millon & Everly, 1985, p. 34). 
Thus, whereas criteria for normality include “a capacity to 
function autonomously and competently, a tendency to adjust 
to one’s environment effectively and efficiently, a subjective 
sense of contentment and satisfaction, and the ability to actu- 
alize or to fulfill one’s potentials” (Millon, 1994b, p. 283), the 
presence of psychopathology is established by the degree to 
which a person is deficient, imbalanced, or conflicted in these 
areas (Millon, this volume). 

At base, then, Millon (1994b) regards pathology as result- 
ing “from the same forces . . . involved in the development of 
normal functioning . . ., [the determining influence being] the 
character, timing, and intensity” (p. 283) of these factors (see 
also Millon, 1996, pp. 12-13). From this perspective, risk 
analysis would entail the classification of individuals on a 
range of dimensions, each representing a normal-pathological 
continuum. 

Despite the emphasis of Millon’s (1996) clinical model on 
personality disorders, the absence of a conceptual distinction 
between normal and abnormal personality—the assertion that 
personality disorders are merely pathological distortions of 
normal personality attributes (Millon, 1990; Millon & Everly, 
1985)—his theoretical system is particularly well suited for 
studying the implications of personality for political perfor- 
mance, because implicit in the principle of syndromal conti- 
nuity is a built-in framework for risk analysis. In short, 
Millon’s system offers an integrated framework for constru- 
ing normal variability and personality pathology, and suggests 
the likely nature and direction of personality decompensation 
under conditions of catastrophic personality breakdown. 


ASSESSMENT METHODOLOGIES 


Approaches to the indirect assessment of personality in poli- 
tics can generally be classified into three categories: content 
analysis, expert ratings, and psychodiagnostic analysis of 
biographical data. 


Content Analysis 


The fundamental assumption of content-analytic techniques 
for at-a-distance (i.e., indirect) measures “is that it is possible 
to assess psychological characteristics of a leader by system- 
atically analyzing what leaders say and how they say it” 
(Schafer, 2000, p. 512). Content analysis remains the domi- 
nant approach to indirect personality assessment and is 
widely acknowledged in political psychology as a reliable 
data-analytic method. It draws on the assumptions and meth- 
ods of psychology, political science, and speech communica- 
tion (Schafer, 2000, p. 512) and predates the establishment of 
political psychology as a discrete field—having been used, 
for example, to analyze Nazi propaganda during World 
War II. Holsti’s (1977) classic overview of qualitative and 
quantitative content-analytic approaches in political psychol- 
ogy remains relevant today, including his examination of 
perennial validity concerns such as the logic of psychological 
inferences about communicators engaging in persuasive 
communication (pp. 133-134); the ambiguities of authorship 
in documentary sources other than interviews and press con- 
ferences (p. 134); and problems of coding (e.g., word or sym- 
bol vs. theme or sentence coding) and data analysis (e.g., 
frequency vs. contingency measures; pp. 134-137). Parallel- 
ing advances in information technology, a recent develop- 
ment has been “automated content analysis” (Dille & Young, 
2000), which “offers a less expensive, quicker, and more reli- 
able alternative to commissioning graduate students to pore 
over and content-analyze texts” (p. 595). 

Schafer (2000) and Walker (2000) provide good overviews 
of the current state of content-analytic at-a-distance assess- 
ment, its major conceptual and methodological issues, and 
future research directions. Clearly, content analysis can be a 
useful tool for dissecting political propaganda, examining psy- 
chologically relevant images in political rhetoric, and opera- 
tionalizing important, politically relevant psychological 
constructs such as motives and conceptual or integrative com- 
plexity. However, content analysis does not offer a congenial 
frame of reference for comprehensive, clinically oriented psy- 
chological assessment procedures capable, in the words of 
Millon and Davis (2000), of capturing the patterning of person- 
ality variables “across the entire matrix of the person” (p. 65). 
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Expert Ratings 


Paul Kowert (1996) has endeavored to move beyond the con- 
tent-analytic methods (e.g., Hermann, 1980; Walker, 1990; 
Winter, 1987) that dominated political personality inquiry 
during the Cold War era, by applying Q-sort methodology to 
single-case analysis. In view of the huge role of public opin- 
ion polling, focus groups, professional speech writers, and 
political spin in contemporary politics, it seems prudent to 
find alternatives to speeches and interviews as primary 
sources of data for psychological evaluation. 

An important advantage of expert ratings is that it yields 
coefficients of interrater reliability. However, this is offset by 
a variety of validity issues. Specifically, ratings by presiden- 
tial scholars are fundamentally impressionistic and not based 
on systematic personality assessment (see Etheredge, 1978, 
p. 438). In some cases, high interrater reliability may merely 
reflect a convergence of conventional wisdom and shared 
myths about the personality characteristics of past presidents. 

A major disadvantage of the expert-rating approach is that 
it is uneconomical, cumbersome, and impractical. To gather 
data for his study of the impact of personality on American 
presidential leadership, Kowert (1996) solicited 42 experts 
on American presidents. Rubenzer and his associates (2002), 
for their ambitious, highly resourceful study of U.S. presi- 
dents (employing primarily Big Five personality measures), 
attempted to contact nearly 1,000 biographers, presidential 
scholars, journalists, and former White House officials, 
eventually securing the cooperation of 115 raters who collec- 
tively completed 172 assessment packets, each containing 
620 items. 

A vexing difficulty with expert ratings is that it is impracti- 
cal for studying candidates in the heat of presidential cam- 
paigns, when—as noted by Renshon (1996b, chapter 13)— 
accurate personality assessment is critical with respect to 
assessing psychological suitability for office. Historians and 
presidential scholars are not optimal sources of information 
under these conditions. Journalists who cover presidential can- 
didates are potentially more reliable, but may be too immersed 
in their own reporting to offer much assistance. A more practi- 
cal approach would be to extract personality data directly from 
the writings of journalists, presidential scholars, biographers, 
and other experts, which obviates the need for soliciting their 
active cooperation. 


Psychodiagnostic Analysis of Biographical Data 


Simonton (1990) credits Lloyd Etheredge (1978) with estab- 
lishing the diagnostic utility “of abstracting individual traits 
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immediately from biographic data” to uncover the link be- 
tween personality and political leadership (p. 677). Simonton 
(1986) argues that “biographical materials [not only] . . . sup- 
ply arich set of facts about childhood experiences and career 
development... [but] such secondary sources can offer the 
basis for personality assessments as well” (p. 150). 
Etheredge (1978) used a hybrid psychodiagnostic/expert- 
rating approach. As subjects he selected 36 U.S. presidents, 
secretaries of state, and presidential advisors who served be- 
tween 1898 and 1968 and “assessed personality traits by 
searching scholarly works, insiders’ accounts, biographies, 
and autobiographies” of his subjects (p. 437). Specifically, 
Etheredge excerpted passages relevant to two dimensions: 
dominance-submission and introversion-extroversion. He 
deleted explicit information and cues regarding the identity 
of the political figures and then rated them on the two per- 
sonality dimensions of interest, along with two independent 
judges who were unaware of the subjects’ identities. 
Etheredge (1978), in commenting on “troublesome 
methodological issues” in such “second-hand assessment of 
historical figures,” raises an important problem with respect 
to atheoretical trait approaches to the study of personality: 


Aman like Secretary [John Foster] Dulles could be dominant over 
his subordinates yet deferential to a superior. This social context 
must be standardized explicitly. I chose to assess dominance by 
assessing dominance over nominal subordinates on the as- 
sumption that a person’s inner desire to dominate would be less 
inhibited and show itself more clearly in this sector of life. In ad- 
dition, since America’s use of force has often been directed 
against smaller countries, I felt this was the most relevant ten- 
dency of international behavior that would generalize. (p. 437) 


Etheredge’s concerns highlight the indispensability of sys- 
tematic import in personality-in-politics theorizing. Theory- 
driven conceptualization safeguards the psychodiagnostician 
against several pitfalls in Etheredge’s reasoning. Most impor- 
tant, in spuriously identifying a problem where none in fact 
existed, Etheredge introduced troubling confounds. The 
pattern that Etheredge observed with respect to Secretary 
Dulles transparently conveys a prototypical instance of the 
distinctive interpersonal conduct of highly conscientious (or 
compulsive) personalities. In stark contrast, highly dominant 
personalities consistently assert themselves in relation to 
both superiors and subordinates. 

In lacking a prior personality taxonomy and proceeding 
atheoretically, Etheredge missed an important, politically 
relevant distinction with respect to dominance. Clearly, a 
purely dimensional scale can obscure important distinctions 
among disparate personality types. In short, dimensional 


prominence provides a necessary but insufficient basis for 
personality assessment; it must be complemented by categor- 
ical distinctiveness—in other words, a comprehensive theory 
of types. 

This concern with categorical distinctiveness is reflected in 
the work of Lyons (1997), who used the Myers-Briggs Type 
Indicator (MBTI; Myers & McCaulley, 1985) as a frame of 
reference for systematically extracting data from secondary- 
source biographies to construct a typological profile of U.S. 
president Bill Clinton, which he then used as a framework for 
analyzing President Clinton’s leadership style. However, in 
applying the Myers-Briggs model qualitatively, Lyons’s ap- 
proach is somewhat impressionistic, lacking the empirical 
basis essential for assessing dimensional prominence and the 
nomothetic focus necessary for comparative study. 

A noteworthy aspect of Lyons’s method is that he used one 
set of biographies, predating Bill Clinton’s election as presi- 
dent, for extracting personality data and another set, focusing 
on the Clinton presidency, for inferring leadership style (see 
Lyons, 1997, p. 799). This is consistent with the solution 
implied in Greenstein’s (1992) critique that 


single-case and typological studies alike make inferences about 
the inner quality of human beings . . . from outer manifestations— 
their past and present environments . . . and the pattern over time 
of their political responses. . . . They then use those inferred con- 
structs to account for the same kind of phenomena from which 
they were inferred—responses in situational contexts. The danger 
of circularity is obvious, but tautology can be avoided by 
reconstructing personality from some response patterns and using 
the reconstruction to explain others. (pp. 120-121) 


Greenstein’s point is valid insofar as it highlights the in- 
herent danger of pseudoexplanations of leadership behaviors 
in terms of mere diagnostic labels. However, Lyons’s ap- 
proach seems overly reductionistic and risks reifying the 
scientific method. At the operational level, it may be useful to 
view personality as the independent variable and leadership 
as the dependent variable—as if they were causally related. 
Conceptually, however, the relationship is fundamentally cor- 
relational. The fallacy involved in construing personality and 
leadership as hypothetical cause and effect, respectively, is 
akin to the so-called third-variable problem in correlational 
studies: Rather than manifest personality properties (x) caus- 
ing observed leadership style (y), both variables likely 
express a common latent structure (z); to paraphrase Millon 
(1996), the “opaque or veiled inner traits” undergirding the 
“surface reality” (p. 4) of both observed variables. 

Millon’s system offers abundant prospects for psychodi- 
agnostic analysis of biographical data. Several personality in- 
ventories have been developed to assess personality from a 
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Millonian perspective. Best known among these is the widely 
used Millon Clinical Multiaxial Inventory-II (MCMI-II; 
Millon, Davis, & Millon, 1996), a standard clinical diagnos- 
tic tool employed worldwide. The Millon Index of Personal- 
ity Styles (MIPS; Millon, 1994a) was developed to assess 
and classify personality in nonclinical (e.g., corporate) set- 
tings. Similarly, Strack (1991) developed the Personality 
Adjective Check List (PACL) for gauging normal personality 
styles. Oldham and Morris, in their trade book, The New Per- 
sonality Self-Portrait (1995), offer a self-administered instru- 
ment congruent with Millon’s model. Immelman (1999; 
Immelman & Steinberg, 1999) adapted the Millon Inventory 
of Diagnostic Criteria (MIDC) from Millon’s work, specifi- 
cally for the assessment of personality in politics. 
Immelman (1998, 2002) uses the MIDC to synthesize, 
transform, and systematize diagnostically relevant informa- 
tion collected from the literature on political figures (primar- 
ily biographical sources and media reports) into Millon’s 
(1990) four data levels (behavioral, phenomenological, in- 
trapsychic, and biophysical). The next section outlines the 
Millonian approach to political personality assessment. 


A Theory-Driven Psychodiagnostic 
Assessment Methodology 


Favoring the more systematic, quantitative, nomothetic 
approach advocated by Simonton (1986, 1988, 1990), 
Immelman (1993, 1998, 2002) adapted Millon’s model of 
personality (1986, 1990, 1994a, 1996; Millon & Davis, 2000; 
Millon & Everly, 1985) for the indirect assessment of person- 
ality in politics. Immelman’s (1999) approach is equivalent to 
Simonton’s (1986, 1988) in that it quantifies, reduces, and or- 
ganizes qualitative data extracted from the public record. It is 
dedicated to quantitative measurement, but unlike the cur- 
rently popular five-factor model, which is atheoretical, the 
Millonian approach is theory driven. The assessment method- 
ology yields a personality profile derived from clinical analy- 
sis of diagnostically relevant content in biographical 
materials and media reports, which provides an empirical 
basis for predicting the subject’s political performance and 
policy orientation Immelman, 1998). 


Sources of Data 


Immelman (1998, 1999, 2002) gathers diagnostic informa- 
tion pertaining to the personal and public lives of political 
figures from a variety of published materials, selected with a 
view to securing broadly representative data sets. Pertinent 
selection criteria include comprehensiveness of scope (e.g., 
coverage of developmental history as well as political 


career), inclusiveness of literary genre (e.g., biography, auto- 
biography, scholarly analysis, and media reports), and the 
writer’s perspective (e.g., a balance between admiring and 
critical accounts). 


Personality Inventory 


Greenstein (1992) criticizes analysts who “categorize their 
subjects without providing the detailed criteria and justifi- 
cations for doing so” (p. 120). In Immelman’s (1999) 
approach, the diagnostic criteria are documented by means of 
a structured assessment instrument, the second edition of the 
MIDC (Immelman & Steinberg, 1999), which was compiled 
and adapted from Millon’s (1990, 1996; Millon & Everly, 
1985) prototypal features and diagnostic criteria for normal 
personality styles and their pathological variants. The justifi- 
cation for classification decisions is provided by documenta- 
tion from independent biographical sources. The Millon 
Inventory of Diagnostic Criteria Manual (Immelman, 1999) 
describes the construction, administration, scoring, and inter- 
pretation of the MIDC. The 12 MIDC scales (see Immelman, 
1999, 2002, for the full MIDC taxonomy) correspond to 
major personality patterns posited by Millon (e.g., 1994a, 
1996) and are coordinated with the normal personality styles 
described by Oldham and Morris (1995) and Strack (1997). 


Diagnostic Procedure 


The diagnostic procedure can be summarized as a three-part 
process: first, an analysis phase (data collection) in which 
source materials are reviewed and analyzed to extract and 
code diagnostically relevant psychobiographical content; 
second, a synthesis phase (scoring and interpretation) in 
which the unifying framework provided by the MIDC proto- 
typal features, keyed for attribute domain and personality 
pattern, is employed to classify the diagnostically relevant in- 
formation extracted in phase 1; and finally, an evaluation 
phase (inference) in which theoretically grounded descrip- 
tions, explanations, inferences, and predictions are extrapo- 
lated from Millon’s theory of personality, based on the 
personality profile constructed in phase 2 (Immelman, 1998, 
1999, 2002). 


SITUATIONAL VARIABLES, EXPERIENTIAL 
FILTERS, AND POLITICAL PERFORMANCE 


Greenstein (1992) cautions against “the psychologizing and 
clinical fallacies” of explaining behavior in terms of person- 
ality while ignoring situational determinants (p. 121). This, 
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of course, is simply the familiar fundamental attribution error 
(Ross, 1977). Clearly, a comprehensive model of personality 
in politics should account for the impact of situational vari- 
ables and the cultural context on political performance and 
recognize that certain personal characteristics (e.g., training 
and experience) serve as filters for the political expression of 
personality. 

The best known integrative framework for political psy- 
chology is the conceptual map developed by M. Brewster 
Smith (1968), which illustrates interactions among distal and 
proximate social antecedents, the social environment, the im- 
mediate situation, personality processes and dispositions of 
political actors, and political behavior. Smith’s conceptual 
map has been exhaustively detailed in the political psychol- 
ogy literature and will not be recapitulated here. The reader is 
referred to Smith (1968, 1973), Greenstein (1969, pp. 25-31; 
cf. Greenstein’s 1992, pp. 114-116, reformulation), and 
Stone and Schaffner (1988, pp. 32-43). 


Filter Variables That Modulate the Impact 
of Personality on Political Performance 


An important aspect of Hermann’s (1980, 1987) model of 
personality in politics is that it stipulates not only the condi- 
tions under which personal characteristics will most directly 
influence political behavior (e.g., the wide decision latitude 
of leaders in authoritarian regimes), but also specific filter 
variables that modulate the impact of personality on political 
performance. A high-level political leader’s training, experi- 
ence, or expertise has “a dampening effect” on the impact of 
personal characteristics on government behavior because it 
increases the range or repertoire of policy-relevant, role- 
related behaviors available to the leader (Hermann, 1987, 
p. 166). Sensitivity to the environment similarly inhibits the 
impact of personality in politics. According to Hermann 
(1987), “the more sensitive the leader is to cues from his po- 
litical environment, the more likely other types of factors are 
to intervene in this relationship” (p. 166). Hermann’s em- 
ployment of this particular variable as a filter is problematic 
in that social responsiveness is in essence a personality trait. 
Finally, interest in foreign affairs (or in any aspect of politics 
for that matter, depending on the political domain of interest) 
“acts aS a motivating force” (Hermann, 1980, p. 13); it 
“enhances the effect of a leader’s [personal] orientation on 
government policy” by increasing his or her participation in 
the decision-making process and restricting the delegation of 
authority in the political domain of interest (Hermann, 1987, 
p. 166). 

It is worth noting that Renshon’s construal of “skills and 
talents” that mediate the relationship between character and 


political performance (see Renshon, 1996a, p. 47; 1996b, 
pp. 194-199), stripped of its surplus Kohutian self-psycho- 
logical significance, is not incompatible with Hermann’s no- 
tion of experiential filters. 


Systematic Import in a Generative Theory of 
Personality and Political Performance 


In his introduction to a special issue of the journal Leadership 
Quarterly devoted to political leadership, guest editor Dean 
Keith Simonton (1998) asserted that “political leadership has 
received inadequate attention by researchers who specialize 
in the study of leadership” (p. 239). To highlight the dispro- 
portionate focus of leadership research on small problem- 
solving groups, Simonton noted that a recent edition of the 
classic Bass & Stogdill’s Handbook of Leadership (Bass, 
1990) dispensed with the topic of political leadership in only 
four pages. 

Hermann (1986) demarcated the requisite scope of inquiry 
by specifying five ingredients necessary for understanding 
political leadership: 


(1) the leader’s personality and background, as well as the [lead- 
ership] recruitment process . . .; (2) the characteristics of 
the groups and individuals whom the leader is leading; (3) the 
nature of the relationship between the leader and those he leads; 
(4) the context or setting in which the leadership is taking place; 
and (5) the outcomes of interactions between the leader and 


those led in specific situations. (p. 169) 


Clearly, Hermann accords personality a prominent place 
in the study of political leadership. She elaborates by speci- 
fying seven personal characteristics that influence political 
leadership: (a) the leader’s basic political beliefs, which in- 
fluence “the kinds of goals and strategies the leader will urge 
on his [or her] political unit’; (b) the leader’s political style, 
which contributes to the structure and function of the politi- 
cal unit; (c) the leader’s motivation for seeking a political 
leadership position, which shapes “the general focus of at- 
tention of the leader’s behavior’’; (d) the leader’s reaction to 
stress and pressure, which has a bearing on the kinds of is- 
sues prone “to cause problems for the leader and how detri- 
mental and pervasive stress is likely to be”; (e) the manner 
in which the leader was first recruited into a political leader- 
ship position, which is instrumental in determining “how 
free of political debts and obligations” he or she will be and 
predicts “the rhetoric and practices” that the leader will tend 
to revert to; (f) the leader’s previous political experience, 
which signifies how qualified he or she is for the position 
and “what strategies and styles have paid off for the leader” 
over time; and (g) the unique generational experiences of the 
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leader upon embarking on a political career, in terms of the 
prevailing political climate that helped “shape the norms and 
beliefs” of the leader and his or her constituents (pp. 173- 
180). 

Developing a comprehensive model of political leadership 
is beyond the scope of the present endeavor, which is dedi- 
cated primarily to mapping out a generative conceptual 
framework and methodology for studying personality in pol- 
itics. Nonetheless, there is heuristic value in broadly stipulat- 
ing the major tenets for an evolving theory of political 
leadership synergistically superimposed upon a comprehen- 
sive, generative model of personality in politics. Of central 
relevance in this regard are Hermann’s (1986) first two 
personal characteristics surmised to influence political lead- 
ership: political beliefs impinging on a leader’s goals or 
strategies, and stylistic elements that fashion the structural 
and functional attributes of political units. These core charac- 
teristics are important signposts for a generative theory of 
personality and political performance compliant with 
Hempel’s (1965) canon of systematic import. 

To this end, Millon’s evolutionary model of personality 
provides a practical point of departure. David Buss (1999) 
has bluntly asserted that “theories of personality inconsistent 
with evolutionary principles stand little or no chance of 
being correct” (p. 52). Paralleling Millon’s (1996, chapter 5) 
construal of a personologically based evolutionary model 
of psychotherapeutic intervention, an applied personologic 
model of leadership can be construed as encompassing both 
strategic and tactical modalities. From this frame of refer- 
ence, strategic dimensions of political leadership would 
consist of generalized, personality-based leadership orienta- 
tions, including higher-order political aims and long-term 
policy goals and preferences, whereas tactical (stylistic) 
modalities of political leadership would consist of more con- 
crete, focal leadership objectives and political maneuvers, 
typically dictated by circumstances but shaped both by the 
leader’s underlying structural and functional personality 
attributes and by his or her higher order strategic aims and 
goals. The distinction between strategic and tactical modali- 
ties of political performance is equivalent to the distinction 
between philosophical and instrumental beliefs in George’s 
(1969) operational code construct. Philosophical (epistemo- 
logical) beliefs include a leader’s “assumptions and 
premises” about “the fundamental nature of politics” and 
“the nature of political conflict,” whereas instrumental 
beliefs relate to “ends-means relationships in the context of 
political action” (p. 199). When reconceptualized in evolu- 
tionary terms, this general perspective provides a heuristic 
basis for an emergent personological interpretation of 
political performance. 


AN EVOLUTIONARY MODEL OF PERSONALITY 
AND POLITICAL PERFORMANCE: 
THE STRATEGIC MODALITIES 


Paralleling the conceptual foundations of his personological 
model, Millon’s (1996, chapter 5) strategic modalities of ap- 
plied psychological intervention are derived from three uni- 
versal, interacting domains or spheres of evolutionary and 
ecological principles (1990; Millon, this volume): existence 
(the pain-pleasure polarity), adaptation (the passive-active 
polarity), and replication (the other-self polarity). A practical 
operationalization of these three polarities is provided by the 
Millon Index of Personality Styles (MIPS; Millon, 1994a; 
cf. Millon, this volume), which assesses them in accordance 
with six “motivating aims”: life enhancement (pleasure seek- 
ing) versus life preservation (pain avoidance), ecologic mod- 
ification (active) versus ecologic accommodation (passive), 
and reproductive propagation (self-individuating) versus 
reproductive nurturance (other-nurturing). 

The MIPS also assesses four “cognitive modes,” or 
predilections of abstraction, consonant with Carl Jung’s 
(1921/1971) theory of types. Unlike the three universal moti- 
vating aims, the cognitive modes represent a distinctly 
human sphere of functioning and were thus redundant with 
respect to deriving Millon’s (1990, 1996) original taxonomy 
of adaptive and maladaptive personality styles from evolu- 
tionary ecology. However, precisely by virtue of the fact that 
abstraction “concerns the emergence of uniquely human 
competencies that foster anticipatory planning and reasoned 
decision making” (Millon, 1999, pp. 442-443; Millon, this 
volume), the cognitive modes are critical with respect to 
deducing a synergistic, personological model of political per- 
formance. Moreover, the four distinctly human cognitive 
propensities will likely be at the forefront of future advances 
in Millon’s personality system, judging from his current con- 
viction that predilections of abstraction, the most recent stage 
of evolution, comprise “central elements in personologic 
derivations” (Millon, this volume). 

It is noteworthy that in terms of evolutionary theory, 
Osgood’s (Osgood, Suci, & Tannenbaum, 1957) three seman- 
tic differential dimensions, namely evaluation (good—bad), 
potency (strong-weak), and activity (active-passive), can 
be conceptually linked to, respectively, Millon’s (1990) 
pleasure-pain, self-other, and active-passive polarities. 


Aims of Existence: The Pain-Pleasure Polarity 


The two-dimensional (i.e., two linearly independent vectors) 
pain-pleasure polarity (Millon, 1990, pp. 51-64; Millon, this 
volume) is conceptualized in terms of, respectively, life 
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enhancement (pleasure seeking) and life preservation (pain 
avoidance): “acts that are ‘attracted’ to what we experien- 
tially record as ‘pleasurable’ events (positive reinforcers) . . . 
[versus] behaviors oriented to ‘repel’ events experientially 
characterized as ‘painful’ (negative reinforcers)” (Millon, 
this volume). 

Hypothetically, the pain-pleasure polarity could partially 
account for individual differences in ideological (e.g., 
liberal-conservative) resonance in politics. In evolutionary 
terms, liberalism can be construed as a primary concern 
“with improvement in the quality of life” and “behaviors that 
improve survival chances,” 
ance of “actions or environments that threaten to jeopardize 
survival” (Millon & Davis, 2000, p. 58). Thus construed, 
liberals seek to maximize survival by seeking pleasure (life 
enhancement, or positive reinforcement), whereas conserva- 
tives seek to maximize survival by avoiding pain (life preser- 
vation, or negative reinforcement). 

The nature of the relationship between personality and ide- 
ology has been a perennial concern in political psychology 
and remains a topic worthy of study. As early as 1907, William 
James drew a personological distinction between two ideo- 
logically relevant philosophical temperaments: optimistic, 
idealistic tender-mindedness versus pessimistic, materialistic 
tough-mindedness—a position compatible with Millon’s 
(1990) life-enhancement and life-preservation polarities. 

Evolutionary theory also may shed new light on an unre- 
solved controversy in political psychology, namely the de- 
bate over authoritarianism as fundamentally a right-wing 
phenomenon versus authoritarianism as an expression of 
both right-wing and left-wing ideological extremism. 
Eysenck (1954) proposed a two-factor theory that among its 
classifications conceptualized fascists as tough-minded con- 
servatives, communists as tough-minded radicals, and liber- 
als as tender-minded moderates. Paul Sniderman (1975) 
conjectured that low self-esteem encourages both left-wing 
and right-wing extremism. More consonant with Millon’s 
pain-pleasure polarity, Silvan Tomkins’s (1963) polarity 
theory posits that people with more humanistic, left-wing, 
ideo-affective postures (or scripts) both express and are more 
receptive to positive affect, whereas those with more norma- 
tive, right-wing scripts tend to be more responsive to nega- 
tive affect. Stone (1980; Stone & Smith, 1993), a leading 
critic of what he calls the myth of left-wing authoritarianism, 
has argued on empirical grounds that the evidence for left- 
wing authoritarianism is flawed (see Altemeyer, 1996; 
McFarland, Ageyev, & Abalakina, 1993) and that authoritar- 
janism is, in essence, a right-wing phenomenon. 

In Hermann’s conceptual scheme, a core belief compo- 
nent shaping a leader’s worldview is nationalism, which 
emphasizes “the importance of maintaining national honor 


and conservatism as an avoid- 


and dignity” (Hermann, 1987, p. 167). In Millon’s evolution- 
ary terms, the motivating aim of nationalism clearly is a life- 
preserving (pain-avoidant) orientation. 

The pain-pleasure dimension also provides evolutionary 
underpinnings for Barber’s (1972/1992) fourfold (active— 
passive x positive-negative) categorization of presidential 
character, in which positivity-negativity is described in terms 
of enjoyment derived from political office. Positive leaders 
have a generally optimistic outlook and derive pleasure from 
the duties of public office, whereas negative leadership has a 
more pessimistic tone, being oriented toward pain aversion. 

Finally, the pain—pleasure dimension suggests a possible 
evolutionary basis for the three management models pro- 
posed by Johnson (1974) and employed by George and Stern 
(1998) to classify the policy-making structures and advisory 
systems favored by recent U.S. presidents. 

Formalistic chief executives prefer “an orderly policy- 
making [sic] structure, . . . well-defined procedures, hierar- 
chical lines of communication, and a structured staff system” 
(George & Stern, 1998, p. 203). In evolutionary terms, they 
seek to preserve life by minimizing pain. 

Competitive chief executives encourage “more open and 
uninhibited expression of diverse opinions, analysis, and ad- 
vice” and tolerate or encourage “organizational ambiguity, 
overlapping jurisdictions, and multiple channels of commu- 
nication to and from the president” (George & Stern, 1998, 
p. 203). In evolutionary terms, they seek to enhance life by 
maximizing pleasure. 

Collegial chief executives attempt to benefit from the ad- 
vantages of both the competitive and formalistic approaches 
while avoiding their pitfalls. Thus, they strive for “diversity 
and competition in the policymaking system,” balanced by 
“encouraging cabinet officers and advisers to identify at least 
partly with the presidential perspective” and “encouraging 
collegial participation” (George & Stern, 1998, p. 203). In 
evolutionary terms, collegial executives are intermediate on 
both the pleasure-seeking and pain-avoidant polarities. 

The systematic import of a generative theory is implicit 
in the suggestion that Johnson’s (1974) management model 
fails to account for at least two additional executive styles: com- 
plex types high on both the pleasure-seeking and pain-avoidant 
polarities, and undifferentiated types low on both valences. 


Modes of Adaptation: The Passive-Active Polarity 


The passive-active polarity (Millon, 1990, pp. 64-77; Millon, 
this volume) is conceptualized in terms of ecologic modifi- 
cation (active) and ecologic accommodation (passive); that is, 
“whether initiative is taken in altering and shaping life’s 
events or whether behaviors are reactive to and accommodate 
those events” (Millon, this volume). 
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The passive—active dimension provides evolutionary un- 
derpinnings for Barber’s (1972/1992) fourfold (active—passive 
x positive-negative) categorization of presidential character, 
in which activity-passivity is described in terms of energy in- 
vested in political office. In evolutionary terms, a passive 
orientation can be construed as “atendency to accommodate to 
a given ecological niche and accept what the environment 
offers,” whereas an active orientation can be construed as “a 
tendency to modify or intervene in the environment, thereby 
adapting it to oneself’ (Millon & Davis, 2000, p. 59). 

The passive-active dimension also provides an evolu- 
tionary basis for Etheredge’s (1978) fourfold (high-low 
dominance x introversion-extroversion) classification of 
personality-based differences in foreign-policy operating 
style and role orientation. High-dominance introverts (bloc 
or excluding leaders such as Woodrow Wilson and Herbert 
Hoover) actively seek to reshape the world, typically by 
means of containment policies or by tenaciously advancing 
a personal vision. High-dominance extraverts (world or 
integrating leaders such as Theodore Roosevelt, Franklin D. 
Roosevelt, John F. Kennedy, and Lyndon B. Johnson) ac- 
tively seek to reshape the world through advocacy and prag- 
matic leadership on a wide range of foreign-policy fronts. 
Low-dominance introverts (maintainers such as Calvin 
Coolidge) tend to persevere with the existing order, pas- 
sively pursuing a foreign policy that amounts to “a holding 
action for the status quo.” Low-dominance extraverts (con- 
ciliators such as William McKinley, William Taft, Warren 
Harding, Harry Truman, and Dwight D. Eisenhower), 
though revealing a preference for accommodating to exist- 
ing arrangements, are more flexible and open to change, 
tending “to respond to circumstances with the sympathetic 
hope that accommodations can be negotiated” (Etheredge, 
1978, pp. 449-450). 

Finally, in Hermann’s (1980, 1987) conceptual scheme, a 
core belief contributing to a leader’s worldview, along with 
nationalism, is belief in one’s own ability to control events. In 
evolutionary terms, a more efficacy-oriented, internal locus 
of control implies an active-modifying motivating aim, in 
contrast to a more external locus of control, which suggests a 
passive-accommodating mode of adaptation. Hermann’s 
(1987) expansionist, active-independent, and influential ori- 
entations are more actively oriented, whereas her mediator- 
integrator, opportunist, and developmental orientations are 
more passively oriented. 


Strategies of Replication: The Other-Self Polarity 


The other-self polarity (Millon, 1990, pp. 77-98) is conceptu- 
alized in terms of, respectively, reproductive nurturance (other) 
and reproductive propagation (self): a nurturing tendency to 


value the needs of others versus an individuating self- 
orientation that seeks to realize personal potentials before at- 
tending to the needs of others (Millon, 1994a, p. 6; Millon, this 
volume). 

In political psychology, three social motives (which in 
Hermann’s conceptual scheme are postulated to contribute to 
a leader’s worldview) are regarded as playing a key role in 
leader performance: need for power, need for achievement, 
and need for affiliation (Winter, 1987, 1998). In evolutionary 
terms, the need for power, involving “the desire to control, 
influence, or have an impact on other persons or groups” 
(Hermann, 1987, p. 167), suggests a self-individuating repli- 
cating strategy, as does the need for achievement, which 
involves “a concern for excellence” and personal accomplish- 
ment (Winter, 1998, p. 369). Conversely, the need for 
affiliation, reflecting “concern for establishing, maintaining, 
or restoring warm and friendly relations with other persons or 
groups” (Hermann, 1987, p. 167), suggests an other-nurturing 
replicating strategy. Hermann’s (1987) expansionist, active- 
independent, and influential orientations are more self- 
oriented, whereas her mediator-integrator, opportunist, and 
developmental orientations are more other-oriented. 

Hermann (1980) also posits two key elements of interper- 
sonal style that, in conjunction with decision style, shape a 
leader’s personal political style: distrust of others and task 
orientation (see Hermann, 1987, pp. 163, 167). In evolution- 
ary terms, the frust-distrust and task-relationship dimen- 
sions of leadership are easily reconceptualized as surface 
manifestations of the other-self bipolarity. 

The two key elements of decision style in Hermann’s (1980) 
framework are conceptual complexity and self-confidence, 
which she construes (following Ziller, Stone, Jackson, & 
Terbovic, 1977), as jointly determinative of “how ideological 
or pragmatic a political leader will be” (Hermann, 1987, 
p. 164). Ziller (1973) developed a social-psychological theory 
of personality that examines two components of the self- 
concept—self-esteem and complexity of the self-concept—in 
the context of responsiveness to the views of others. Ziller et al. 
(1977) conducted a series of important studies investigating 
the effects of the four self-other orientations (high/low self- 
esteem x high/low self-complexity) on political behavior. 
They found that, in terms of political behavior, persons with 
high self-esteem and high self-complexity (apoliticals) “have 
difficulty being responsive” to others; persons with low self- 
esteem and high self-complexity (pragmatists) “are quite re- 
sponsive” to the opinions of others; persons with high 
self-esteem and low self-complexity (ideologues) “are gener- 
ally nonresponsive” to the opinions of others; and persons with 
low self-esteem and low self-complexity (an indeterminate 
type) “are highly responsive within a narrow range of social 
stimuli” (Ziller et al., 1977, pp. 179-180). According to Ziller 
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and his coworkers, low self-esteem/high self-complexity prag- 
matists and high self-esteem—low self-complexity ideologues 
“parallel the two leadership roles which have been observed in 
small groups, the task role (ideologue) and the socio-emotional 
role (pragmatist)” (p. 193). 

Stone and Baril (1979), elaborating on the findings of 
Ziller et al. (1977), used self—other orientation as a concep- 
tual basis for postulating two distinctive political prototypes, 
each having a different motivational base. The pragmatist— 
akin to Barber’s (1965) active-negative advertiser—is moti- 
vated by power seeking to compensate for low self-esteem 
(as anticipated by Lasswell, 1948), being driven by self- 
enhancement and self-promotion. The second political per- 
sonality type, the ideologue—akin to Barber’s (1965) 
active-positive lawmaker—is more other oriented, appar- 
ently having a sincere interest in good legislation (defined 
as either pursuing ideological goals or as serving a 
constituency). 

Stone and Baril’s (1979) construal of self- and other- 
oriented political personality types, in concert with Barber’s 
(1965, 1972/1992) scheme, lends empirical and theoretical 
support for the utility of Millon’s (1990) other-self polarity 
in an overarching theory of political personality and perfor- 
mance. In addition, Ziller et al.’s (1977) explication of four 
self—other orientations in relation to social responsiveness 
offers a conceptual substrate for Hermann’s (1980, 1987) 
notion of “sensitivity to the [political] environment” as a fil- 
ter for modulating the influence of personal characteristics on 
political behavior. 


Predilections of Abstraction: The Cognitive Polarities 


The cognitive modes of abstraction (Millon, 1990, pp. 42-43, 
1994a, pp. 3-4, 6-7, 21-27), which encompass “the sources 
employed to gather knowledge about the experience of life and 
the manner in which this information is gathered and trans- 
formed” (Millon, this volume), are conceptualized in terms of 
four polarities subserving two superordinate functions, 
namely information sourcing and transformational process- 
ing of cognitive data: 


1. The external-internal orientation polarity involves extra- 
ceptive (extraversing) versus intraceptive (introversing) 
modes of information gathering or knowledge sourcing. 

2. The tangible-intangible disposition polarity entails realis- 
tic (sensory, concrete) versus intuitive (abstract) modes of 
attending to, selecting, and perceiving information. 

3. The ideational-emotional preference polarity pertains to 
intellective (thinking) versus affective (feeling) modes of 
information processing. 


4. The integrating-innovative bias polarity relates to assim- 
ilative (systematizing) versus imaginative (innovating) 
modes of knowledge transformation; that is, knowledge 
assimilation versus cognitive accommodation. 


Implicitly, Choiniere and Keirsey (1992) cross the 
tangible-intangible cognitive mode with the other-self moti- 
vating aim to yield a fourfold (realistic, concrete vs. intuitive, 
abstract mode of thought and speech X moral sanctioning vs. 
pragmatic utilitarian value orientation) categorization of U.S. 
presidents as Guardians (concrete sanctioners), Idealists 
(abstract sanctioners), Artisans (concrete utilitarians), and Ra- 
tionals (abstract utilitarians; pp. 8-10; see also pp. 598-602). 
Furthermore, Choiniere and Keirsey’s (1992) model of “pres- 
idential temperament” distinguishes two variants of each 
type—directing and reporting (pp. 11—12)—a distinction that 
appears to be a surface manifestation of Millon’s (1990) 
active-passive polarity. Thus, when reconceptualized in terms 
of Millon’s (1994a) three universal evolutionary motivating 
aims and four cognitive modes, there are eight distinct leader- 
ship styles: active-realist utilitarians (Operator Artisans such 
as Theodore Roosevelt, Franklin D Roosevelt, John F. 
Kennedy, and Lyndon B. Johnson); passive-realist utilitarians 
(Player Artisans such as Warren Harding and Ronald Reagan); 
active-intuitive utilitarians (Organizer Rationals such as 
Herbert Hoover and Dwight D. Eisenhower); passive-intuitive 
utilitarians (Engineer Rationals such as Thomas Jefferson 
and Abraham Lincoln); active-realist sanctioners (Monitor 
Guardians such as George Washington, Woodrow Wilson, 
Calvin Coolidge, Harry Truman, Jimmy Carter, and Richard 
Nixon); passive-realist sanctioners (Conservator Guardians 
such as William McKinley, William Taft, Gerald Ford, and 
George H. W. Bush); active-intuitive sanctioners (Mentor 
Idealists); and passive-intuitive sanctioners (Advocate Ideal- 
ists). There have been no Idealist U.S. presidents; however, 
Choiniere and Keirsey (1992) present Mohandas Gandhi and 
Eleanor Roosevelt as prototypes of, respectively, the Mentor 
and Advocate Idealist. 


AN EVOLUTIONARY MODEL OF PERSONALITY 
AND POLITICAL PERFORMANCE: 
THE TACTICAL MODALITIES 


Millon’s (1996, chapter 5) tactical modalities of applied psy- 
chological intervention are conceptually anchored to his 
eight structural and functional personality domains, encom- 
passing the behavioral, phenomenological, intrapsychic, and 
biophysical levels of analysis. Millon (1996) notes that the 
eight domains “are not themselves the parts of personality, 


but do serve as a means of classifying the parts or constructs 
of personality” (p. 183). There is heuristic value in employ- 
ing a parallel organizational scheme to classify the constructs 
of political performance (leadership and decision making). 
At a minimum, such a heuristic model establishes explicit 
links between the source domain of personality and the target 
domain of political performance. 


Biophysical Level 


Fundamentally, Barber’s (1972/1992) dimensions of activity— 
passivity and positive affect/negative affect constitute a 
temperamental (i.e., having a predisposition to activity and 
emotionality) construct. Thus, Barber’s construal of “presi- 
dential character” offers a congenial framework for deducing 
biophysical (temperamental and affective) modalities of 
presidential performance. 

The biophysical modality also is capable of accommodat- 
ing the notion of emotional intelligence, one of the six key 
qualities in Greenstein’s (2000) schema for describing presi- 
dential leadership style and job performance. The flawed 
presidencies of Lyndon B. Johnson, Richard Nixon, Jimmy 
Carter, and Bill Clinton all serve as stark reminders of the 
pernicious effects that failed emotional management can 
have on presidential performance. More significant, however, 
is that this modality offers a congenial framework for accom- 
modating the emerging biopolitical perspective (e.g., 
Marcus, 2001; Masters, 1989) on the psychology of politics. 


Behavioral Level 


The ubiquitous task—relationship dimension, prevalent in con- 
temporary theories of leadership (including that of Hermann, 
1986), presents a clear-cut instance of a personality-based 
leadership orientation observed at the behavioral level. The 
behavioral modality also represents the appropriate data level 
for assessing Renshon’s (1996b) “three distinct aspects of 
presidential and political leadership: mobilization, orchestra- 
tion, and consolidation” (p. 226). Three of Greenstein’s (2000) 
six stylistic and performance qualities can be assembled at the 
behavioral level of analysis: organizational capacity, effec- 
tiveness as a public communicator, and political skill. 


Phenomenological Level 


Numerous personality-based leadership traits and qualities 
converge on the phenomenological data level, including con- 
ceptual complexity (Hermann, 1974, 1987), integrative com- 
plexity (Suedfeld, 1994), cognitive style (George & Stern, 
1998), sense of efficacy and competence (George & Stern, 
1998), and judgment/decision making (Renshon, 1996b). 
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Two of Greenstein’s (2000) presidential leadership and per- 
formance qualities, namely vision (which subsumes both the 
power to inspire and consistency of viewpoint) and cognitive 
style, assemble at this data level. 


Intrapsychic Level 


Both Hermann’s (1987) trust-distrust dimension (a compo- 
nent of interpersonal style) and George and Stern’s (1998) 
orientation toward political conflict (which influences a 
leader’s choice of policy-making system) lend themselves to 
analysis at the intrapsychic data level. Indeed, numerous per- 
sonological and social-psychological perspectives relevant to 
political leadership, judgment, and decision making converge 
at the intrapsychic level, including the ego-defensive notion 
of scapegoating as a form of displaced aggression (Adorno 
etal., 1950; Hovland & Sears, 1940); the belief in a just 
world and blaming the victim (Lerner, 1970) as a form of de- 
fensive attribution; and the problem of defensive avoidance 
in political decision making (Janis & Mann, 1977). The in- 
trapsychic modality also offers a heuristic frame of reference 
for examining psychodynamic aspects of xenophobia, ethnic 
hatred, and the so-called roots of evil (Staub, 1989) as ex- 
pressed in political leadership. 


CONCLUSION 


Political psychologists recognize that political outcomes are 
governed by a multitude of factors, many of them indetermi- 
nate. Nonetheless, the study of personality in politics has 
advanced sufficiently to permit broad personality-based per- 
formance predictions and to pinpoint a political candidate’s 
specific strengths and limitations. 

A coherent psychodiagnostic framework capable of cap- 
turing the critical personological determinants of political 
performance, embedded in a broad range of attribute domains 
across the entire matrix of the person—not just the individ- 
ual’s motives, operational code, integrative complexity, or 
personality traits—is the one indispensable tool without 
which the assessment of personality in politics can neither 
prevail nor prosper. 

Although this chapter has but scratched the surface in 
breaking new ground for the construction of a generative, 
evolutionary foundation for personality-in-politics inquiry, I 
join Theodore Millon (coeditor of this volume) in reflecting 
as he did upon concluding his epoch-making Toward a New 
Personology: An Evolutionary Model (1990): 


Some may very well argue they just struggled through an au- 
thor’s need not only to impose an unnecessary order but to frame 
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its elements in an overly formalistic sequence; that I, the author, 
have forced the subject of personology into the procrustean bed 
of theoretical predilections, drawing on tangential topics of little 
or no relevance. If such a case be valid, I regret that my habit of 
seeking bridges between scientific domains has led me to cohere 
subjects best left disparate. It is hoped that this philosophic prej- 
udice, obviously inspired by a personally driven world view, will 
yet prove to have a modicum of empirical merit and theoretical 
value. (p. 177) 
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